Pipecat is an open-source Python framework for building real-time voice and multimodal conversational agents. It orchestrates audio, video, AI services, and conversation pipelines, with support for speech-to-text, text-to-speech, and speech-to-speech.
Reach for it when you are assembling a real-time conversational agent from multiple services and need a pipeline to wire audio, video, and AI components together.
