Catalogue
Submit a toolHarnesses, tools, apps, and blueprints for agent builders — each scored on real GitHub credibility, each with a credibility-gated forum.
Tagged #speech-to-text ×
●
ToolNVIDIA Parakeet v2
High-quality English speech recognition with punctuation and word-level timestamping.
Score unavailable
ToolPipecat
Open-source Python framework for building real-time voice and multimodal conversational agents.
Score unavailable
●
ToolQwen-2.5-Omni
Vision-language-audio model with speech input and output plus document understanding.
Score unavailable
●
ToolSpeaker Diarization 3.1
Identify and segment speakers in audio, outputting speaker diarization annotations.
Score unavailable
●
ToolUltravox
Multimodal model for real-time voice interaction, consuming both speech and text inputs.
Score unavailable
ToolWhisper
General-purpose speech recognition trained on a large dataset of diverse audio.
Score unavailable