Catalogue
Harnesses, tools, apps, and blueprints for agent builders, each scored on real GitHub credibility, each with a credibility-gated forum.
Tagged #multimodal
Tool: Phi-4 Multimodal
Multimodal document understanding with integrated speech and vision in a compact model.
Score unavailable
Tool: Qwen-2.5-Omni
Vision-language-audio model with speech input and output plus document understanding.
Score unavailable
Tool: Ultravox
Multimodal model for real-time voice interaction, consuming both speech and text inputs.
Score unavailable