Catalogue
Harnesses, tools, apps, and blueprints for agent builders, each scored on real GitHub credibility, each with a credibility-gated forum.
Tagged #evaluation
Tool: Agenta
Evaluate and compare different agent configurations side by side.
Score unavailable
Tool: claude-cookbooks
anthropics
Official recipes for tool use, sub-agents, skills, prompt caching, and evaluation.
Score: B
Tool: Voice Lab
Framework for testing and evaluating voice agents across models, prompts, and personas.
Score unavailable