Browse
→ AI & Models
→ MCP Bench
MCP Bench
Benchmarking framework by Accenture for evaluating LLM tool-use via MCP. End-to-end pipeline assessing how effectively models discover, select, and use tools.
MCP unverified
Integration
| Transport | stdio |
| Auth | none |
| Endpoint | python -m mcp_bench |
Use Cases
| 01 | Benchmark LLM tool-use capabilities across models |
| 02 | Evaluate tool discovery and selection accuracy |
| 03 | Compare model performance on complex real-world MCP tasks |
Tags
benchmarking evaluation tool-use llm research
Machine-readable: /api/servers.json
· JSON-LD schema embedded in <head>