MCP LLM Eval
Local MCP server that packages LLM evaluation gates as reusable CI/CD primitives. Run datasets against models, score with LLM-as-judge, enforce quality thresholds.
MCP: unverified
Transport: stdio
Auth: api-key
Endpoint: uvx mcp-llm-eval
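
Since the server runs locally over stdio and is launched with `uvx mcp-llm-eval`, an MCP client needs a launch entry for it. The sketch below builds one in the `mcpServers` layout that several MCP clients accept. The environment variable name `LLM_API_KEY` and the placeholder value are assumptions for illustration; the listing only states that auth is by API key, so check the server's own documentation for the variable it actually reads.

```python
import json


def build_client_config(api_key_env: str = "LLM_API_KEY") -> dict:
    """Return a client config entry that launches the server over stdio.

    `api_key_env` is a hypothetical variable name, not taken from the listing.
    """
    return {
        "mcpServers": {
            "mcp-llm-eval": {
                # Command and argument come from the listing's Endpoint field.
                "command": "uvx",
                "args": ["mcp-llm-eval"],
                # Placeholder only; never commit a real key to version control.
                "env": {api_key_env: "<your-api-key>"},
            }
        }
    }


def render_config() -> str:
    """Serialize the config as indented JSON for pasting into a client file."""
    return json.dumps(build_client_config(), indent=2)
```

Calling `print(render_config())` emits JSON that can be merged into a client's configuration file, assuming that client uses the `mcpServers` layout.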
01 Enforce LLM output quality thresholds in CI/CD pipelines
02 Run eval datasets against multiple models and compare
03 Use LLM-as-judge scoring for automated quality checks
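
The three use cases above share one pattern: score each model's outputs on a dataset, average the scores, and fail the pipeline when the average falls below a threshold. The following is a minimal, self-contained sketch of that pattern and is not the server's actual API. All names (`Example`, `GateResult`, `evaluate`, `exit_code`) are hypothetical, and `exact_match_judge` is a deterministic stand-in for an LLM-as-judge call so the example runs offline.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Callable, Sequence


@dataclass(frozen=True)
class Example:
    prompt: str
    reference: str


@dataclass(frozen=True)
class GateResult:
    model: str
    mean_score: float
    passed: bool


Model = Callable[[str], str]            # prompt -> model output
Judge = Callable[[Example, str], float]  # (example, output) -> score in [0, 1]


def exact_match_judge(example: Example, output: str) -> float:
    """Stand-in judge: 1.0 on a case-insensitive exact match, else 0.0.

    A real LLM-as-judge would call a model here and parse a score.
    """
    same = output.strip().lower() == example.reference.strip().lower()
    return 1.0 if same else 0.0


def evaluate(name: str, model: Model, dataset: Sequence[Example],
             judge: Judge, threshold: float) -> GateResult:
    """Run one model over the dataset and compare its mean score to the threshold."""
    if not dataset:
        raise ValueError("dataset must contain at least one example")
    scores = [judge(ex, model(ex.prompt)) for ex in dataset]
    avg = mean(scores)
    return GateResult(model=name, mean_score=avg, passed=avg >= threshold)


def exit_code(results: Sequence[GateResult]) -> int:
    """Return 0 if every model passed the gate, 1 otherwise (CI convention)."""
    return 0 if all(r.passed for r in results) else 1
```

In a CI job, a script would call `evaluate` once per model, print the `GateResult` values for comparison, and pass `exit_code(results)` to `sys.exit` so a failing gate blocks the pipeline.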
Tags: evaluation, llm, ci-cd, quality-gates, testing
Machine-readable: /api/servers.json  ·  JSON-LD schema embedded in <head>