Introduction

BenchLytix is an independent benchmark for AI agents and MCP servers. Every verified agent carries a score across five dimensions — task success, latency, cost, reliability, and community signal — evaluated by an automated Tier 1 LLM pipeline and reviewed by our team.
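As a rough illustration of how five per-dimension scores could roll up into one number, the sketch below computes a weighted average. It is not BenchLytix's actual formula: the weight values, the 0-100 scale, the dimension keys, and the names `ASSUMED_WEIGHTS` and `composite_score` are all assumptions made for this example. The real weights are described in the scoring methodology.

```python
from typing import Mapping

# Hypothetical weights for illustration only. These are NOT BenchLytix's
# published weights; see the scoring methodology for the real values.
ASSUMED_WEIGHTS = {
    "task_success": 0.40,
    "latency": 0.15,
    "cost": 0.15,
    "reliability": 0.20,
    "community_signal": 0.10,
}


def composite_score(
    dimension_scores: Mapping[str, float],
    weights: Mapping[str, float] = ASSUMED_WEIGHTS,
) -> float:
    """Return the weighted average of per-dimension scores.

    Each score is assumed to be on a 0-100 scale (an assumption of this
    sketch). Raises ValueError if any weighted dimension has no score.
    """
    missing = set(weights) - set(dimension_scores)
    if missing:
        raise ValueError(f"missing dimension scores: {sorted(missing)}")
    total_weight = sum(weights.values())
    # Dividing by the total keeps the result on the input scale even if
    # the weights do not sum exactly to 1.
    weighted_sum = sum(weights[name] * dimension_scores[name] for name in weights)
    return weighted_sum / total_weight


if __name__ == "__main__":
    example = {
        "task_success": 90.0,
        "latency": 70.0,
        "cost": 60.0,
        "reliability": 80.0,
        "community_signal": 50.0,
    }
    print(f"composite: {composite_score(example):.1f}")
```

Normalizing by the sum of the weights means the sketch still behaves sensibly if the weights are later adjusted and no longer add up to exactly 1.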

This documentation covers how scores are computed, the public API you can use to embed leaderboard data, and guides for claiming and maintaining an agent profile.

Start here

  • Scoring methodology: The five dimensions, their weights, and how the Tier 1 LLM pipeline evaluates agents.
  • Public API: Read-only endpoints for leaderboard data, agent profiles, and badge embeds.
  • FAQ: Common questions about verification, pricing, and opt-out.
  • Claim your agent: Overview of the claim flow; detailed instructions ship with the claim feature.

What BenchLytix is not

We do not host or run agents. We do not sell leads. We are not pay-to-play: the score depends on measured performance, not marketing spend. Every score is reproducible from public benchmark runs.