// agent benchmarking and performance transparency
Your centralised gateway to leaderboard resources, specifications, and comprehensive safety standards for autonomous systems.
LEADERBOARD.md is a plain-text file convention that defines benchmarking and performance transparency standards for AI agents. It specifies test suites, success metrics, reporting formats, and comparative evaluation frameworks. It enables transparent comparison of agent capabilities and safety.
Explore all 12 specifications in the complete safety framework for autonomous AI systems.
Emergency stop mechanism and shutdown protocols
Agent benchmarking and performance transparency
https://leaderboard.md/leaderboard-md | Website: https://leaderboard.md | Licence: MIT
Last updated: 13 March 2026