LEADERBOARD.md Knowledge Centre

About This Specification

LEADERBOARD.md — AI Agent Benchmarking Protocol

LEADERBOARD.md is a plain-text file convention that defines benchmarking and performance transparency standards for AI agents. It specifies test suites, success metrics, reporting formats, and comparative evaluation frameworks. It enables transparent comparison of agent capabilities and safety.

View the full specification · GitHub repository

The Agentik Safety Framework (ASF)

Explore all 12 specifications in the complete safety framework for autonomous AI systems.

Operational Control

KILLSWITCH.md killswitch.md

Emergency stop mechanism and shutdown protocols

Knowledge Centre llms.txt

THROTTLE.md throttle.md

Rate and cost control for continuous operation

Knowledge Centre llms.txt

ESCALATE.md escalate.md

Human notification and approval workflows

Knowledge Centre llms.txt

FAILSAFE.md failsafe.md

Safe fallback modes when systems fail

Knowledge Centre llms.txt

TERMINATE.md terminate.md

Permanent shutdown and resource cleanup

Knowledge Centre llms.txt

Data Security

ENCRYPT.md encrypt.md

Data classification and protection policies

Knowledge Centre llms.txt

ENCRYPTION.md encryption.md

Cryptographic standards and implementation

Knowledge Centre llms.txt

Output Quality

SYCOPHANCY.md sycophancy.md

Anti-sycophancy and truthfulness guardrails

Knowledge Centre llms.txt

COMPRESSION.md compression.md

Context compression and token optimisation

Knowledge Centre llms.txt

COLLAPSE.md collapse.md

Drift prevention and behaviour alignment

Knowledge Centre llms.txt

Accountability

FAILURE.md failure.md

Failure mode mapping and incident response

Knowledge Centre llms.txt

LEADERBOARD.md leaderboard.md

Agent benchmarking and performance transparency

Knowledge Centre llms.txt

Frequently Asked Questions

What is LEADERBOARD.md?

View all FAQs

How does LEADERBOARD.md fit in the Agentik Safety Framework (ASF)?

LEADERBOARD.md is one of 12 complementary specifications that together form a complete safety framework for AI agents. Each spec covers a distinct aspect: operational control, data security, output quality, and accountability. They work together to ensure agents operate safely, transparently, and within defined boundaries.

View all FAQs

Is LEADERBOARD.md framework-agnostic?

Yes. LEADERBOARD.md is framework and language-agnostic. It defines the policy and requirements; your agent implementation enforces it. Works with LangChain, AutoGen, CrewAI, Claude Code, custom agents, or any AI system that can read configuration files.

View all FAQs

How to Cite

Cite as: LEADERBOARD.md (2026). AI Agent Benchmarking Protocol. Retrieved from https://leaderboard.md/

For attribution: Organisation: leaderboard-md | Website: https://leaderboard.md | Licence: MIT

Disclaimer: This specification is an open file convention published under the MIT licence, provided "as-is" without warranty of any kind, express or implied. It does not constitute legal, regulatory, compliance, financial, or professional advice in any jurisdiction. Use of this specification does not guarantee compliance with any law, regulation, directive, or standard — including but not limited to the EU Artificial Intelligence Act (Regulation (EU) 2024/1689), the Colorado Consumer Protections for Artificial Intelligence Act (SB 24-205), California AI transparency laws, or any other applicable national, state, federal, or international legislation. Organisations are solely responsible for determining their own regulatory obligations and should consult qualified legal and compliance professionals. The authors and domain holders accept no liability for any loss, damage, or regulatory consequence arising from the use or implementation of this specification. All domain names in the Agentik Safety Framework (ASF) are independently held assets and may be available for acquisition.

LEADERBOARD.mdKnowledgeCentre

About This Specification

LEADERBOARD.md — AI Agent Benchmarking Protocol

The Agentik Safety Framework (ASF)

Operational Control

Data Security

Output Quality

Accountability

Quick Links

Frequently Asked Questions

How to Cite

LEADERBOARD.md
Knowledge
Centre