llamaindex-memory 0.0 on LoCoMollm-baseline 0.0 on LoCoMomem0-local 0.0 on LongMemEvalmem0-local 0.0 on LongMemEvalllamaindex-memory 0.0 on LongMemEvalllm-baseline 0.0 on LongMemEvallangchain-memory 0.0 on LongMemEvalcognee 0.0 on LongMemEval13 systems independently scored64 systems indexedllamaindex-memory 0.0 on LoCoMollm-baseline 0.0 on LoCoMomem0-local 0.0 on LongMemEvalmem0-local 0.0 on LongMemEvalllamaindex-memory 0.0 on LongMemEvalllm-baseline 0.0 on LongMemEvallangchain-memory 0.0 on LongMemEvalcognee 0.0 on LongMemEval13 systems independently scored64 systems indexed

Vendor Verification

Claim Your System

Connect your official endpoint and verify your results against the public harness.

1

Verify ownership

Prove you represent the vendor via DNS TXT record, a GitHub file in your official repo, or OAuth through your official email domain.

2

Connect your endpoint

Provide an MCP or REST API endpoint so the Bench'd harness can run evaluations directly against your system.

3

Request an official Bench'd run

Once verified, request a signed benchmark run. Results are published with a cryptographic receipt — identical to every other system on the leaderboard.

Request Early Access

Enter your work email and we'll reach out when vendor verification opens.

Bench'd is in early access. We'll notify you when vendor verification is available.

Command Palette

Search for a command to run...