llamaindex-memory 0.0 on LoCoMollm-baseline 0.0 on LoCoMomem0-local 0.0 on LongMemEvalmem0-local 0.0 on LongMemEvalllamaindex-memory 0.0 on LongMemEvalllm-baseline 0.0 on LongMemEvallangchain-memory 0.0 on LongMemEvalcognee 0.0 on LongMemEval13 systems independently scored64 systems indexedllamaindex-memory 0.0 on LoCoMollm-baseline 0.0 on LoCoMomem0-local 0.0 on LongMemEvalmem0-local 0.0 on LongMemEvalllamaindex-memory 0.0 on LongMemEvalllm-baseline 0.0 on LongMemEvallangchain-memory 0.0 on LongMemEvalcognee 0.0 on LongMemEval13 systems independently scored64 systems indexed
Methodology
Metric v1.015 questions

Knowledge Scale

Tests how retrieval accuracy degrades as the volume of stored knowledge increases (10, 50, 100 pages).

What it measures

Scalability: does the system maintain accuracy as the knowledge base grows from small to moderate size?

How it works

  1. Run three tiers: 10 pages, 50 pages, 100 pages of content.
  2. At each tier, ingest the full corpus then query with 5 fact-retrieval questions.
  3. Score using exact match with containment fallback.
  4. Report accuracy at each tier and the degradation curve.

Scoring method

Deterministic (exact match + containment) at each tier.

Dimensions tested: recall

Purpose alignment

How this metric relates to each track (v1.0):

TrackAlignment
conversationalorthogonal
knowledge-braincore
graphcore
agent-memoryadjacent
baselinecore

Expected failure modes

  • RETRIEVAL_MISS — expected answer not in returned context
  • OVER_RETRIEVAL — returns too much context, diluting the answer
  • PARTIAL_ANSWER — finds some but not all requested information

See the full failure taxonomy for all 20+ reason codes.

Dataset source

Bench'd synthetic knowledge corpus with planted retrievable facts.

Known limitations

  • 100 pages is modest; real-world knowledge bases can be 10,000+ pages.
  • Synthetic corpus may not capture domain-specific retrieval challenges.

Stable URL: benchd.ai/methodology/metrics/knowledge-scale
This URL is referenced in signed manifests. It will not change.

Command Palette

Search for a command to run...