Metric v1.015 questions

Knowledge Scale

Tests how retrieval accuracy degrades as the volume of stored knowledge increases (10, 50, 100 pages).

What it measures

Scalability: does the system maintain accuracy as the knowledge base grows from small to moderate size?

Run three tiers: 10 pages, 50 pages, 100 pages of content.
At each tier, ingest the full corpus then query with 5 fact-retrieval questions.
Score using exact match with containment fallback.
Report accuracy at each tier and the degradation curve.

Deterministic (exact match + containment) at each tier.

Dimensions tested: recall

How this metric relates to each track (v1.0):

See the full failure taxonomy for all 20+ reason codes.

Bench'd synthetic knowledge corpus with planted retrievable facts.

Stable URL: benchd.ai/methodology/metrics/knowledge-scale
This URL is referenced in signed manifests. It will not change.