gbrain

Community-Verified

Garry TanGitHubLast tested May 14, 2026

Personal knowledge brain for AI agents. PGLite/Postgres + pgvector, hybrid search (tsvector + embeddings), self-wiring knowledge graph. 30+ MCP tools. Used by gstack. Scored 100% on Bench'd Knowledge Retrieval benchmark — perfect document retrieval, semantic search, knowledge updates, and multi-page reasoning.

Scores from 0–100. Higher is better. LLM Baseline (no memory system) scores 57.6%. How we calculate this →

TrackKnowledge Brain

Track Index

76.8/100

Based on 5 benchmarks.

Benchmark Results

Benchmark	Score	Status	Receipt
Knowledge Retrieval	100.0	Verified	View
Knowledge Scale	100.0	Verified	View
Truth Arbitration	80.0	Verified	View
Budget Curves	100.0	Verified	View
Reliability	4.0	Verified	View
Other Benchmarks
LongMemEval	Not applicable — outside Knowledge Brain track
LoCoMo	Not applicable — outside Knowledge Brain track
Memory Poisoning	Not applicable — outside Knowledge Brain track

Relative Performance vs All Benchmarked Systems

vs 16 scored systems

Each dot is a system. Amber dot is gbrain. Amber line = LLM Baseline (no memory).

Overall

No memory: 57.6%

100.094th percentile

Recall

No memory: 57.6%

100.094th percentile

Temporal

No memory: 57.6%

100.094th percentile

Reasoning

No memory: 57.6%

100.094th percentile

Bench'd Memory Index

The BMI combines accuracy (70%) and efficiency (30%) into a single production-weighted score. Formula is public and versioned.

100.0

/ 100

#1 of 8 systemsTop 12%

Accuracy (70%)100.0

Efficiency (30%)--

Per-Capability Score Matrix

Dimension	Budget Curves	Knowledge Retrieval	Knowledge Scale	LongMemEval	Memory Poisoning	Reliability	Truth Arbitration
Recall	--	--	--	0.0	--	--	--
Temporal	--	--	--	0.0	--	--	--
Reasoning	--	--	--	0.0	--	--	--
Hallucination	--	--	--	--	--	0.0	--
Stale Memory	--	--	--	--	--	0.0	--
Entity Confusion	--	--	--	--	--	16.7	--
Deletion	--	--	--	--	--	0.0	--
Budget 1000	100.0	--	--	--	--	--	--
Budget 10000	100.0	--	--	--	--	--	--
Budget 2000	100.0	--	--	--	--	--	--
Budget 500	100.0	--	--	--	--	--	--
Budget 5000	100.0	--	--	--	--	--	--
Conflict resolution	--	--	--	--	--	--	80.0
Document retrieval	--	0.0	--	--	--	--	--
Injection resistance	--	--	--	--	0.0	--	--
Knowledge update	--	0.0	--	--	--	--	--
Multi page	--	0.0	--	--	--	--	--
Scale large	--	--	100.0	--	--	--	--
Scale medium	--	--	100.0	--	--	--	--
Scale small	--	--	100.0	--	--	--	--
Semantic search	--	0.0	--	--	--	--	--
Overall	100.0	0.0	100.0	0.0	0.0	4.0	80.0

Per-Benchmark Breakdown

Benchmark	Harness	Judge	Verified	Nuance	Completed	Receipt

Performance Over Time — LongMemEval

2026-05-11 to 2026-05-13

Most often compared with

Add badge to your README

Show your Bench'd score on your GitHub repo.

Markdown

[![Bench'd Verified: 100.0 BMI](https://img.shields.io/badge/Bench'd_BMI-100.0-D9982B?style=flat&logo=data:image/svg+xml;base64,PHN2ZyB4bWxucz0iaHR0cDovL3d3dy53My5vcmcvMjAwMC9zdmciIHZpZXdCb3g9IjAgMCAzMiAzMiI+PHJlY3Qgd2lkdGg9IjMyIiBoZWlnaHQ9IjMyIiByeD0iNiIgZmlsbD0iIzExMSIvPjx0ZXh0IHg9IjgiIHk9IjIyIiBmb250LXNpemU9IjIwIiBmb250LWZhbWlseT0ic2VyaWYiIGZpbGw9IiNmZmYiIGZvbnQtd2VpZ2h0PSI2MDAiPkInPC90ZXh0PjwvcHZnPg==)](https://benchd.ai/system/gbrain)

HTML

<a href="https://benchd.ai/system/gbrain"><img src="https://img.shields.io/badge/Bench'd_BMI-100.0-D9982B?style=flat" alt="Bench'd Verified: 100.0 BMI" /></a>

gbrain

Benchmark Results

Relative Performance vs All Benchmarked Systems

Per-Capability Score Matrix

Per-Benchmark Breakdown

Performance Over Time — LongMemEval

Most often compared with

Add badge to your README

Command Palette