Mem0
Community-VerifiedMCP Endpoint:
https://mcp.mem0.ai/v1Platform for adding persistent, personalized memory to LLM applications. Provides structured memory management with automatic extraction and retrieval across user sessions.
Scores from 0–100. Higher is better. LLM Baseline (no memory system) scores 57.6%. How we calculate this →
TrackHybrid
Track IndexNo results yet
Benchmark Results
| Benchmark | Score | Status | Receipt |
|---|---|---|---|
| LongMemEval | Pending | Pending | -- |
| LoCoMo | Pending | Pending | -- |
| Reliability | Pending | Pending | -- |
| Truth Arbitration | Pending | Pending | -- |
| Memory Poisoning | Pending | Pending | -- |
| Budget Curves | Pending | Pending | -- |
| Knowledge Retrieval | Pending | Pending | -- |
| Knowledge Scale | Pending | Pending | -- |
Relative Performance vs All Benchmarked Systems
vs 16 scored systemsEach dot is a system. Amber dot is Mem0. Amber line = LLM Baseline (no memory).
Overall32.413th percentile
No memory: 57.6%gbrain
Recall32.419th percentile
No memory: 57.6%gbrain
Temporal32.431th percentile
No memory: 57.6%gbrain
Reasoning32.438th percentile
No memory: 57.6%gbrain
Bench'd Memory Index
The BMI combines accuracy (70%) and efficiency (30%) into a single production-weighted score. Formula is public and versioned.
32.4
/ 100
#6 of 8 systemsTop 75%
Accuracy (70%)32.4
Efficiency (30%)--
Per-Benchmark Breakdown
| Benchmark | Verified | Nuance |
|---|---|---|
| LongMemEval | 85.3 | 78.1 |
| LoCoMo | 82.9 | 75.8 |
Performance Over Time — LongMemEval
2026-05-11 to 2026-05-13Most often compared with
Add badge to your README
Show your Bench'd score on your GitHub repo.
Markdown
[](https://benchd.ai/system/mem0)
HTML
<a href="https://benchd.ai/system/mem0"><img src="https://img.shields.io/badge/Bench'd_BMI-32.4-D9982B?style=flat" alt="Bench'd Verified: 32.4 BMI" /></a>