Elephant Alpha lands mixed EQBench v3 scores
A local EQBench v3 run places Elephant Alpha in the middle of the pack: strong on analytic and moral subtests, weak on the human-facing slice. It sits around GPT-4.5-preview and o4-mini overall, and only a hair above DeepSeek-V3-0324.
My read is that Elephant Alpha looks like a sharp, structured model rather than a naturally warm one: it can reason through emotionally loaded prompts, but it does not turn that into standout human rapport.
- –The 8.2 analytic score is the clearest signal here; the model seems better at parsing scenarios than performing empathy theater.
- –The 4.3 human score, paired with a 2.6 sycophancy note, suggests it resists glazing but may feel flatter than chat-first models.
- –A 5.4 moral score is respectable for a stealth model and keeps it from looking one-dimensional.
- –The gap to DeepSeek-V3-0324 is small enough that this reads as incremental progress, not a leaderboard breakout.
- –EQ-Bench v3 is Opus-judged and Elo-based, so the absolute numbers matter less than the shape of the profile.
DISCOVERED
57d ago
2026-04-16
PUBLISHED
58d ago
2026-04-16
RELEVANCE
AUTHOR
nivvis