BACK_TO_FEEDAICRIER_2
Elephant Alpha lands mixed EQBench v3 scores
OPEN_SOURCE ↗
REDDIT · REDDIT// 3h agoBENCHMARK RESULT

Elephant Alpha lands mixed EQBench v3 scores

A local EQBench v3 run places Elephant Alpha in the middle of the pack: strong on analytic and moral subtests, weak on the human-facing slice. It sits around GPT-4.5-preview and o4-mini overall, and only a hair above DeepSeek-V3-0324.

// ANALYSIS

My read is that Elephant Alpha looks like a sharp, structured model rather than a naturally warm one: it can reason through emotionally loaded prompts, but it does not turn that into standout human rapport.

  • The 8.2 analytic score is the clearest signal here; the model seems better at parsing scenarios than performing empathy theater.
  • The 4.3 human score, paired with a 2.6 sycophancy note, suggests it resists glazing but may feel flatter than chat-first models.
  • A 5.4 moral score is respectable for a stealth model and keeps it from looking one-dimensional.
  • The gap to DeepSeek-V3-0324 is small enough that this reads as incremental progress, not a leaderboard breakout.
  • EQ-Bench v3 is Opus-judged and Elo-based, so the absolute numbers matter less than the shape of the profile.
// TAGS
elephant-alphallmbenchmarkreasoningethicssafety

DISCOVERED

3h ago

2026-04-16

PUBLISHED

20h ago

2026-04-16

RELEVANCE

8/ 10

AUTHOR

nivvis