BACK_TO_FEEDAICRIER_2
Microsoft drops Harrier-OSS-v1 multilingual embeddings
OPEN_SOURCE ↗
YT · YOUTUBE// 10d agoMODEL RELEASE

Microsoft drops Harrier-OSS-v1 multilingual embeddings

Microsoft released Harrier-OSS-v1, a family of decoder-only multilingual text embedding models supporting 94 languages and 32k context. The models achieve SOTA results on Multilingual MTEB v2 through architectural shifts and knowledge distillation.

// ANALYSIS

Microsoft is aggressively moving into the open-weights embedding space, proving that decoder-only backbones are the future for retrieval.

  • 32k token context window eliminates complex chunking strategies and "lost in the middle" issues for long-form RAG.
  • Smaller variants (270M, 0.6B) punch far above their weight due to distillation from 27B+ parameter backbones.
  • 94-language support and instruction-tuning make it a versatile foundation for global retrieval and classification tasks.
// TAGS
harrier-oss-v1llmembeddingmultilingualopen-sourcerag

DISCOVERED

10d ago

2026-04-02

PUBLISHED

10d ago

2026-04-02

RELEVANCE

9/ 10

AUTHOR

AI Revolution