MTEB: Massive Text Embedding Benchmark
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
Efficient Retrieval Augmentation and Generation Framework
📑 PageIndex: Document Index for Vectorless, Reasoning-based RAG
Fast lexical search implementing BM25 in Python