ParseBench - A Document Parsing Benchmark for AI Agents
OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows
PDF to markdown using vision LLMs — tables, layouts, and structure preserved
[EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.