Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonNanoNets/docext

docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

80.7/100
1.9KForks: 135
View on GitHubHomepage →
Loading report...

Similar Projects

AdalFlow

84

AdalFlow: The library to build & auto-optimize LLM applications.

Python4.1K

open-webui

94

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Python127.6K

awesome-llm-apps

85

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Python102.6K

MinerU

88

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python56.4K
Back to List