Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python | NVIDIA-NeMo/Curator

Curator

Scalable data preprocessing and curation toolkit for LLMs

77.6/100
1.4K | Forks: 227

Similar Projects

llama_index

93

LlamaIndex is the leading document agent and OCR platform

Python | 47.5K

pandas-ai

74

Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.

Python | 23.3K

DeepAnalyze

74

DeepAnalyze is the first agentic LLM for autonomous data science. 🎈 Your AI data analyst: it automatically analyzes large volumes of data and generates professional analysis reports in one click!

Python | 3.8K

docetl

87

A system for agentic LLM-powered data processing and ETL

Python | 3.7K