Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythonallenai/dolma

dolma

Data and tools for generating and inspecting OLMo pre-training data.

66.7/100
1.5KForks: 190
View on GitHubHomepage →
Loading report...

Similar Projects

langextract

91

A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.

Python35.8K

Chinese-LLaMA-Alpaca

89

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python18.9K

Chinese-LLaMA-Alpaca-2

83

中文LLaMA-2 & Alpaca-2大模型二期项目 + 64K超长上下文模型 (Chinese LLaMA-2 & Alpaca-2 LLMs with 64K long context models)

Python7.1K

Chinese-LLaMA-Alpaca-3

78

中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3

Python2.0K
Back to List