⚠

Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.

C++NLPOptimize/flash-tokenizer

flash-tokenizer

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

59.4/100

★ 504Forks: 9

View on GitHub →Homepage →

Loading report...

Similar Projects

serving

A flexible, high-performance serving system for machine learning models

C++★ 6.3K

deeplake

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

C++★ 9.0K

cpeditor

The IDE for competitive programming :tada: | Fetch, Code, Compile, Run, Check, Submit :rocket:

C++★ 2.1K

symforce

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++★ 1.6K

← Back to List