Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
C++NLPOptimize/flash-tokenizer

flash-tokenizer

EFFICIENT AND OPTIMIZED TOKENIZER ENGINE FOR LLM INFERENCE SERVING

59.4/100
504Forks: 9
View on GitHubHomepage →
Loading report...

Similar Projects

serving

77

A flexible, high-performance serving system for machine learning models

C++6.3K

deeplake

87

Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai

C++9.0K

cpeditor

83

The IDE for competitive programming :tada: | Fetch, Code, Compile, Run, Check, Submit :rocket:

C++2.1K

symforce

76

Fast symbolic computation, code generation, and nonlinear optimization for robotics

C++1.6K
Back to List