Official implementation of Half-Quadratic Quantization (HQQ)
Accessible large language models via k-bit quantization for PyTorch.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Private AI platform for agents, assistants, and enterprise search. Built-in Agent Builder, deep research, document analysis, multi-model support, and API connectivity for agents.
Machine Learning Engineering Open Book