Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonBinWang28/audio-ai-hub

audio-ai-hub

The hub for audio AI research: papers, open models, benchmarks & datasets across audio LLMs, speech recognition, TTS, music & audio generation.

66.4/100
931Forks: 48
View on GitHubHomepage →
Loading report...

Similar Projects

CosyVoice

75

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python21.6K

AI-Waifu-Vtuber

56

AI Vtuber for Streaming on Youtube/Twitch

Python1.1K

transformers

98

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python161.5K

FunASR

91

Industrial-grade speech recognition toolkit: 170x realtime, 50+ languages, speaker diarization, emotion detection, streaming, and OpenAI-compatible API.

Python17.6K
Back to List