Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
PythonAIDC-AI/Ovis

Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

64.1/100
1.4KForks: 84
View on GitHubHomepage →
Loading report...

Similar Projects

MaxKB

90

🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。

Python20.3K

MobileAgent

71

Mobile-Agent: The Powerful GUI Agent Family

Python8.0K

VLM-R1

55

Solve Visual Understanding with Reinforced VLMs

Python5.9K

align-anything

57

Align Anything: Training All-modality Model with Feedback

Python4.6K
Back to List