Back to List
Notice:This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Pythoncambrian-mllm/cambrian-s

cambrian-s

Cambrian-S: Towards Spatial Supersensing in Video

52.7/100
503Forks: 19
View on GitHubHomepage →
Loading report...

Similar Projects

InternVL

67

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python9.9K

sparrow

88

Structured data extraction and instruction calling with ML, LLM and Vision LLM

Python5.1K

star-vector

59

StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and textual inputs to produce high-quality SVG code with remarkable precision.

Python4.3K

mlx-vlm

82

MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.

Python2.2K
Back to List