Notice: This resource is provided by a third-party author. Please review the code with AI tools or manually before use to ensure security and compatibility.
Python · TIGER-AI-Lab/VLM2Vec

VLM2Vec

This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]

70.9/100
Stars: 586 · Forks: 51
View on GitHub · Homepage →

Similar Projects

all-in-rag

68

🔍 Practical LLM Application Development, Part 1: a full-stack guide to RAG. Read online at: https://datawhalechina.github.io/all-in-rag/

Python · 4.4K

ms-swift

88

Use PEFT or full-parameter training for CPT/SFT/DPO/GRPO on 600+ LLMs (Qwen3.5, DeepSeek-R1, GLM-5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, Phi4, ...) (AAAI 2025).

Python · 13.0K

VLM-R1

55

Solve Visual Understanding with Reinforced VLMs

Python · 5.9K

UltraRAG

85

A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines

Python · 5.4K