Python · om-ai-lab/VLM-R1

VLM-R1

Solve Visual Understanding with Reinforced VLMs

70.0/100

★ 6.0K · Forks: 383

Similar Projects

verl-omni

Multimodal RL training framework for diffusion & omni models

Python · ★ 641

Relax

An Asynchronous Reinforcement Learning Engine for Omni-Modal Post-Training at Scale

Python · ★ 534

ms-swift

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3.6, DeepSeek-V4, GLM-5.1, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Gemma4, Llava, Phi4, ...) (AAAI 2025).

Python · ★ 14.9K

ART

Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen3.6, GPT-OSS, Llama, and more!

Python · ★ 10.5K
