Q: What is the main goal of the R1-V project?
A: The main goal is to improve the generalization ability of Vision-Language Models (VLMs) through Reinforcement Learning with Verifiable Rewards (RLVR), aiming for strong results at minimal training cost.
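To make "verifiable reward" concrete, here is a minimal sketch of a rule-based reward that can be checked automatically against a known answer. It is an illustration under stated assumptions, not the project's actual reward code: the `<answer>...</answer>` output format and the function names `extract_answer` and `verifiable_reward` are hypothetical.

```python
import re
from typing import Optional

# Assumption for illustration: the model wraps its final answer
# in <answer>...</answer> tags. This format is hypothetical.
ANSWER_PATTERN = re.compile(r"<answer>(.*?)</answer>", re.DOTALL)


def extract_answer(completion: str) -> Optional[str]:
    """Return the text inside the first <answer> tag pair, or None if absent."""
    match = ANSWER_PATTERN.search(completion)
    return match.group(1).strip() if match else None


def verifiable_reward(completion: str, ground_truth: str) -> float:
    """Return 1.0 if the extracted answer matches the ground truth, else 0.0.

    The comparison is a case-insensitive exact string match, so the reward
    can be verified without a learned reward model.
    """
    answer = extract_answer(completion)
    if answer is None:
        return 0.0
    return 1.0 if answer.lower() == ground_truth.strip().lower() else 0.0
```

For a counting question whose ground truth is "3", a completion ending in `<answer>3</answer>` would score 1.0, while a wrong count or a completion without answer tags would score 0.0.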
Q: What datasets are used in the R1-V project?
A: The project uses the CLEVR-70k dataset and the R1-Distilled Visual Reasoning Dataset for training and evaluation.
Q: How can I contribute to the R1-V project?
A: Contributions are welcome, especially on issues labeled "help wanted". You can explore the training codebase, share ideas, or submit pull requests.
Q: What are the key updates in the R1-V project?
A: Key updates include the release of the evaluation script, the training codebase, and the visual reasoning traces, along with ongoing improvements and community engagement.