Deep-Agent / R1-V Public

Notifications You must be signed in to change notification settings
Fork 226
Star 2.9k

Code
Issues 61
Pull requests 5
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: Deep-Agent/R1-V

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

61 Open 50 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

输入图片的size必须是相同大小的，否则会报错

#137 opened Feb 25, 2025 by JarvisFei

Proposal to Evaluate on the HumanEval-V Benchmark for Enhanced Visual Reasoning and Code Generation

#135 opened Feb 25, 2025 by zfj1998

vllm grpo trainer inputs_ids bug

#134 opened Feb 24, 2025 by tcy6

vllm grpo trainer不支持qwen2.5VL

#133 opened Feb 24, 2025 by tcy6

could not load weight to vllm after the first training step

#132 opened Feb 24, 2025 by llliuxiao

Invalidate trace cache @ step 0 and module 4374: cache has only 0 modules

#131 opened Feb 24, 2025 by JarvisFei

你们博客中的CoT SFT数据集会开源吗？

#130 opened Feb 24, 2025 by chiaitian

What is the Aha moment in R1-V

#128 opened Feb 24, 2025 by ruolinsss

flash_attn_2 error

#127 opened Feb 24, 2025 by dfgan

Completion Length Static (Wrong length logged to WANDB)

#125 opened Feb 22, 2025 by Syazvinski

Why does format reward equal to zero?

#124 opened Feb 21, 2025 by XavierCHEN34

Qwen2.5-VL RuntimeError: Split with sizes expects split sizes to sum exactly to 1(调用model.generate时报错）

#123 opened Feb 21, 2025 by Youngluc

About the computation of total training steps （关于训练step数量计算）

#122 opened Feb 21, 2025 by SpursGoZmy

Why not SFT-cold-start first?

#121 opened Feb 20, 2025 by dszpr

Flash attention error when training in latest environment

#120 opened Feb 19, 2025 by daydayup2100

GEOQA-8k datasets

#119 opened Feb 19, 2025 by PinxueGuo

torch_dtype Can not passed in Qwen2VLGRPOTrainer。

#118 opened Feb 19, 2025 by robinjoe93

Aria 无法正常执行

#116 opened Feb 19, 2025 by DeadLining

为什么多模态关于规范格式的prompt不写在system中

#115 opened Feb 18, 2025 by munian08

会支持internvl系列的grpo吗？

#114 opened Feb 18, 2025 by OrlandoBloom16

不能复现结果

#112 opened Feb 18, 2025 by zhiwenhou1227

关于多图片输入的问题

#111 opened Feb 18, 2025 by AIaimuti

support qwen2.5 vl in sft

#109 opened Feb 18, 2025 by LiuRicky

Qwen2.5-VL-3B OOM

#107 opened Feb 17, 2025 by Liuziyu77

different results of GRPO on qwen2-vl 2b

#106 opened Feb 17, 2025 by munian08

Previous 1 2 3 Next

Previous Next

ProTip! Updated in the last three days: updated:>2025-02-22.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly