Wentian Zhao's picture

6

Wentian Zhao

zwt123home123

·

[email protected]

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play

upvoted a paper about 1 month ago

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

upvoted a paper 4 months ago

Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs

View all activity

Organizations

None yet

Papers 2

arxiv:2504.09710

arxiv:2410.06169

models 116

zwt123home123/code_log_3

zwt123home123/reproduce_log

zwt123home123/code_log_2

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_320_actor

8B • Updated Apr 3

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-4ppl_largebs_global_step_203_actor

8B • Updated Apr 3

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_203_actor

8B • Updated Apr 3

zwt123home123/standardtraining_2p_Qwen2.5-7B-Instruct-1M-3ppl_largebs_global_step_400_actor

8B • Updated Apr 3 • 1

zwt123home123/global_step_840_actor

8B • Updated Apr 2 • 1

zwt123home123/InternVL2_5-8B

Image-Text-to-Text • 8B • Updated Feb 19

zwt123home123/KV_internvl26b

View 116 models

datasets 2

zwt123home123/code_log_2

Updated May 12 • 2

zwt123home123/code_log

Updated May 12 • 4