arxiv:2402.05808
KevinChen
KevinChenwx
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
Critique-RL: Training Language Models for Critiquing through Two-Stage
Reinforcement Learning