R-PRM: Reasoning-Driven Process Reward Modeling
Shuaijie She
kevinpro
AI & ML interests
Reasoning, Chain of Thoughts, Alignment, Factual Consistency, Summarization
Recent Activity
liked
a dataset
2 days ago
speechcolab/gigaspeech
liked
a dataset
2 days ago
parler-tts/mls_eng
liked
a dataset
2 days ago
parler-tts/mls_eng_10k