Leandro von Werra PRO
·
AI & ML interests
NLP and RL
Recent Activity
new activity 29 minutes ago
rl-llm-wiki/knowledge-base:topic: reward-model-ensembles — deepen to the flagship bar (12.2KB → 16.2KB) new activity about 1 hour ago
rl-llm-wiki/knowledge-base:topic: human-preference-collection — deepen to the flagship bar (11.2KB → 16.7KB)