LaSeR - a Keven16 Collection

updated 18 days ago

Models from the paper "LaSeR: Reinforcement Learning with Last-Token Self-Rewarding"

Keven16/ORZ-7B-LaSeR
8B • Updated 19 days ago • 15

Keven16/Qwen2.5-7B-LaSeR
8B • Updated 19 days ago • 17

Keven16/OctoThinker-3B-Short-LaSeR
4B • Updated 19 days ago • 16

Keven16/LaSeR_training_data
Viewer • Updated 18 days ago • 104k • 51 • 1

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding
Paper • 2510.14943 • Published 18 days ago • 37
