POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers. • 6 items • Updated May 23 • 13
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 102
Reward Bench Collection Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated Apr 30 • 9
Article Accelerated Inference with Optimum and Transformers Pipelines • May 10, 2022 • 3