MOTIF paper

MOTIF-trained model and vanilla GRPO-trained model, compared in the paper.

purbeshmitra/MOTIF • Text Generation • Updated Jul 7
purbeshmitra/vanillaGRPO • Text Generation • Updated Jul 7
MOTIF: Modular Thinking via Reinforcement Fine-tuning in LLMs • Paper 2507.02851 • Published Jul 3