Fuzheng Zhang

Edrex · Edrex1128

AI & ML interests

None yet

Recent Activity

updated a model 6 days ago

Edrex/gemma-3-1B-it-reasoning

published a model 6 days ago

Edrex/gemma-3-1B-it-reasoning

Organizations

None yet

authored a paper about 2 months ago

CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning

Paper • 2509.20712 • Published Sep 25 • 19

authored a paper 4 months ago

RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning

Paper • 2507.07451 • Published Jul 10 • 5

authored a paper 6 months ago

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Paper • 2505.19650 • Published May 26 • 5

authored 2 papers 7 months ago

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

Paper • 2504.10068 • Published Apr 14 • 30

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

Paper • 2504.06122 • Published Apr 8 • 5