- LNS-Madam: Low-Precision Training in Logarithmic Number System using Multiplicative Weight Update
  Paper • 2106.13914 • Published • 1
- HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
  Paper • 2506.15196 • Published • 3
- Ascend HiFloat8 Format for Deep Learning
  Paper • 2409.16626 • Published • 1
- Recipes for Pre-training LLMs with MXFP8
  Paper • 2506.08027 • Published • 1
zhangwenbin
ExceedZhang
AI & ML interests
None yet
Recent Activity
liked a model 3 days ago
deepseek-ai/DeepSeek-OCR
liked a model 3 days ago
Alibaba-NLP/Tongyi-DeepResearch-30B-A3B
upvoted a paper 4 days ago
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model
Organizations
None yet