1 21 5

Jeffrey Magder

jmagder

jmagder

AI & ML interests

None yet

Recent Activity

liked a Space 5 days ago

HuggingFaceTB/smol-training-playbook

updated a collection about 1 month ago

To read

upvoted a paper about 1 month ago

Efficient Estimation of Word Representations in Vector Space

View all activity

Organizations

None yet

upvoted a paper about 1 month ago

Efficient Estimation of Word Representations in Vector Space

Paper • 1301.3781 • Published Jan 16, 2013 • 7

upvoted a paper 3 months ago

Why do LLMs attend to the first token?

Paper • 2504.02732 • Published Apr 3 • 2

upvoted 2 papers 4 months ago

Accuracy is Not All You Need

Paper • 2407.09141 • Published Jul 12, 2024 • 3

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 269

upvoted a collection 4 months ago

Unsloth Dynamic 2.0 Quants

Collection

New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 53 items • Updated about 17 hours ago • 235

upvoted 2 papers 5 months ago

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26 • 55

GLaM: Efficient Scaling of Language Models with Mixture-of-Experts

Paper • 2112.06905 • Published Dec 13, 2021 • 2

upvoted an article 6 months ago

Article

The Large Language Model Course

•

Jan 16

• 209

upvoted a paper about 1 year ago

MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

Paper • 2409.07314 • Published Sep 11, 2024 • 56

upvoted 11 papers over 1 year ago

FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning

Paper • 2307.08691 • Published Jul 17, 2023 • 9

Jeffrey Magder

AI & ML interests

Recent Activity

Organizations

jmagder's activity

The Large Language Model Course