Trevor Miller

MicrowaveJack

MicrowaveJack

AI & ML interests

None yet

Recent Activity

upvoted an article 26 days ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

liked a Space 29 days ago

nanotron/ultrascale-playbook

liked a Space 29 days ago

HuggingFaceFW/blogpost-fineweb-v1

View all activity

Organizations

None yet

upvoted an article 26 days ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

27 days ago

•

liked 3 Spaces 29 days ago

The Ultra-Scale Playbook

🌌

3.52k

The ultimate guide to training LLM on large GPU Clusters

FineWeb: decanting the web for the finest text data at scale

🍷

1.19k

Generate high-quality text data for LLMs using FineWeb

The Smol Training Playbook

📚

2.47k

The secrets to building world-class LLMs

liked a model about 2 months ago

microsoft/UserLM-8b

Text Generation • 8B • Updated Oct 9 • 2.07k • 354

liked a Space about 1 year ago

Qwen2.5 Coder Artifacts

🐢

1.69k

Generate code for applications

liked a model about 1 year ago

BAAI/bge-small-en-v1.5

Feature Extraction • 33.4M • Updated Feb 22, 2024 • 3.52M • • 382

liked a dataset about 1 year ago

gretelai/gretel-math-gsm8k-v1

Viewer • Updated Oct 16, 2024 • 24.9k • 442 • 39

liked a dataset over 1 year ago

TIGER-Lab/SKGInstruct

Preview • Updated Apr 9, 2024 • 179 • 28

liked 4 models over 1 year ago

liked a dataset over 1 year ago

TIGER-Lab/MMLU-Pro

Viewer • Updated Oct 25 • 12.1k • 55k • 395

upvoted a paper over 1 year ago

DynaVis: Dynamically Synthesized UI Widgets for Visualization Editing

Paper • 2401.10880 • Published Jan 19, 2024 • 1

liked a model almost 2 years ago

TheBloke/CodeLlama-70B-hf-GGUF

Text Generation • 69B • Updated Jan 30, 2024 • 725 • 42

updated a collection almost 2 years ago

Research

Collection

1 item • Updated Jan 29, 2024

upvoted a paper almost 2 years ago

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26, 2024 • 74

liked a model almost 2 years ago

mistralai/Mixtral-8x7B-Instruct-v0.1

47B • Updated Jul 24 • 386k • 4.61k

Trevor Miller

AI & ML interests

Recent Activity

Organizations

MicrowaveJack's activity

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

The Ultra-Scale Playbook

FineWeb: decanting the web for the finest text data at scale

The Smol Training Playbook

Qwen2.5 Coder Artifacts