Collections
Discover the best community collections!
Collections including paper arxiv:2402.17764
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627 -
CLEAR: Character Unlearning in Textual and Visual Modalities
Paper • 2410.18057 • Published • 210 -
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders
Paper • 2410.22366 • Published • 84 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 95
-
openai/whisper-large-v3-turbo
Automatic Speech Recognition • 0.8B • Updated • 4.57M • 2.71k -
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF
Text Generation • 71B • Updated • 9.75k • 2.06k -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 627 -
LLM360/TxT360
Updated • 17.9k • 241
-
CohereLabs/c4ai-command-r-plus-08-2024
Text Generation • 104B • Updated • 2.67k • 277 -
meta-llama/Meta-Llama-3-8B
Text Generation • 8B • Updated • 2.08M • 6.4k -
meta-llama/Meta-Llama-3-70B
Text Generation • 71B • Updated • 85.8k • 870 -
impira/layoutlm-document-qa
Document Question Answering • 0.1B • Updated • 22.5k • 1.15k
-
SciLitLLM: How to Adapt LLMs for Scientific Literature Understanding
Paper • 2408.15545 • Published • 38 -
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 -
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Paper • 2408.10914 • Published • 44 -
Automated Design of Agentic Systems
Paper • 2408.08435 • Published • 40