Collections
Discover the best community collections!
Collections including paper arxiv:2402.17764
-
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Paper ⢠2208.07339 ⢠Published ⢠5 -
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Paper ⢠2210.17323 ⢠Published ⢠8 -
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Paper ⢠2211.10438 ⢠Published ⢠6 -
QLoRA: Efficient Finetuning of Quantized LLMs
Paper ⢠2305.14314 ⢠Published ⢠56
-
crystalai/thoth-guardian-ai-auto-train-cybersecurity-shield
Updated ⢠1 -
crystalai/thoth-guardian-cybersecurity-shield
Updated ⢠1 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
nyu-mll/glue
Viewer ⢠Updated ⢠1.49M ⢠371k ⢠448
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper ⢠2501.08313 ⢠Published ⢠298 -
Group Sequence Policy Optimization
Paper ⢠2507.18071 ⢠Published ⢠306 -
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
Paper ⢠2509.03867 ⢠Published ⢠208
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
nanonets/Nanonets-OCR-s
Image-Text-to-Text ⢠4B ⢠Updated ⢠143k ⢠1.55k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image ⢠Updated ⢠292k ⢠⢠2.38k -
15.4k
DeepSite v3
š³Generate any application by Vibe Coding
-
Rewnozom/agent-zero-v1-a-01
Text Generation ⢠4B ⢠Updated ⢠1 -
TheBloke/MythoMax-L2-13B-GGUF
13B ⢠Updated ⢠57k ⢠195 -
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation ⢠18B ⢠Updated ⢠50.7k ⢠381 -
QuantFactory/DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored-GGUF
Text Generation ⢠8B ⢠Updated ⢠13.1k ⢠118
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
MiniMax-01: Scaling Foundation Models with Lightning Attention
Paper ⢠2501.08313 ⢠Published ⢠298 -
Group Sequence Policy Optimization
Paper ⢠2507.18071 ⢠Published ⢠306 -
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth
Paper ⢠2509.03867 ⢠Published ⢠208
-
LLM.int8(): 8-bit Matrix Multiplication for Transformers at Scale
Paper ⢠2208.07339 ⢠Published ⢠5 -
GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers
Paper ⢠2210.17323 ⢠Published ⢠8 -
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Paper ⢠2211.10438 ⢠Published ⢠6 -
QLoRA: Efficient Finetuning of Quantized LLMs
Paper ⢠2305.14314 ⢠Published ⢠56
-
crystalai/thoth-guardian-ai-auto-train-cybersecurity-shield
Updated ⢠1 -
crystalai/thoth-guardian-cybersecurity-shield
Updated ⢠1 -
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
nyu-mll/glue
Viewer ⢠Updated ⢠1.49M ⢠371k ⢠448
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper ⢠2402.17764 ⢠Published ⢠625 -
nanonets/Nanonets-OCR-s
Image-Text-to-Text ⢠4B ⢠Updated ⢠143k ⢠1.55k -
black-forest-labs/FLUX.1-Kontext-dev
Image-to-Image ⢠Updated ⢠292k ⢠⢠2.38k -
15.4k
DeepSite v3
š³Generate any application by Vibe Coding
-
Rewnozom/agent-zero-v1-a-01
Text Generation ⢠4B ⢠Updated ⢠1 -
TheBloke/MythoMax-L2-13B-GGUF
13B ⢠Updated ⢠57k ⢠195 -
DavidAU/Llama-3.2-8X3B-MOE-Dark-Champion-Instruct-uncensored-abliterated-18.4B-GGUF
Text Generation ⢠18B ⢠Updated ⢠50.7k ⢠381 -
QuantFactory/DarkIdol-Llama-3.1-8B-Instruct-1.2-Uncensored-GGUF
Text Generation ⢠8B ⢠Updated ⢠13.1k ⢠118