Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
nm-testing 's Collections
Models in CI
FP8-Block Quantized Models
LLM Compressor testing
Speculators testing
Sparse-Llama-3.1-8B-2of4
SparseGPT LLMs
FP8 Models

LLM Compressor testing

updated 4 days ago
Upvote
-

  • nm-testing/tinysmokellama-3.2

    354k • Updated Sep 17 • 24.5k

  • nm-testing/llama2.c-stories42M-pruned2.4

    Updated 23 days ago • 352

  • nm-testing/tinyllama-fp8-dynamic-compressed

    1B • Updated Oct 9, 2024 • 411

  • nm-testing/tinyllama-w4a16-compressed

    0.3B • Updated Oct 9, 2024 • 284

  • nm-testing/tinyllama-w8a8-compressed

    1B • Updated Oct 9, 2024 • 586

  • nm-testing/tinyllama-w8a16-dense

    1B • Updated Oct 9, 2024 • 61

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-compressed

    1B • Updated Jan 14 • 469

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-FP8-Dynamic-uncompressed

    1B • Updated Jan 14 • 140

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-compressed

    0.3B • Updated Jan 14 • 10

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W4A16-G128-uncompressed

    1B • Updated Jan 14 • 4

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-compressed

    1B • Updated Jan 14 • 12

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A8-Dynamic-Per-Token-uncompressed

    1B • Updated Jan 14 • 4

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-compressed

    0.4B • Updated Jan 14 • 470

  • nm-testing/TinyLlama-1.1B-Chat-v1.0-W8A16-G128-uncompressed

    1B • Updated Jan 14 • 144
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs