redponike commited on
Commit
643ad27
·
verified ·
1 Parent(s): e84c9a7

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -0
README.md ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ GGUF quants of [nvidia/AceMath-72B-Instruct](https://huggingface.co/nvidia/AceMath-72B-Instruct)
2
+
3
+ [Paper link on arXiv](https://arxiv.org/abs/2412.15084)
4
+
5
+ Using llama.cpp b4682 (commit 0893e0114e934bdd0eba0ff69d9ef8c59343cbc3)
6
+
7
+ The importance matrix was generated with [groups_merged-enhancedV3.txt](https://github.com/ggerganov/llama.cpp/files/15440637/groups_merged-enhancedV3.txt) by InferenceIllusionist (later renamed calibration_datav3.txt), an edited version of kalomaze's original groups_merged.txt.
8
+
9
+ All quants were generated/calibrated with the imatrix, including the K quants.