INC4AI
/

phi-2-int4-inc

Text Generation

text-generation-inference

4-bit precision

intel/auto-round

Model card Files Files and versions

n1ck-guo commited on Oct 22, 2024

Commit

702dc57

·

verified ·

1 Parent(s): 3efe8d3

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -5,7 +5,7 @@ datasets:
 ---
 ## Model Details
-This model is an int4 model with group_size128 and sym quantization of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2)  generated by [intel/auto-round](https://github.com/intel/auto-round).  We found there is a large accuracy drop of asym kernel for this model.
@@ -70,7 +70,7 @@ She is curious and brave and
 ### Evaluate the model
-pip install lm-eval==0.4.2
 ```bash
 auto-round --eval --model Intel/phi-2-int4-inc --device cuda:0 --tasks lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,mmlu --batch_size 16

 ---
 ## Model Details
+This model is an int4 model with group_size128 and sym quantization of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2)  generated by [intel/auto-round](https://github.com/intel/auto-round).  We found there is a large accuracy drop of asym kernel for this model. If you need AutoGPTQ format, please load the model with revision 5973e3a
 ### Evaluate the model
+pip install lm-eval==0.4.4
 ```bash
 auto-round --eval --model Intel/phi-2-int4-inc --device cuda:0 --tasks lambada_openai,hellaswag,piqa,winogrande,truthfulqa_mc1,openbookqa,boolq,arc_easy,arc_challenge,mmlu --batch_size 16