gibberish output

#1
by freegheist - opened

hey! i ran in latest vLLM and its gives gibberish in the output (no worries just leaving feedback)

hey! i ran in latest vLLM and its gives gibberish in the output (no worries just leaving feedback)

Has there been any progress?

Thanks for reporting. I think I forgot to ignore the same layers as the official FP8 quant. Will fix ASAP.

Sign up or log in to comment