Math & Code Benchmark/Testing for GGUFs

by bobchenyx - opened Apr 26

Apr 26

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

shimmyshimmer

Unsloth AI org Apr 26

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

bobchenyx

Apr 27

•

edited Apr 27

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

Did you use that framework to test these GGUF models?
Caused I'v tried before and they didn't seems to be working.
and they are truly slow.

shimmyshimmer

Unsloth AI org Apr 28

Hi, thanks for releasing such great quantization versions.

I would like to ask if there are any open source frameworks/tools that could be used to test code/math benchmark for GGUF models?

Thanks!

You could use elethur ai's lm harness

Did you use that framework to test these GGUF models?
Caused I'v tried before and they didn't seems to be working.
and they are truly slow.

Yes we did. However they did not match official MMLU scores so we needed to make our own custom evaluation framework

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment