This is 01-ai's Yi-6B, converted to GGUF without quantization. No other changes were made.

The model was converted using convert.py from Georgi Gerganov's llama.cpp repo as it appears here (that is, the last change to the file was in commit #898aeca.)

All credit belongs to 01-ai for training and releasing this model. Thank you!

Downloads last month
9
GGUF
Model size
6B params
Architecture
llama
Hardware compatibility
Log In to view the estimation

We're not able to determine the quantization variants.

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support