9d7ca90 981bf16
1
2
3
4
5
6
7
8
--- license: mit library_name: transformers pipeline_tag: text-generation --- Follwoing LUFFY, we change to rope_theta from 10000 to 40000 and extend the context window to 16k.