Qwen2.5-Math-7B-L / README.md
yangzhch6's picture
Update README.md
981bf16 verified
metadata
license: mit
library_name: transformers
pipeline_tag: text-generation

Follwoing LUFFY, we change to rope_theta from 10000 to 40000 and extend the context window to 16k.