gair-prox
/

math-doc-refining-lm

Text Generation

text-generation-inference

Model card Files Files and versions

Math-doc-refining-lm

Math-doc-refining-lm is an adapted 0.7B-ProX model, fine-tuned for doc level refining via program generation, and can be applied over math pre-training corpus such as open-web-math.

Citation

@article{zhou2024programming,
  title={Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale},
  author={Zhou, Fan and Wang, Zengzhi and Liu, Qian and Li, Junlong and Liu, Pengfei},
  journal={arXiv preprint arXiv:2409.17115},
  year={2024}
}

Downloads last month: 1

Safetensors

Model size

0.8B params

Tensor type

F32

·

Model tree for gair-prox/math-doc-refining-lm

Base model

gair-prox/RedPJ-ProX-0.7B

Finetuned

(1)

this model

Dataset used to train gair-prox/math-doc-refining-lm

Collection including gair-prox/math-doc-refining-lm

ProX Refining Models

Adapted small language models used to generate data refining programs • 5 items • Updated Oct 10, 2024 • 5