Prakamya commited on
Commit
137b674
·
verified ·
1 Parent(s): eabfe94

Upload folder using huggingface_hub

Browse files
Files changed (2) hide show
  1. LICENSE +304 -304
  2. README.md +178 -181
LICENSE CHANGED
@@ -1,304 +1,304 @@
1
- SAND-MATH [Open RAIL-MSD]
2
-
3
- Licensed Artifact(s):
4
-
5
- - Model, Source Code, Data
6
-
7
- Section I: PREAMBLE
8
-
9
- BY ACCESSING, DOWNLOADING, INSTALLING, OR USING THE ARTIFACT, YOU AGREE
10
- TO BE BOUND BY THIS LICENSE. IF YOU DO NOT AGREE TO ALL OF THE TERMS AND
11
- CONDITIONS OF THIS LICENSE, DO NOT ACCESS, DOWNLOAD, INSTALL, OR USE THE
12
- ARTIFACT.
13
-
14
- 1. Definitions
15
-
16
- (a) Application refers to a sequence of instructions or statements
17
- written in machine code language, including object code (that is the
18
- product of a compiler), binary code (data using a two-symbol system)
19
- or an intermediate language (such as register transfer language).
20
-
21
- (b) Artifact refers to a software application (in either binary or
22
- source code format), Data, Model, and/or Source Code, in accordance
23
- with what is specified above as the Licensed Artifact”.
24
-
25
- (c) Contribution means any work, including any modifications or
26
- additions to an Artifact, that is intentionally submitted to
27
- Licensor for inclusion or incorporation in the Artifact directly or
28
- indirectly by the rights owner. For the purposes of this definition,
29
- submitted means any form of electronic, verbal, or written
30
- communication sent to the Licensor or its representatives, including
31
- but not limited to communication on electronic mailing lists, source
32
- code control systems, and issue tracking systems that are managed
33
- by, or on behalf of, the Licensor for the purpose of discussing,
34
- sharing and improving the Artifact, but excluding communication that
35
- is conspicuously marked or otherwise designated in writing by the
36
- contributor as Not a Contribution.”
37
-
38
- (d) Contributor means Licensor or any other individual or legal entity
39
- that creates or owns a Contribution that is added to or incorporated
40
- into an Artifact or its Derivative.
41
-
42
- (e) Data means a collection of information and/or content extracted
43
- from the dataset used with a given Model, including to train,
44
- pretrain, or otherwise evaluate the Model.
45
-
46
- (f) Derivative means a work derived from or based upon an Artifact,
47
- and includes all modified versions of such Artifact.
48
-
49
- (g) Distribution means any transmission, reproduction, publication or
50
- other sharing of an Artifact or Derivative to a Third Party,
51
- including providing a hosted service incorporating the Artifact,
52
- which is made available by electronic or other remote means -
53
- e.g. API-based or web access.
54
-
55
- (h) Harm includes but is not limited to physical, mental,
56
- psychological, financial and reputational damage, pain, or loss.
57
-
58
- (i) License means the terms and conditions for use, reproduction, and
59
- Distribution as defined in this document.
60
-
61
- (j) Licensor means the rights owner (by virtue of creation or
62
- documented transfer of ownership) or entity authorized by the rights
63
- owner (e.g., exclusive licensee) that is granting the rights in this
64
- License.
65
-
66
- (k) Model means any machine-learning based assembly or assemblies
67
- (including checkpoints), consisting of learnt weights, parameters
68
- (including optimizer states), corresponding to the model
69
- architecture as embodied in the Source Code.
70
-
71
- (l) Output means the results of operating a Model as embodied in
72
- informational content resulting therefrom.
73
-
74
- (m) Source Code means any collection of text written using
75
- human-readable programming language, including the code and scripts
76
- used to define, run, load, benchmark or evaluate a Model or any
77
- component thereof, and/or used to prepare data for training or
78
- evaluation, if any. Source Code includes any accompanying
79
- documentation, tutorials, examples, etc, if any. For clarity, the
80
- term Source Code as used in this License includes any and all
81
- Derivatives of such Source Code.
82
-
83
- (n) Third Parties means individuals or legal entities that are not
84
- under common control with Licensor or You.
85
-
86
- (o) Use includes accessing, using, copying, modifying, and/or
87
- distributing an Artifact; in connection with a Model as Artifact,
88
- Use also includes creating content, fine-tuning, updating, running,
89
- training, evaluating and/or re-parametrizing such Model.
90
-
91
- (p) You (or Your) means an individual or legal entity receiving and
92
- exercising permissions granted by this License and/or making use of
93
- the Artifact for permitted purposes and in any permitted field of
94
- use, including usage of the Artifact in an end-use application -
95
- e.g. chatbot, translator, image generator, etc.
96
-
97
- Section II: INTELLECTUAL PROPERTY RIGHTS
98
-
99
- Both copyright and patent grants may apply to the Artifact. The Artifact
100
- is subject to additional terms and conditions as described in Section III
101
- below.
102
-
103
- 2. Grant of Copyright License. Conditioned upon compliance with Section
104
- III below and subject to the terms and conditions of this License, each
105
- Contributor hereby grants to You a worldwide, non-exclusive, royalty-free copyright license to
106
- reproduce, use, publicly display, publicly perform, sublicense, and
107
- distribute the Artifact and Derivatives thereof.
108
-
109
- 3. Grant of Patent License. Conditioned upon compliance with Section III
110
- below and subject to the terms and conditions of this License, and only
111
- where and as applicable, each Contributor hereby grants to You a worldwide, non-exclusive,
112
- royalty-free, irrevocable (except as stated in this paragraph) patent
113
- license to make, have made, use, sell, offer to sell, import, and
114
- otherwise transfer the Artifact where such license applies only to those
115
- patent claims licensable by such Contributor that are necessarily
116
- infringed by their Contribution(s) alone or by combination of their
117
- Contribution(s) with the Artifact to which such Contribution(s) was
118
- submitted. If You institute patent litigation against any entity
119
- (including a cross-claim or counterclaim in a lawsuit) alleging that the
120
- Artifact and/or a Contribution incorporated within the Artifact
121
- constitutes direct or contributory patent infringement, then any patent
122
- licenses granted to You under this License in connection with the
123
- Artifact shall terminate as of the date such litigation is asserted or
124
- filed.
125
-
126
- Licensor and Contributor each have the right to grant the licenses
127
- above.
128
-
129
- Section III: CONDITIONS OF USAGE, DISTRIBUTION AND REDISTRIBUTION
130
-
131
- 4. Use-based restrictions. The restrictions set forth in Attachment A
132
- are mandatory Use-based restrictions. Therefore You may not Use the
133
- Artifact in violation of such restrictions. You may Use the Artifact
134
- only subject to this License. You shall require all of Your users who
135
- use the Artifact or its Derivative to comply with the terms of this
136
- paragraph.
137
-
138
- 5. The Output You Generate with a Model (as Artifact). Except as set
139
- forth herein, Licensor claims no rights in the Output You generate. You
140
- are accountable for the Output You generate and its subsequent uses. No
141
- use of the Output may contravene any provision as stated in this
142
- License.
143
-
144
- 6. Distribution and Redistribution. You may host for Third Party remote
145
- access purposes (e.g. software-as-a-service), reproduce and distribute
146
- copies of the Artifact or its Derivatives in any medium, with or without
147
- modifications, provided that You meet the following conditions:
148
-
149
- 6.1. Use-based restrictions in paragraph 4 MUST be included as a
150
- condition precedent to effect any type of legal agreement (e.g. a
151
- license) governing the use and/or distribution of the Artifact or
152
- its Derivatives, and You shall give such notice to any subsequent
153
- Third Party recipients;
154
- 6.2. You shall give any Third Party recipients of the Artifact or its
155
- Derivatives a copy of this License;
156
- 6.3. You shall cause any modified files to carry prominent notices
157
- stating that You changed the files;
158
- 6.4. You shall retain all copyright, patent, trademark, and attribution
159
- notices excluding those notices that do not pertain to any part of
160
- the Artifact or its Derivatives.
161
-
162
- You may add Your own copyright statement to Your modifications and may
163
- provide additional or different license terms and conditions with
164
- respect to paragraph 6.1., to govern the use, reproduction, or
165
- Distribution of Your modifications, or for any Derivative, provided that
166
- Your use, reproduction, and Distribution of the Artifact or its
167
- Derivative otherwise complies with the conditions stated in this
168
- License. In other words, the Use-based restrictions in Attachment A form
169
- the minimum set of terms for You to license to Third Parties any
170
- Artifact or its Derivative, but You may add more restrictive terms if
171
- You deem it necessary.
172
-
173
- Section IV: OTHER PROVISIONS
174
-
175
- 7. Updates and Runtime Restrictions. To the maximum extent permitted by
176
- law, Licensor reserves the right to restrict (remotely or otherwise)
177
- usage of the Artifact in violation of this License or update the
178
- Artifact through electronic means.
179
-
180
- 8. Trademarks and Related. Nothing in this License permits You to make
181
- use of Licensors trademarks, trade names, logos or to otherwise suggest
182
- endorsement or misrepresent the relationship between the parties; and
183
- any rights not expressly granted herein are reserved by the Licensors.
184
-
185
- 9. Disclaimer of Warranty. Unless required by applicable law or agreed
186
- to in writing, Licensor provides the Artifact (and each Contributor
187
- provides its Contributions) on an AS IS BASIS, WITHOUT WARRANTIES OR
188
- CONDITIONS OF ANY KIND, either express or implied, including, without
189
- limitation, any warranties or conditions of TITLE, NON-INFRINGEMENT,
190
- MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE. You are solely
191
- responsible for determining the appropriateness of using the Artifact,
192
- and assume any risks associated with Your exercise of permissions under
193
- this License.
194
-
195
- 10. Limitation of Liability. In no event and under no legal theory,
196
- whether in tort (including negligence), contract, or otherwise, unless
197
- required by applicable law (such as deliberate and grossly negligent
198
- acts) or agreed to in writing, shall any Contributor be liable to You
199
- for damages, including any direct, indirect, special, incidental, or
200
- consequential damages of any character arising as a result of this
201
- License or out of the use or inability to use the Artifact (including
202
- but not limited to damages for loss of goodwill, work stoppage, computer
203
- failure or malfunction, or any and all other commercial damages or
204
- losses), even if such Contributor has been advised of the possibility of
205
- such damages.
206
-
207
- 11. If any provision of this License is held to be invalid, illegal or
208
- unenforceable, the remaining provisions shall be unaffected thereby and
209
- remain valid as if such provision had not been set forth herein.
210
-
211
- 12. Term and Termination. The term of this License will commence upon
212
- the earlier of Your (a) acceptance of this License or (b) accessing the
213
- Artifact; and will continue in full force and effect until terminated in
214
- accordance with the terms and conditions herein. Licensor may terminate
215
- this License if You are in breach of any term or condition of this
216
- License. Upon termination of this License, all licenses granted to You
217
- will terminate and You must promptly delete and cease use of the
218
- Artifact. Sections 1, 7, 8, 9, 10, 11, and 12 survive termination of
219
- this License.
220
-
221
- END OF TERMS AND CONDITIONS
222
-
223
- Attachment A
224
-
225
- AMD Responsible AI Use Policy
226
-
227
- AMD is committed to the responsible use of its Artificial Intelligence
228
- (AI) products and technologies (AMD AI). AMD AI may include
229
- artificial intelligence or machine learning technologies that use
230
- algorithms to analyze data and generate output using predictions based
231
- on patterns in data. This policy explains the uses that AMD
232
- specifically prohibits.
233
-
234
- If you use any AMD AI, you are agreeing to use the AMD AI in compliance
235
- with applicable laws and not for any of the following prohibited uses.
236
-
237
- Prohibited Uses:
238
-
239
- 1) No Illegal Acts. Do not use AMD AI in violation of any applicable
240
- national, state, local, or other jurisdictional law, rule, regulation,
241
- or sanction.
242
-
243
- 2) No Explicit Content. Do not use AMD AI to submit (as input),
244
- generate, or disseminate content depicting violent or sexually explicit
245
- content or to create sexual chatbots.
246
-
247
- 3) No Harm. Do not use AMD AI for any potentially harmful uses,
248
- including fraud, deception, discrimination, abuse, or harassment,
249
- including the following:
250
-
251
- a) Harm or abuse of a minor, including grooming and child sexual
252
- exploitation.
253
-
254
- b) Impersonation of human beings for purposes of deception.
255
-
256
- c) Generation or dissemination of information you know to be false
257
- for the purpose of harming others.
258
-
259
- d) Intentionally defame, disparage, or otherwise harass others.
260
-
261
- e) Intentionally attempting to materially distort the behavior of a
262
- person in a manner that causes or is likely to cause that person
263
- or another person physical or psychological harm.
264
-
265
- f) Providing medical advice or interpretation of medical results that
266
- is intended to be a substitute for professional medical advice,
267
- diagnosis, or treatment.
268
-
269
- g) Engaging in the unlawful or unauthorized practice of any
270
- profession, including financial, legal, medical, health, or
271
- related professional practices.
272
-
273
- h) Judgment of, discrimination against, or harm to individuals or
274
- groups based on legally protected characteristics or categories,
275
- online or offline social behavior, or known or predicted personal
276
- or personality characteristics, including any of the foregoing
277
- uses in social credit systems.
278
-
279
- 4) No High-Risk Activity. Do not use AMD AI in any high-risk activities
280
- or applications that create a risk of personal injury, death, or
281
- severe property or environmental damage, including in weapons or
282
- military applications.
283
-
284
- 5) No Personal Information. Do not use AMD AI to collect, process, or
285
- disclose personal data, including heath or sensitive personal
286
- information, without the necessary rights or consents.
287
-
288
- 6) No Infringement. Do not use AMD AI to generate or disseminate any
289
- information that infringes upon or misappropriates the intellectual
290
- property rights of others, including copyright, trademark, patent, and
291
- trade secret rights, rights to privacy, and publicity rights.
292
-
293
- 7) No Malware. Do not use AMD AI to generate or disseminate malware or
294
- any other content to be used for the purpose of facilitating unpermitted
295
- access to, or use of, computer systems or data.
296
-
297
- 8) No Obfuscation. Do not inappropriately obfuscate or fail to disclose
298
- to end users the presence of AI in any application in which AMD AI is
299
- deployed, along with any known risks or dangers of using AI without
300
- appropriate safeguards, oversight and human control.
301
-
302
- 9) No Reliance. Do not rely on any information generated using AMD AI
303
- without assessing it for accuracy, potential for harm, or other specific
304
- risks applicable to the use case.
 
1
+ SAND-MATH [Open RAIL-MSD]
2
+
3
+ Licensed Artifact(s):
4
+
5
+ - Model, Source Code, Data
6
+
7
+ Section I: PREAMBLE
8
+
9
+ BY ACCESSING, DOWNLOADING, INSTALLING, OR USING THE ARTIFACT, YOU AGREE
10
+ TO BE BOUND BY THIS LICENSE. IF YOU DO NOT AGREE TO ALL OF THE TERMS AND
11
+ CONDITIONS OF THIS LICENSE, DO NOT ACCESS, DOWNLOAD, INSTALL, OR USE THE
12
+ ARTIFACT.
13
+
14
+ 1. Definitions
15
+
16
+ (a) Application refers to a sequence of instructions or statements
17
+ written in machine code language, including object code (that is the
18
+ product of a compiler), binary code (data using a two-symbol system)
19
+ or an intermediate language (such as register transfer language).
20
+
21
+ (b) Artifact refers to a software application (in either binary or
22
+ source code format), Data, Model, and/or Source Code, in accordance
23
+ with what is specified above as the Licensed Artifact�.
24
+
25
+ (c) Contribution means any work, including any modifications or
26
+ additions to an Artifact, that is intentionally submitted to
27
+ Licensor for inclusion or incorporation in the Artifact directly or
28
+ indirectly by the rights owner. For the purposes of this definition,
29
+ submitted means any form of electronic, verbal, or written
30
+ communication sent to the Licensor or its representatives, including
31
+ but not limited to communication on electronic mailing lists, source
32
+ code control systems, and issue tracking systems that are managed
33
+ by, or on behalf of, the Licensor for the purpose of discussing,
34
+ sharing and improving the Artifact, but excluding communication that
35
+ is conspicuously marked or otherwise designated in writing by the
36
+ contributor as Not a Contribution.�
37
+
38
+ (d) Contributor means Licensor or any other individual or legal entity
39
+ that creates or owns a Contribution that is added to or incorporated
40
+ into an Artifact or its Derivative.
41
+
42
+ (e) Data means a collection of information and/or content extracted
43
+ from the dataset used with a given Model, including to train,
44
+ pretrain, or otherwise evaluate the Model.
45
+
46
+ (f) Derivative means a work derived from or based upon an Artifact,
47
+ and includes all modified versions of such Artifact.
48
+
49
+ (g) Distribution means any transmission, reproduction, publication or
50
+ other sharing of an Artifact or Derivative to a Third Party,
51
+ including providing a hosted service incorporating the Artifact,
52
+ which is made available by electronic or other remote means -
53
+ e.g. API-based or web access.
54
+
55
+ (h) Harm includes but is not limited to physical, mental,
56
+ psychological, financial and reputational damage, pain, or loss.
57
+
58
+ (i) License means the terms and conditions for use, reproduction, and
59
+ Distribution as defined in this document.
60
+
61
+ (j) Licensor means the rights owner (by virtue of creation or
62
+ documented transfer of ownership) or entity authorized by the rights
63
+ owner (e.g., exclusive licensee) that is granting the rights in this
64
+ License.
65
+
66
+ (k) Model means any machine-learning based assembly or assemblies
67
+ (including checkpoints), consisting of learnt weights, parameters
68
+ (including optimizer states), corresponding to the model
69
+ architecture as embodied in the Source Code.
70
+
71
+ (l) Output means the results of operating a Model as embodied in
72
+ informational content resulting therefrom.
73
+
74
+ (m) Source Code means any collection of text written using
75
+ human-readable programming language, including the code and scripts
76
+ used to define, run, load, benchmark or evaluate a Model or any
77
+ component thereof, and/or used to prepare data for training or
78
+ evaluation, if any. Source Code includes any accompanying
79
+ documentation, tutorials, examples, etc, if any. For clarity, the
80
+ term Source Code as used in this License includes any and all
81
+ Derivatives of such Source Code.
82
+
83
+ (n) Third Parties means individuals or legal entities that are not
84
+ under common control with Licensor or You.
85
+
86
+ (o) Use includes accessing, using, copying, modifying, and/or
87
+ distributing an Artifact; in connection with a Model as Artifact,
88
+ Use also includes creating content, fine-tuning, updating, running,
89
+ training, evaluating and/or re-parametrizing such Model.
90
+
91
+ (p) You (or Your) means an individual or legal entity receiving and
92
+ exercising permissions granted by this License and/or making use of
93
+ the Artifact for permitted purposes and in any permitted field of
94
+ use, including usage of the Artifact in an end-use application -
95
+ e.g. chatbot, translator, image generator, etc.
96
+
97
+ Section II: INTELLECTUAL PROPERTY RIGHTS
98
+
99
+ Both copyright and patent grants may apply to the Artifact. The Artifact
100
+ is subject to additional terms and conditions as described in Section III
101
+ below.
102
+
103
+ 2. Grant of Copyright License. Conditioned upon compliance with Section
104
+ III below and subject to the terms and conditions of this License, each
105
+ Contributor hereby grants to You a worldwide, non-exclusive, royalty-free copyright license to
106
+ reproduce, use, publicly display, publicly perform, sublicense, and
107
+ distribute the Artifact and Derivatives thereof.
108
+
109
+ 3. Grant of Patent License. Conditioned upon compliance with Section III
110
+ below and subject to the terms and conditions of this License, and only
111
+ where and as applicable, each Contributor hereby grants to You a worldwide, non-exclusive,
112
+ royalty-free, irrevocable (except as stated in this paragraph) patent
113
+ license to make, have made, use, sell, offer to sell, import, and
114
+ otherwise transfer the Artifact where such license applies only to those
115
+ patent claims licensable by such Contributor that are necessarily
116
+ infringed by their Contribution(s) alone or by combination of their
117
+ Contribution(s) with the Artifact to which such Contribution(s) was
118
+ submitted. If You institute patent litigation against any entity
119
+ (including a cross-claim or counterclaim in a lawsuit) alleging that the
120
+ Artifact and/or a Contribution incorporated within the Artifact
121
+ constitutes direct or contributory patent infringement, then any patent
122
+ licenses granted to You under this License in connection with the
123
+ Artifact shall terminate as of the date such litigation is asserted or
124
+ filed.
125
+
126
+ Licensor and Contributor each have the right to grant the licenses
127
+ above.
128
+
129
+ Section III: CONDITIONS OF USAGE, DISTRIBUTION AND REDISTRIBUTION
130
+
131
+ 4. Use-based restrictions. The restrictions set forth in Attachment A
132
+ are mandatory Use-based restrictions. Therefore You may not Use the
133
+ Artifact in violation of such restrictions. You may Use the Artifact
134
+ only subject to this License. You shall require all of Your users who
135
+ use the Artifact or its Derivative to comply with the terms of this
136
+ paragraph.
137
+
138
+ 5. The Output You Generate with a Model (as Artifact). Except as set
139
+ forth herein, Licensor claims no rights in the Output You generate. You
140
+ are accountable for the Output You generate and its subsequent uses. No
141
+ use of the Output may contravene any provision as stated in this
142
+ License.
143
+
144
+ 6. Distribution and Redistribution. You may host for Third Party remote
145
+ access purposes (e.g. software-as-a-service), reproduce and distribute
146
+ copies of the Artifact or its Derivatives in any medium, with or without
147
+ modifications, provided that You meet the following conditions:
148
+
149
+ 6.1. Use-based restrictions in paragraph 4 MUST be included as a
150
+ condition precedent to effect any type of legal agreement (e.g. a
151
+ license) governing the use and/or distribution of the Artifact or
152
+ its Derivatives, and You shall give such notice to any subsequent
153
+ Third Party recipients;
154
+ 6.2. You shall give any Third Party recipients of the Artifact or its
155
+ Derivatives a copy of this License;
156
+ 6.3. You shall cause any modified files to carry prominent notices
157
+ stating that You changed the files;
158
+ 6.4. You shall retain all copyright, patent, trademark, and attribution
159
+ notices excluding those notices that do not pertain to any part of
160
+ the Artifact or its Derivatives.
161
+
162
+ You may add Your own copyright statement to Your modifications and may
163
+ provide additional or different license terms and conditions with
164
+ respect to paragraph 6.1., to govern the use, reproduction, or
165
+ Distribution of Your modifications, or for any Derivative, provided that
166
+ Your use, reproduction, and Distribution of the Artifact or its
167
+ Derivative otherwise complies with the conditions stated in this
168
+ License. In other words, the Use-based restrictions in Attachment A form
169
+ the minimum set of terms for You to license to Third Parties any
170
+ Artifact or its Derivative, but You may add more restrictive terms if
171
+ You deem it necessary.
172
+
173
+ Section IV: OTHER PROVISIONS
174
+
175
+ 7. Updates and Runtime Restrictions. To the maximum extent permitted by
176
+ law, Licensor reserves the right to restrict (remotely or otherwise)
177
+ usage of the Artifact in violation of this License or update the
178
+ Artifact through electronic means.
179
+
180
+ 8. Trademarks and Related. Nothing in this License permits You to make
181
+ use of Licensors trademarks, trade names, logos or to otherwise suggest
182
+ endorsement or misrepresent the relationship between the parties; and
183
+ any rights not expressly granted herein are reserved by the Licensors.
184
+
185
+ 9. Disclaimer of Warranty. Unless required by applicable law or agreed
186
+ to in writing, Licensor provides the Artifact (and each Contributor
187
+ provides its Contributions) on an AS IS BASIS, WITHOUT WARRANTIES OR
188
+ CONDITIONS OF ANY KIND, either express or implied, including, without
189
+ limitation, any warranties or conditions of TITLE, NON-INFRINGEMENT,
190
+ MERCHANTABILITY, or FITNESS FOR A PARTICULAR PURPOSE. You are solely
191
+ responsible for determining the appropriateness of using the Artifact,
192
+ and assume any risks associated with Your exercise of permissions under
193
+ this License.
194
+
195
+ 10. Limitation of Liability. In no event and under no legal theory,
196
+ whether in tort (including negligence), contract, or otherwise, unless
197
+ required by applicable law (such as deliberate and grossly negligent
198
+ acts) or agreed to in writing, shall any Contributor be liable to You
199
+ for damages, including any direct, indirect, special, incidental, or
200
+ consequential damages of any character arising as a result of this
201
+ License or out of the use or inability to use the Artifact (including
202
+ but not limited to damages for loss of goodwill, work stoppage, computer
203
+ failure or malfunction, or any and all other commercial damages or
204
+ losses), even if such Contributor has been advised of the possibility of
205
+ such damages.
206
+
207
+ 11. If any provision of this License is held to be invalid, illegal or
208
+ unenforceable, the remaining provisions shall be unaffected thereby and
209
+ remain valid as if such provision had not been set forth herein.
210
+
211
+ 12. Term and Termination. The term of this License will commence upon
212
+ the earlier of Your (a) acceptance of this License or (b) accessing the
213
+ Artifact; and will continue in full force and effect until terminated in
214
+ accordance with the terms and conditions herein. Licensor may terminate
215
+ this License if You are in breach of any term or condition of this
216
+ License. Upon termination of this License, all licenses granted to You
217
+ will terminate and You must promptly delete and cease use of the
218
+ Artifact. Sections 1, 7, 8, 9, 10, 11, and 12 survive termination of
219
+ this License.
220
+
221
+ END OF TERMS AND CONDITIONS
222
+
223
+ Attachment A
224
+
225
+ AMD Responsible AI Use Policy
226
+
227
+ AMD is committed to the responsible use of its Artificial Intelligence
228
+ (AI) products and technologies (AMD AI). AMD AI may include
229
+ artificial intelligence or machine learning technologies that use
230
+ algorithms to analyze data and generate output using predictions based
231
+ on patterns in data. This policy explains the uses that AMD
232
+ specifically prohibits.
233
+
234
+ If you use any AMD AI, you are agreeing to use the AMD AI in compliance
235
+ with applicable laws and not for any of the following prohibited uses.
236
+
237
+ Prohibited Uses:
238
+
239
+ 1) No Illegal Acts. Do not use AMD AI in violation of any applicable
240
+ national, state, local, or other jurisdictional law, rule, regulation,
241
+ or sanction.
242
+
243
+ 2) No Explicit Content. Do not use AMD AI to submit (as input),
244
+ generate, or disseminate content depicting violent or sexually explicit
245
+ content or to create sexual chatbots.
246
+
247
+ 3) No Harm. Do not use AMD AI for any potentially harmful uses,
248
+ including fraud, deception, discrimination, abuse, or harassment,
249
+ including the following:
250
+
251
+ a) Harm or abuse of a minor, including grooming and child sexual
252
+ exploitation.
253
+
254
+ b) Impersonation of human beings for purposes of deception.
255
+
256
+ c) Generation or dissemination of information you know to be false
257
+ for the purpose of harming others.
258
+
259
+ d) Intentionally defame, disparage, or otherwise harass others.
260
+
261
+ e) Intentionally attempting to materially distort the behavior of a
262
+ person in a manner that causes or is likely to cause that person
263
+ or another person physical or psychological harm.
264
+
265
+ f) Providing medical advice or interpretation of medical results that
266
+ is intended to be a substitute for professional medical advice,
267
+ diagnosis, or treatment.
268
+
269
+ g) Engaging in the unlawful or unauthorized practice of any
270
+ profession, including financial, legal, medical, health, or
271
+ related professional practices.
272
+
273
+ h) Judgment of, discrimination against, or harm to individuals or
274
+ groups based on legally protected characteristics or categories,
275
+ online or offline social behavior, or known or predicted personal
276
+ or personality characteristics, including any of the foregoing
277
+ uses in social credit systems.
278
+
279
+ 4) No High-Risk Activity. Do not use AMD AI in any high-risk activities
280
+ or applications that create a risk of personal injury, death, or
281
+ severe property or environmental damage, including in weapons or
282
+ military applications.
283
+
284
+ 5) No Personal Information. Do not use AMD AI to collect, process, or
285
+ disclose personal data, including heath or sensitive personal
286
+ information, without the necessary rights or consents.
287
+
288
+ 6) No Infringement. Do not use AMD AI to generate or disseminate any
289
+ information that infringes upon or misappropriates the intellectual
290
+ property rights of others, including copyright, trademark, patent, and
291
+ trade secret rights, rights to privacy, and publicity rights.
292
+
293
+ 7) No Malware. Do not use AMD AI to generate or disseminate malware or
294
+ any other content to be used for the purpose of facilitating unpermitted
295
+ access to, or use of, computer systems or data.
296
+
297
+ 8) No Obfuscation. Do not inappropriately obfuscate or fail to disclose
298
+ to end users the presence of AI in any application in which AMD AI is
299
+ deployed, along with any known risks or dangers of using AI without
300
+ appropriate safeguards, oversight and human control.
301
+
302
+ 9) No Reliance. Do not rely on any information generated using AMD AI
303
+ without assessing it for accuracy, potential for harm, or other specific
304
+ risks applicable to the use case.
README.md CHANGED
@@ -1,181 +1,178 @@
1
- ---
2
- license: other
3
- license_link: LICENSE
4
- library_name: transformers
5
- pipeline_tag: text-generation
6
- datasets:
7
- - amd/SAND-Post-Training-Dataset
8
-
9
- language:
10
- - en
11
- base_model:
12
- - Qwen/Qwen2.5-32B-Instruct
13
- ---
14
-
15
- # SAND-Reasoning: Best-in-class Large Reasoning Model Built with Synthetic Data only using AMD GPUs
16
-
17
- <div align="center">
18
-
19
- | [**📄 Technical Report**](https://arxiv.org/pdf/2507.20527) | [**💾 Synthetic Datasets**](https://huggingface.co/datasets/amd/SAND-Post-Training-Dataset) | [**💻 GitHub Repository**](https://huggingface.co/datasets/amd/SAND-Post-Training-Dataset) | [**📝 Blog Post**](https://rocm.blogs.amd.com/artificial-intelligence/sand-math/README.html) |
20
- | :---: | :---: | :---: | :---: |
21
-
22
- </div>
23
-
24
- ---
25
-
26
- ## Model Summary
27
-
28
- We introduce **SAND-Math-Qwen2.5-32B** and **SAND-MathScience-DeepSeek-Qwen32B**, reasoning models built entirely using a synthetic data pipeline running on the **AMD ROCm™ stack** and **AMD Instinct™ MI325 GPUs**.
29
-
30
- By prioritizing data difficulty along with quantity, we demonstrate that high-difficulty synthetic data can elevate prior-generation models to match or exceed modern proprietary models. `SAND-Math-Qwen2.5-32B` is fine-tuned from **Qwen2.5-32B-Instruct** on just **14k synthetic math samples**, achieving strong reasoning capabilities with minimal data outperforming other data distillation and post training approaches. `SAND-MathScience-DeepSeek-Qwen32B` is fine-tuned from **DeepSeek-R1-Distill-Qwen-32B** on a compact dataset of **27k samples** (15k Math + 12k Science), achieving a generational leap in performance that rivals **Qwen3-32B**.
31
-
32
- We are releasing the models, datasets, and code to empower the community to build their own state-of-the-art reasoning models using AMD hardware.
33
-
34
- ## 📊 Benchmark Results
35
-
36
- We conducted extensive experiments to validate that our pipeline yields superior results compared to models trained on significantly larger datasets.
37
-
38
- ### 1. Bridging the Generational Gap
39
- Fine-tuning the Qwen2.5-based **DeepSeek-R1-Distill-Qwen-32B** on our mixed Math/Science dataset allows it to rival and even surpass the next-generation **Qwen3-32B** on key benchmarks.
40
-
41
- | Model | AIME24 | AIME25 | MATH500 | GPQA |
42
- | :--- | :---: | :---: | :---: | :---: |
43
- | DeepSeek-Distilled-Qwen32B (Base) | 72.6 | 54.9 | 94.3 | 62.1 |
44
- | EXAONE Deep 32B | 72.1 | 65.8 | 95.8 | 66.1 |
45
- | Qwen3-32B (Thinking mode) | 81.4 | 72.9 | **97.0** | 68.4 |
46
- | **SAND-MathScience-DeepSeek-Qwen32B (Ours)** | **83.85** | **78.33** | 93.85 | **68.72** |
47
-
48
- ### 2. Efficiency: Unlocking Reasoning with Less Data
49
- Using only **14k synthetic math samples** and standard SFT (no RL), our approach outperforms models trained on datasets 5x to 50x larger.
50
-
51
- | Model | Data Size | AIME24 | AIME25 | MATH500 | GPQA |
52
- | :--- | :--- | :---: | :---: | :---: | :---: |
53
- | Qwen2.5-32B-Instruct (Base) | - | 16.7 | 13.3 | 83.4 | 53.5 |
54
- | DeepSeek-R1-Distill-Qwen-32B | 800k | 72.6 | 54.9 | 94.3 | 62.1 |
55
- | Light-R1-32B | 79k | 73.0 | 64.3 | 93.3 | 60.6 |
56
- | OpenThinker-32B | 114k | 66.0 | 53.3 | 89.4 | 57.6 |
57
- | **SAND-Math-Qwen2.5-32B (Ours)** | **14k** | **74.01** | **68.18** | **92.05** | **60.8** |
58
-
59
- ---
60
-
61
- ## ⚙️ The Synthetic Data Pipeline
62
-
63
- Our results are powered by a 4-stage automated pipeline running on AMD hardware that prioritizes **difficulty and novelty** over volume. Unlike datasets that recycle easy problems, our pipeline leverages a Teacher Model (`GPT-OSS120b`) to generate, validate, and systematically "hike" the difficulty of reasoning problems.
64
-
65
- ![Pipeline Overview](PipelineSimple.png)
66
-
67
- ### Pipeline Stages
68
-
69
- 1. **Stage 1: QA Generation & Consistency** 🛠️
70
- - Generates novel problems from scratch
71
- - Enforces correctness by requiring the teacher to generate multiple independent solution paths
72
- - Only questions where all answers align are kept
73
-
74
- 2. **Stage 2: De-duplication & Decontamination** 🧹
75
- - Removes internal duplicates via embedding similarity
76
- - **Crucial Step:** Scans against known test sets (AIME, MATH, GPQA) to ensure zero contamination
77
-
78
- 3. **Stage 3: Difficulty Hiking** 🏔️
79
- - Moderately challenging questions are rewritten by the teacher model
80
- - Introduces deeper reasoning chains, added constraints, or cross-domain logic
81
- - Systematically elevates complexity
82
- - Configurable step primarily used when initial generation yields insufficient volume of high-difficulty samples
83
-
84
- ---
85
-
86
- ## 🚀 Quick Start
87
-
88
- ### Python Inference (Transformers)
89
-
90
- ```python
91
- from transformers import AutoModelForCausalLM, AutoTokenizer
92
-
93
- model_name = "amd/SAND-Math-Qwen2.5-32B"
94
-
95
- model = AutoModelForCausalLM.from_pretrained(
96
- model_name,
97
- torch_dtype="auto",
98
- device_map="auto"
99
- )
100
- tokenizer = AutoTokenizer.from_pretrained(model_name)
101
-
102
- # Example prompt
103
- prompt = "Find the number of pairs of positive integers $(m, n)$ such that $m^2 + n < 22$ and $n^2 + m < 22$."
104
- messages = [
105
- {"role": "user", "content": prompt}
106
- ]
107
- text = tokenizer.apply_chat_template(
108
- messages,
109
- tokenize=False,
110
- add_generation_prompt=True
111
- )
112
- model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
113
-
114
- generated_ids = model.generate(
115
- **model_inputs,
116
- max_new_tokens=4096,
117
- temperature=0.7, # Recommended temperature
118
- do_sample=True
119
- )
120
- generated_ids = [
121
- output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
122
- ]
123
-
124
- response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
125
- print("Response:", response)
126
- ```
127
-
128
- ### Serving (vLLM & SGLang)
129
-
130
- You can easily serve this model as an OpenAI-compatible API endpoint.
131
-
132
- **Using SGLang:**
133
- ```bash
134
- python -m sglang.launch_server --model-path amd/SAND-Math-Qwen2.5-32B --max-model-len 32768
135
- ```
136
-
137
- **Using vLLM:**
138
- ```bash
139
- vllm serve amd/SAND-Math-Qwen2.5-32B --max-model-len 32768
140
- ```
141
-
142
- ---
143
-
144
- ## 💡 Usage Recommendations
145
-
146
- To replicate our performance benchmarks and achieve the best reasoning results, we strongly recommend the following configurations:
147
-
148
- * **Temperature:** Set `temperature=0.7`. **DO NOT use greedy decoding**, as it can lead to performance degradation and repetitive loops.
149
- * **Prompting:** For mathematical problems, include a directive to enforce structure:
150
- > "Please reason step by step, and put your final answer within \boxed{}."
151
- * **Context Length:** We recommend allowing an output length of **32,768 tokens**. This ensures the model has sufficient space for long Chain-of-Thought (CoT) generation.
152
- * **Thinking Token:** It is recommended to enforce the model to initiate its response with the `<think>\n` token to trigger the reasoning mode effectively.
153
- * **Evaluation:** When benchmarking, conduct multiple passes (Pass@K) and average the results for stability.
154
-
155
- ---
156
-
157
- ## 📜 License
158
-
159
- This project is licensed under the **Open RAIL-MSD** license. This is an open, royalty-free license that permits commercial use, modification, and distribution of the dataset, models, and source code.
160
-
161
- The license includes standard use-based restrictions to prevent harmful applications (e.g., illegal activities, generating harmful content, high-risk applications). These restrictions are designed to promote responsible AI development while keeping the license permissive for legitimate use cases.
162
-
163
- For full license terms and conditions, please see the [LICENSE](https://github.com/AMD-AGI/sand-pipeline/blob/main/LICENSE.txt) file.
164
-
165
- ---
166
-
167
- ## Citation
168
-
169
- If you use this model, dataset, or pipeline in your research, please cite our work:
170
-
171
- ```bibtex
172
- @misc{manem025sandmathusingllmsgenerate,
173
- title={SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers},
174
- author={Chaitanya Manem and Pratik Prabhanjan Brahma and Prakamya Mishra and Zicheng Liu and Emad Barsoum},
175
- year={2025},
176
- eprint={2507.20527},
177
- archivePrefix={arXiv},
178
- primaryClass={cs.CL},
179
- url={https://arxiv.org/abs/2507.20527},
180
- }
181
- ```
 
1
+ ---
2
+ license: other
3
+ license_link: LICENSE
4
+ library_name: transformers
5
+ pipeline_tag: text-generation
6
+ datasets:
7
+ - amd/SAND-Post-Training-Dataset
8
+
9
+ language:
10
+ - en
11
+ base_model:
12
+ - Qwen/Qwen2.5-32B-Instruct
13
+ ---
14
+
15
+ # SAND-Reasoning: Best-in-class Large Reasoning Model Built with Synthetic Data only using AMD GPUs
16
+
17
+ <div align="center">
18
+
19
+ | [![Paper](https://img.shields.io/badge/ArXiv-2507.20527-B31B1B.svg)](https://arxiv.org/pdf/2507.20527) | [![Hugging Face Dataset](https://img.shields.io/badge/🤗%20Hugging%20Face-Dataset-green)](https://huggingface.co/datasets/amd/SAND-Post-Training-Dataset) | [![GitHub](https://img.shields.io/badge/GitHub-Repository-black)](https://github.com/AMD-AGI/sand-pipeline) | [![Blog Post](https://img.shields.io/badge/Blog%20Post-Read%20More-blue)](https://rocm.blogs.amd.com/artificial-intelligence/sand-math/README.html) |
20
+ | :---: | :---: | :---: | :---: |
21
+ </div>
22
+
23
+ ## Model Summary
24
+
25
+ We introduce **SAND-Math-Qwen2.5-32B** and **SAND-MathScience-DeepSeek-Qwen32B**, reasoning models built entirely using a synthetic data pipeline running on the **AMD ROCm™ stack** and **AMD Instinct™ MI325 GPUs**.
26
+
27
+ By prioritizing data difficulty along with quantity, we demonstrate that high-difficulty synthetic data can elevate prior-generation models to match or exceed modern proprietary models. `SAND-Math-Qwen2.5-32B` is fine-tuned from **Qwen2.5-32B-Instruct** on just **14k synthetic math samples**, achieving strong reasoning capabilities with minimal data outperforming other data distillation and post training approaches. `SAND-MathScience-DeepSeek-Qwen32B` is fine-tuned from **DeepSeek-R1-Distill-Qwen-32B** on a compact dataset of **27k samples** (15k Math + 12k Science), achieving a generational leap in performance that rivals **Qwen3-32B**.
28
+
29
+ We are releasing the models, datasets, and code to empower the community to build their own state-of-the-art reasoning models using AMD hardware.
30
+
31
+ ## 📊 Benchmark Results
32
+
33
+ We conducted extensive experiments to validate that our pipeline yields superior results compared to models trained on significantly larger datasets.
34
+
35
+ ### 1. Bridging the Generational Gap
36
+ Fine-tuning the Qwen2.5-based **DeepSeek-R1-Distill-Qwen-32B** on our mixed Math/Science dataset allows it to rival and even surpass the next-generation **Qwen3-32B** on key benchmarks.
37
+
38
+ | Model | AIME24 | AIME25 | MATH500 | GPQA |
39
+ | :--- | :---: | :---: | :---: | :---: |
40
+ | DeepSeek-Distilled-Qwen32B (Base) | 72.6 | 54.9 | 94.3 | 62.1 |
41
+ | EXAONE Deep 32B | 72.1 | 65.8 | 95.8 | 66.1 |
42
+ | Qwen3-32B (Thinking mode) | 81.4 | 72.9 | **97.0** | 68.4 |
43
+ | **SAND-MathScience-DeepSeek-Qwen32B (Ours)** | **83.85** | **78.33** | 93.85 | **68.72** |
44
+
45
+ ### 2. Efficiency: Unlocking Reasoning with Less Data
46
+ Using only **14k synthetic math samples** and standard SFT (no RL), our approach outperforms models trained on datasets 5x to 50x larger.
47
+
48
+ | Model | Data Size | AIME24 | AIME25 | MATH500 | GPQA |
49
+ | :--- | :--- | :---: | :---: | :---: | :---: |
50
+ | Qwen2.5-32B-Instruct (Base) | - | 16.7 | 13.3 | 83.4 | 53.5 |
51
+ | DeepSeek-R1-Distill-Qwen-32B | 800k | 72.6 | 54.9 | 94.3 | 62.1 |
52
+ | Light-R1-32B | 79k | 73.0 | 64.3 | 93.3 | 60.6 |
53
+ | OpenThinker-32B | 114k | 66.0 | 53.3 | 89.4 | 57.6 |
54
+ | **SAND-Math-Qwen2.5-32B (Ours)** | **14k** | **74.01** | **68.18** | **92.05** | **60.8** |
55
+
56
+ ---
57
+
58
+ ## ⚙️ The Synthetic Data Pipeline
59
+
60
+ Our results are powered by a 4-stage automated pipeline running on AMD hardware that prioritizes **difficulty and novelty** over volume. Unlike datasets that recycle easy problems, our pipeline leverages a Teacher Model (`GPT-OSS120b`) to generate, validate, and systematically "hike" the difficulty of reasoning problems.
61
+
62
+ ![Pipeline Overview](PipelineSimple.png)
63
+
64
+ ### Pipeline Stages
65
+
66
+ 1. **Stage 1: QA Generation & Consistency** 🛠️
67
+ - Generates novel problems from scratch
68
+ - Enforces correctness by requiring the teacher to generate multiple independent solution paths
69
+ - Only questions where all answers align are kept
70
+
71
+ 2. **Stage 2: De-duplication & Decontamination** 🧹
72
+ - Removes internal duplicates via embedding similarity
73
+ - **Crucial Step:** Scans against known test sets (AIME, MATH, GPQA) to ensure zero contamination
74
+
75
+ 3. **Stage 3: Difficulty Hiking** 🏔️
76
+ - Moderately challenging questions are rewritten by the teacher model
77
+ - Introduces deeper reasoning chains, added constraints, or cross-domain logic
78
+ - Systematically elevates complexity
79
+ - Configurable step primarily used when initial generation yields insufficient volume of high-difficulty samples
80
+
81
+ ---
82
+
83
+ ## 🚀 Quick Start
84
+
85
+ ### Python Inference (Transformers)
86
+
87
+ ```python
88
+ from transformers import AutoModelForCausalLM, AutoTokenizer
89
+
90
+ model_name = "amd/SAND-Math-Qwen2.5-32B"
91
+
92
+ model = AutoModelForCausalLM.from_pretrained(
93
+ model_name,
94
+ torch_dtype="auto",
95
+ device_map="auto"
96
+ )
97
+ tokenizer = AutoTokenizer.from_pretrained(model_name)
98
+
99
+ # Example prompt
100
+ prompt = "Find the number of pairs of positive integers $(m, n)$ such that $m^2 + n < 22$ and $n^2 + m < 22$."
101
+ messages = [
102
+ {"role": "user", "content": prompt}
103
+ ]
104
+ text = tokenizer.apply_chat_template(
105
+ messages,
106
+ tokenize=False,
107
+ add_generation_prompt=True
108
+ )
109
+ model_inputs = tokenizer([text], return_tensors="pt").to(model.device)
110
+
111
+ generated_ids = model.generate(
112
+ **model_inputs,
113
+ max_new_tokens=4096,
114
+ temperature=0.7, # Recommended temperature
115
+ do_sample=True
116
+ )
117
+ generated_ids = [
118
+ output_ids[len(input_ids):] for input_ids, output_ids in zip(model_inputs.input_ids, generated_ids)
119
+ ]
120
+
121
+ response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
122
+ print("Response:", response)
123
+ ```
124
+
125
+ ### Serving (vLLM & SGLang)
126
+
127
+ You can easily serve this model as an OpenAI-compatible API endpoint.
128
+
129
+ **Using SGLang:**
130
+ ```bash
131
+ python -m sglang.launch_server --model-path amd/SAND-Math-Qwen2.5-32B --max-model-len 32768
132
+ ```
133
+
134
+ **Using vLLM:**
135
+ ```bash
136
+ vllm serve amd/SAND-Math-Qwen2.5-32B --max-model-len 32768
137
+ ```
138
+
139
+ ---
140
+
141
+ ## 💡 Usage Recommendations
142
+
143
+ To replicate our performance benchmarks and achieve the best reasoning results, we strongly recommend the following configurations:
144
+
145
+ * **Temperature:** Set `temperature=0.7`. **DO NOT use greedy decoding**, as it can lead to performance degradation and repetitive loops.
146
+ * **Prompting:** For mathematical problems, include a directive to enforce structure:
147
+ > "Please reason step by step, and put your final answer within \boxed{}."
148
+ * **Context Length:** We recommend allowing an output length of **32,768 tokens**. This ensures the model has sufficient space for long Chain-of-Thought (CoT) generation.
149
+ * **Thinking Token:** It is recommended to enforce the model to initiate its response with the `<think>\n` token to trigger the reasoning mode effectively.
150
+ * **Evaluation:** When benchmarking, conduct multiple passes (Pass@K) and average the results for stability.
151
+
152
+ ---
153
+
154
+ ## 📜 License
155
+
156
+ This project is licensed under the **Open RAIL-MSD** license. This is an open, royalty-free license that permits commercial use, modification, and distribution of the dataset, models, and source code.
157
+
158
+ The license includes standard use-based restrictions to prevent harmful applications (e.g., illegal activities, generating harmful content, high-risk applications). These restrictions are designed to promote responsible AI development while keeping the license permissive for legitimate use cases.
159
+
160
+ For full license terms and conditions, please see the [LICENSE](./LICENSE) file.
161
+
162
+ ---
163
+
164
+ ## Citation
165
+
166
+ If you use this model, dataset, or pipeline in your research, please cite our work:
167
+
168
+ ```bibtex
169
+ @misc{manem025sandmathusingllmsgenerate,
170
+ title={SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers},
171
+ author={Chaitanya Manem and Pratik Prabhanjan Brahma and Prakamya Mishra and Zicheng Liu and Emad Barsoum},
172
+ year={2025},
173
+ eprint={2507.20527},
174
+ archivePrefix={arXiv},
175
+ primaryClass={cs.CL},
176
+ url={https://arxiv.org/abs/2507.20527},
177
+ }
178
+ ```