yi-01-ai
committed on
Commit
·
475e0ae
1
Parent(s):
a24d3f5
Auto Sync from git://github.com/01-ai/Yi.git/commit/5ed73fe931e7b95ad6dbd4d8091cd21b199d9bbb
README.md
CHANGED
|
@@ -87,7 +87,7 @@ pipeline_tag: text-generation
|
|
| 87 |
- [Models](#models)
|
| 88 |
- [Chat models](#chat-models)
|
| 89 |
- [Base models](#base-models)
|
| 90 |
-
- [
|
| 91 |
- [News](#news)
|
| 92 |
- [How to use Yi?](#how-to-use-yi)
|
| 93 |
- [Quick start](#quick-start)
|
|
@@ -276,11 +276,35 @@ Yi-6B-200K | • [🤗 Hugging Face](https://huggingface.co/01-ai/Yi-6B-200K)
|
|
| 276 |
|
| 277 |
- For chat and base models
|
| 278 |
|
| 279 |
-
|
| 280 |
-
|
| 281 |
-
|
| 282 |
-
|
| 283 |
-
|
| 284 |
|
| 285 |
- For chat models
|
| 286 |
|
|
|
|
| 87 |
- [Models](#models)
|
| 88 |
- [Chat models](#chat-models)
|
| 89 |
- [Base models](#base-models)
|
| 90 |
+
- [Model info](#model-info)
|
| 91 |
- [News](#news)
|
| 92 |
- [How to use Yi?](#how-to-use-yi)
|
| 93 |
- [Quick start](#quick-start)
|
|
|
|
| 276 |
|
| 277 |
- For chat and base models
|
| 278 |
|
| 279 |
+
<table>
|
| 280 |
+
<thead>
|
| 281 |
+
<tr>
|
| 282 |
+
<th>Model</th>
|
| 283 |
+
<th>Intro</th>
|
| 284 |
+
<th>Default context window</th>
|
| 285 |
+
<th>Pretrained tokens</th>
|
| 286 |
+
<th>Training Data Date</th>
|
| 287 |
+
</tr>
|
| 288 |
+
</thead>
|
| 289 |
+
<tbody><tr>
|
| 290 |
+
<td>6B series models</td>
|
| 291 |
+
<td>They are suitable for personal and academic use.</td>
|
| 292 |
+
<td rowspan="3">4K</td>
|
| 293 |
+
<td>3T</td>
|
| 294 |
+
<td rowspan="3">Up to June 2023</td>
|
| 295 |
+
</tr>
|
| 296 |
+
<tr>
|
| 297 |
+
<td>9B series models</td>
|
| 298 |
+
<td>They are the best at coding and math among the Yi series models.</td>
|
| 299 |
+
<td>Yi-9B is continuously trained based on Yi-6B, using 0.8T tokens.</td>
|
| 300 |
+
</tr>
|
| 301 |
+
<tr>
|
| 302 |
+
<td>34B series models</td>
|
| 303 |
+
<td>They are suitable for personal, academic, and commercial (particularly for small and medium-sized enterprises) purposes. It's a cost-effective solution that's affordable and equipped with emergent ability.</td>
|
| 304 |
+
<td>3T</td>
|
| 305 |
+
</tr>
|
| 306 |
+
</tbody></table>
|
| 307 |
+
|
| 308 |
|
| 309 |
- For chat models
|
| 310 |
|