Add/update the quantized ONNX model files and README.md for Transformers.js v3
Browse files## Applied Quantizations
### β
Based on `model.onnx` *with* slimming
β³ β
`int8`: `model_int8.onnx` (added)
β³ β
`uint8`: `model_uint8.onnx` (added)
β³ β
`q4`: `model_q4.onnx` (added)
β³ β
`q4f16`: `model_q4f16.onnx` (added)
β³ β
`bnb4`: `model_bnb4.onnx` (added)
### β
Based on `model.onnx` *with* slimming
β³ β
`int8`: `model_int8.onnx` (added)
β³ β
`uint8`: `model_uint8.onnx` (added)
β³ β
`q4`: `model_q4.onnx` (added)
β³ β
`q4f16`: `model_q4f16.onnx` (added)
β³ β
`bnb4`: `model_bnb4.onnx` (added)
- README.md +3 -3
- onnx/model_bnb4.onnx +3 -0
- onnx/model_int8.onnx +3 -0
- onnx/model_q4.onnx +3 -0
- onnx/model_q4f16.onnx +3 -0
- onnx/model_uint8.onnx +3 -0
README.md
CHANGED
|
@@ -8,15 +8,15 @@ https://huggingface.co/google/electra-base-discriminator with ONNX weights to be
|
|
| 8 |
|
| 9 |
## Usage (Transformers.js)
|
| 10 |
|
| 11 |
-
If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@
|
| 12 |
```bash
|
| 13 |
-
npm i @
|
| 14 |
```
|
| 15 |
|
| 16 |
**Example:** Feature extraction w/ `Xenova/electra-base-discriminator`.
|
| 17 |
|
| 18 |
```javascript
|
| 19 |
-
import { pipeline } from '@
|
| 20 |
|
| 21 |
// Create feature extraction pipeline
|
| 22 |
const extractor = await pipeline('feature-extraction', 'Xenova/electra-base-discriminator');
|
|
|
|
| 8 |
|
| 9 |
## Usage (Transformers.js)
|
| 10 |
|
| 11 |
+
If you haven't already, you can install the [Transformers.js](https://huggingface.co/docs/transformers.js) JavaScript library from [NPM](https://www.npmjs.com/package/@huggingface/transformers) using:
|
| 12 |
```bash
|
| 13 |
+
npm i @huggingface/transformers
|
| 14 |
```
|
| 15 |
|
| 16 |
**Example:** Feature extraction w/ `Xenova/electra-base-discriminator`.
|
| 17 |
|
| 18 |
```javascript
|
| 19 |
+
import { pipeline } from '@huggingface/transformers';
|
| 20 |
|
| 21 |
// Create feature extraction pipeline
|
| 22 |
const extractor = await pipeline('feature-extraction', 'Xenova/electra-base-discriminator');
|
onnx/model_bnb4.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ab7f4520015d00f47f1ec37a204831703c7f8c08bed16a02c9c81ee9e69ad596
|
| 3 |
+
size 143893455
|
onnx/model_int8.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:fec7cfe97ca7a7e82635deb4f35095a34d28065f66d65faa7acb21ea73cacd0b
|
| 3 |
+
size 109622387
|
onnx/model_q4.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3b092edb34d1e50ca0b868176c1135cd1c7b000477e82216c7bd2bdb75ed0315
|
| 3 |
+
size 149201343
|
onnx/model_q4f16.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:1f8b285aca72ea6c319184f032c51eb23561666305d95eaa08328d0ca852a14c
|
| 3 |
+
size 95979116
|
onnx/model_uint8.onnx
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:d60554cafc252df44e2653fdba6faec3b93b9d120c03cc3d7f0ccc4778bcf39d
|
| 3 |
+
size 109622432
|