HuggingAlex1247
/

gbert-large-germaner

Token Classification

Model card Files Files and versions

Metrics Training metrics Community

HuggingAlex1247 commited on Sep 9, 2022

Commit

f3ef40f

·

1 Parent(s): 36e3091

Upload README.md with huggingface_hub

Files changed (1) hide show

README.md +46 -25

README.md CHANGED Viewed

@@ -1,22 +1,50 @@
 ---
 license: mit
-tags:
-- generated_from_keras_callback
 model-index:
-- name: HuggingAlex1247/gbert-large-germaner
-  results: []
 ---
-<!-- This model card has been generated automatically according to the information Keras had access to. You should
-probably proofread and complete it, then remove this comment. -->
-# HuggingAlex1247/gbert-large-germaner
-This model is a fine-tuned version of [deepset/gbert-large](https://huggingface.co/deepset/gbert-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.0104
-- Validation Loss: 0.0965
-- Epoch: 4
 ## Model description
@@ -35,23 +63,16 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 3e-05, 'decay_steps': 13915, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
-- training_precision: float32
-### Training results
-| Train Loss | Validation Loss | Epoch |
-|:----------:|:---------------:|:-----:|
-| 0.1216     | 0.0809          | 0     |
-| 0.0608     | 0.0817          | 1     |
-| 0.0388     | 0.0837          | 2     |
-| 0.0218     | 0.0883          | 3     |
-| 0.0104     | 0.0965          | 4     |
 ### Framework versions
 - Transformers 4.21.3
-- TensorFlow 2.6.2
 - Datasets 1.18.0
 - Tokenizers 0.12.1

 ---
+language:
+- de
 license: mit
+datasets:
+- germaner
+metrics:
+- precision
+- recall
+- f1
+- accuracy
 model-index:
+- name: gbert-large-germaner
+  results:
+  - task:
+      name: Token Classification
+      type: token-classification
+    dataset:
+      name: germaner
+      type: germaner
+      args: default
+    metrics:
+    - name: precision
+      type: precision
+      value: 0.8755112474437627
+    - name: recall
+      type: recall
+      value: 0.8861578266494179
+    - name: f1
+      type: f1
+      value: 0.8808023659508808
+    - name: accuracy
+      type: accuracy
+      value: 0.9788673918458856
 ---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# gbert-large-germaner
+This model is a fine-tuned version of [deepset/gbert-large](https://huggingface.co/deepset/gbert-large) on the germaner dataset.
 It achieves the following results on the evaluation set:
+- precision: 0.8755
+- recall: 0.8862
+- f1: 0.8808
+- accuracy: 0.9789
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- num_train_epochs: 5
+- train_batch_size: 8
+- eval_batch_size: 8
+- learning_rate: 3e-05
+- weight_decay_rate: 0.01
+- num_warmup_steps: 0
+- fp16: True
 ### Framework versions
 - Transformers 4.21.3
 - Datasets 1.18.0
 - Tokenizers 0.12.1