arnomatic committed on
Commit f84673d · verified · 1 Parent(s): 8d3e75f

Upload README.md

Files changed (1)
  1. README.md +30 -0
README.md CHANGED
@@ -1,3 +1,33 @@
+ ---
+ language:
+ - de
+ license: mit
+ library_name: transformers
+ tags:
+ - text-generation
+ - pytorch
+ - causal-lm
+ - mixture-of-experts
+ - moe
+ - german
+ - gpt
+ - language-model
+ base_model: []
+ pipeline_tag: text-generation
+ model-index:
+ - name: german-moe-gpt-v8-pretrained
+   results: []
+ datasets:
+ - wikipedia
+ widget:
+ - text: "Die Hauptstadt von Deutschland ist"
+   example_title: "German Capital"
+ - text: "Künstliche Intelligenz ist"
+   example_title: "AI Definition"
+ - text: "Es war einmal"
+   example_title: "Story Beginning"
+ ---
+
  # German MoE GPT v8 - OPUS EDITION
 
  A research-grade language model with state-of-the-art Mixture-of-Experts (MoE) architecture, trained on consumer hardware (RTX 4090). This implementation follows best practices from recent MoE research (ST-MoE, Switch Transformer) while maintaining full cross-platform compatibility.
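
The `pipeline_tag: text-generation` and `library_name: transformers` fields in the added front matter suggest the card is intended to be used through the `transformers` text-generation pipeline, with the widget prompts serving as example inputs. Below is a minimal usage sketch, assuming the repository id is `arnomatic/german-moe-gpt-v8-pretrained` (inferred from the committer name and the `model-index` entry, not stated in the diff) and that the checkpoint loads with the standard auto classes.

```python
# Hypothetical usage sketch for the model card added above.
# The repo id "arnomatic/german-moe-gpt-v8-pretrained" and a standard
# transformers-compatible checkpoint are assumptions, not confirmed by the diff.
from transformers import pipeline

generator = pipeline(
    "text-generation",                               # matches pipeline_tag in the front matter
    model="arnomatic/german-moe-gpt-v8-pretrained",  # assumed repo id
    trust_remote_code=True,                          # custom MoE architectures often require this
)

# First widget prompt from the front matter ("German Capital")
output = generator("Die Hauptstadt von Deutschland ist", max_new_tokens=40)
print(output[0]["generated_text"])
```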