AshtonIsNotHere
/

CodeLlama_7B_nlp_pp

Text Generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions

AshtonIsNotHere commited on Sep 4, 2023

Commit

92898a6

·

1 Parent(s): 0b9c49d

Update README.md

Files changed (1) hide show

README.md +7 -8

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ datasets:
 metrics:
 - accuracy
 model-index:
-- name: codellama_CodeLlama-7b-hf_08_27_23_15_32_28
   results:
   - task:
       name: Causal Language Modeling
@@ -22,10 +22,7 @@ model-index:
       value: 0.8968056729128353
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# codellama_CodeLlama-7b-hf_08_27_23_15_32_28
 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the AshtonIsNotHere/nlp_pp_code_dataset dataset.
 It achieves the following results on the evaluation set:
@@ -34,7 +31,7 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
@@ -42,10 +39,12 @@ More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:
@@ -81,4 +80,4 @@ The following hyperparameters were used during training:
 - Transformers 4.30.2
 - Pytorch 2.0.1+cu117
 - Datasets 2.13.0
-- Tokenizers 0.13.3

 metrics:
 - accuracy
 model-index:
+- name: CodeLlama_7B_nlp_pp
   results:
   - task:
       name: Causal Language Modeling
       value: 0.8968056729128353
 ---
+# CodeLlama_7B_nlp_pp
 This model is a fine-tuned version of [codellama/CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf) on the AshtonIsNotHere/nlp_pp_code_dataset dataset.
 It achieves the following results on the evaluation set:
 ## Model description
+This model has been fine-tuned for code completion on a dataset of NLP++ code.
 ## Intended uses & limitations
 ## Training and evaluation data
+Dataset consists of a combination of scraped NLP++ code and NLP++ code examples from the [VisualText website](https://visualtext.org/help/).
 ## Training procedure
+This model is trained in a multinode, multi-gpu setup with DeepSpeed Z3. For more information on the training setup, check out the [GitHub repo](https://github.com/ashtonomy/nlp_pp_code_completion).
 ### Training hyperparameters
 The following hyperparameters were used during training:
 - Transformers 4.30.2
 - Pytorch 2.0.1+cu117
 - Datasets 2.13.0
+- Tokenizers 0.13.3