ctrltokyo committed
Commit b817fdf · Parent: 85b8a5f

Update README.md

Files changed (1): README.md (+9 -5)
@@ -6,6 +6,10 @@ tags:
 model-index:
 - name: ctrltokyo/llm_prompt_mask_fill_model
   results: []
+datasets:
+- sahil2801/code_instructions_120k
+metrics:
+- accuracy
 ---
 
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
@@ -13,7 +17,7 @@ probably proofread and complete it, then remove this comment. -->
 
 # ctrltokyo/llm_prompt_mask_fill_model
 
-This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
+This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on the [code_instructions_120k](https://huggingface.co/datasets/sahil2801/code_instructions_120k) dataset.
 It achieves the following results on the evaluation set:
 - Train Loss: 2.1215
 - Validation Loss: 1.5672
@@ -21,15 +25,15 @@ It achieves the following results on the evaluation set:
 
 ## Model description
 
-More information needed
+It's just distilbert-base-uncased with some fine tuning.
 
 ## Intended uses & limitations
 
-More information needed
+This model could be used for live autocompletion in a coding-specific chatbot.
 
 ## Training and evaluation data
 
-More information needed
+Evaluated on 5% of training data. No further evaluation performed at this point. Trained on NVIDIA V100.
 
 ## Training procedure
 
@@ -51,4 +55,4 @@ The following hyperparameters were used during training:
 - Transformers 4.31.0
 - TensorFlow 2.12.0
 - Datasets 2.14.1
-- Tokenizers 0.13.3
+- Tokenizers 0.13.3
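
Since the updated card describes a fill-mask model intended for live autocompletion, here is a minimal sketch of how it could be queried with the transformers fill-mask pipeline; the example prompt is an assumption for illustration, not something stated in the card.

```python
# A minimal sketch of querying the model with the transformers fill-mask
# pipeline. The example prompt is illustrative, not taken from the card.
from transformers import pipeline

fill_mask = pipeline(
    "fill-mask",
    model="ctrltokyo/llm_prompt_mask_fill_model",
)

# distilbert-base-uncased uses [MASK] as its mask token.
for prediction in fill_mask("Write a python function to [MASK] a list."):
    print(f"{prediction['token_str']!r}: {prediction['score']:.3f}")
```

Each prediction is a dict with the filled-in token and its score, which is the shape a chatbot autocompletion frontend would consume.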
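The "Training and evaluation data" section says the model was evaluated on 5% of the training data. A sketch of how such a split could be produced with the datasets library follows; the seed and the use of train_test_split are assumptions, since the card does not state how the split was made.

```python
# A sketch of reproducing the 5% evaluation split described in the card.
# The seed and the use of train_test_split are assumptions; the card only
# states that 5% of the training data was held out for evaluation.
from datasets import load_dataset

dataset = load_dataset("sahil2801/code_instructions_120k", split="train")
splits = dataset.train_test_split(test_size=0.05, seed=42)
train_data, eval_data = splits["train"], splits["test"]
print(f"train: {len(train_data)} examples, eval: {len(eval_data)} examples")
```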