jylee420
/

gemma-2b-data-std

Text Generation

text-generation-inference

Model card Files Files and versions

jylee420 commited on Mar 12, 2024

Commit

5462782

·

verified ·

1 Parent(s): 0576947

Update README.md

Files changed (1) hide show

README.md +10 -15

README.md CHANGED Viewed

@@ -6,8 +6,7 @@ tags: []
 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
 ## Model Details
@@ -15,27 +14,23 @@ tags: []
 <!-- Provide a longer summary of what this model is. -->
-This is the model card of a 🤗 transformers model that has been pushed on the Hub. This model card has been automatically generated.
 - **Developed by:** [email protected]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
 - **Language(s) (NLP):** Korean/English
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
-### Model Sources [optional]
-<!-- Provide the basic links for the model. -->
-- **Repository:** [More Information Needed]
-- **Paper [optional]:** [More Information Needed]
-- **Demo [optional]:** [More Information Needed]
 ## Uses
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 ### Direct Use

 # Model Card for Model ID
 <!-- Provide a quick summary of what the model is/does. -->
+This model card corresponds to the 2B base version of the Gemma model.
 ## Model Details
 <!-- Provide a longer summary of what this model is. -->
+This is a model that separates terms into words and describes each separated word.
 - **Developed by:** [email protected]
 - **Language(s) (NLP):** Korean/English
 ## Uses
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+model = AutoModelForCausalLM.from_pretrained(
+    model_path,
+    vocab_size=len(tokenizer),
+    torch_dtype = torch.float16,
+    use_cache=False,
+    #attn_implementation="flash_attention_2",
+    device_map="auto")
 ### Direct Use