Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,7 @@ pipeline_tag: text2text-generation
|
|
7 |
tags:
|
8 |
- text-generation-inference
|
9 |
---
|
10 |
-
#
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
```FrancescoPeriti/Llama3Dictionary``` is a fine-tuned version of ```meta-llama/Meta-Llama-3-8B-Instruct```.
|
@@ -52,8 +52,8 @@ ft_model_name = "FrancescoPeriti/Llama3Dictionary" # fine-tuned model
|
|
52 |
|
53 |
# load models
|
54 |
chat_model = AutoModelForCausalLM.from_pretrained(model_name, device_map='auto')
|
55 |
-
|
56 |
-
|
57 |
|
58 |
# load tokenizer
|
59 |
tokenizer = AutoTokenizer.from_pretrained(
|
@@ -127,7 +127,7 @@ with torch.no_grad():
|
|
127 |
for k in ['input_ids', 'attention_mask']:
|
128 |
model_input[k] = torch.tensor(batch[k]).to('cuda')
|
129 |
|
130 |
-
output_ids =
|
131 |
max_length = max_length * batch_size,
|
132 |
forced_eos_token_id = eos_tokens,
|
133 |
max_time = max_time * batch_size,
|
|
|
7 |
tags:
|
8 |
- text-generation-inference
|
9 |
---
|
10 |
+
# Llama3Dictionary
|
11 |
|
12 |
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
```FrancescoPeriti/Llama3Dictionary``` is a fine-tuned version of ```meta-llama/Meta-Llama-3-8B-Instruct```.
|
|
|
52 |
|
53 |
# load models
|
54 |
chat_model = AutoModelForCausalLM.from_pretrained(model_name, device_map='auto')
|
55 |
+
lama3dictionary = PeftModel.from_pretrained(chat_model, ft_model_name)
|
56 |
+
lama3dictionary.eval()
|
57 |
|
58 |
# load tokenizer
|
59 |
tokenizer = AutoTokenizer.from_pretrained(
|
|
|
127 |
for k in ['input_ids', 'attention_mask']:
|
128 |
model_input[k] = torch.tensor(batch[k]).to('cuda')
|
129 |
|
130 |
+
output_ids = lama3dictionary.generate(**model_input,
|
131 |
max_length = max_length * batch_size,
|
132 |
forced_eos_token_id = eos_tokens,
|
133 |
max_time = max_time * batch_size,
|