opencsg
/

opencsg-stable-code-3b-v1

Text Generation

Model card Files Files and versions

cistine commited on Mar 4, 2024

Commit

35bcbb5

·

verified ·

1 Parent(s): 985922b

Update README.md

Files changed (1) hide show

README.md +18 -16

README.md CHANGED Viewed

@@ -42,7 +42,8 @@ This is the repository for the base 3B version finetuned based on [stable-code-3
 | Model Size    | Base Model                                                                    |
 | --- | ----------------------------------------------------------------------------- |
-| 3B|[opencsg/Opencsg-stable-coder-3b-v1](https://huggingface.co/opencsg/opencsg-stable-code-3b-v1)
 ## Model Eval
@@ -79,20 +80,20 @@ To simplify the comparison, we chosed the Pass@1 metric for the Python language,
 ```
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer
-torch.set_default_device("cuda")
-model = AutoModelForCausalLM.from_pretrained("opencsg/opencsg-phi-2-v0.1", torch_dtype="auto", trust_remote_code=True)
-tokenizer = AutoTokenizer.from_pretrained("opencsg/opencsg-phi-2-v0.1", trust_remote_code=True)
-inputs = tokenizer('''def print_prime(n):
-   """
-   Print all primes between 1 and n
-   """''', return_tensors="pt", return_attention_mask=False)
-outputs = model.generate(**inputs, max_length=200)
-text = tokenizer.batch_decode(outputs)[0]
-print(text)
 ```
 # Training
@@ -145,7 +146,8 @@ opencsg-stable-code-3b-v1是是一系列基于stable-code-3b的通过全参数
 | 模型大小    | 基座模型                                                                    |
 | --- | ----------------------------------------------------------------------------- |
-| 3B |[opencsg/Opencsg-stable-coder-3b-v1](https://huggingface.co/opencsg/opencsg-stable-code-3b-v1)
 ## 模型评估

 | Model Size    | Base Model                                                                    |
 | --- | ----------------------------------------------------------------------------- |
+| 3B |[opencsg/Opencsg-stable-coder-3b-v1](https://huggingface.co/opencsg/opencsg-stable-code-3b-v1)|
+| opencsg-phi-2-v0.1 | [opencsg/Opencsg-phi-2-v0.1](https://huggingface.co/opencsg/opencsg-phi-2-v0.1)    |
 ## Model Eval
 ```
 import torch
 from transformers import AutoModelForCausalLM, AutoTokenizer
+tokenizer = AutoTokenizer.from_pretrained("opencsg/opencsg-stable-coder-3b-v1")
+model = AutoModelForCausalLM.from_pretrained(
+  "opencsg/opencsg-stable-coder-3b-v1",
+  torch_dtype="auto",
+)
+model.cuda()
+inputs = tokenizer("import torch\nimport torch.nn as nn", return_tensors="pt").to(model.device)
+tokens = model.generate(
+  **inputs,
+  max_new_tokens=48,
+  temperature=0.2,
+  do_sample=True,
+)
+print(tokenizer.decode(tokens[0], skip_special_tokens=True))
 ```
 # Training
 | 模型大小    | 基座模型                                                                    |
 | --- | ----------------------------------------------------------------------------- |
+| 3B|[opencsg/Opencsg-stable-coder-3b-v1](https://huggingface.co/opencsg/opencsg-stable-code-3b-v1)|
+| opencsg-phi-2-v0.1 | [opencsg/Opencsg-phi-2-v0.1](https://huggingface.co/opencsg/opencsg-phi-2-v0.1)    |
 ## 模型评估