---
license: cc-by-nc-4.0
language:
- en
library_name: transformers
tags:
- RLHF
- Nexusflow
- Athene
- Chat Model
---
# Athene-Llama3-70B

Blog: https://nexusflow.ai/blogs/introducing-athene-70b

- **Developed by:** The Nexusflow Team (Evan Frick\*, Peter Jin\*, Tianle Li\*, Karthik Ganesan, Jian Zhang, Jiantao Jiao and Banghua Zhu).
- **Model type:** Chat Model
- **Finetuned from model:** [Llama-3-70B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct).

We introduce Athene-Llama3-70B, an open-weights LLM trained through RLHF on top of Llama-3-70B-Instruct. Athene-70B achieves a high score on Arena-Hard-Auto, a proxy benchmark for Chatbot Arena.

| Model                        | Arena-Hard |
|------------------------------|------------|
| GPT-4o (Proprietary)         | 79.2%      |
| Claude-3.5 (Proprietary)     | 79.3%      |
| **Athene-70B** (Open)        | 77.8%      |
| Gemini-Pro-1.5 (Proprietary) | 72.0%      |
| Gemma-2-27B (Open)           | 57.0%      |
| Llama-3-70B (Open)           | 46.6%      |

## Usage
Athene-70B uses the same chat template as Llama-3-70B-Instruct. Below is a simple usage example with the Transformers library.

```python
import transformers
import torch

model_id = "Nexusflow/Athene-70B"

pipeline = transformers.pipeline(
    "text-generation",
    model=model_id,
    model_kwargs={"torch_dtype": torch.bfloat16},
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are Athene Noctua, you can only speak with owl sounds. Whoooo whooo."},
    {"role": "user", "content": "Whooo are you?"},
]

terminators = [
    pipeline.tokenizer.eos_token_id,
    pipeline.tokenizer.convert_tokens_to_ids("<|eot_id|>"),
]

outputs = pipeline(
    messages,
    max_new_tokens=256,
    eos_token_id=terminators,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
print(outputs[0]["generated_text"][-1])
```
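
For reference, the Llama-3-Instruct chat template (which the pipeline applies automatically via the tokenizer) lays the conversation out with special header tokens. Below is a minimal illustrative sketch of that prompt layout; the helper function name is ours, not part of the Transformers API, and in practice you should rely on `tokenizer.apply_chat_template` rather than formatting prompts by hand.

```python
def format_llama3_prompt(messages):
    """Render a list of {"role", "content"} dicts into the Llama-3 prompt string."""
    prompt = "<|begin_of_text|>"
    for m in messages:
        prompt += (
            f"<|start_header_id|>{m['role']}<|end_header_id|>\n\n"
            f"{m['content']}<|eot_id|>"
        )
    # A trailing assistant header cues the model to generate its reply.
    prompt += "<|start_header_id|>assistant<|end_header_id|>\n\n"
    return prompt

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]
print(format_llama3_prompt(messages))
```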

## Acknowledgment
We would like to thank the [LMSYS Organization](https://lmsys.org/) for their support of the online demo and private testing. We would also like to thank Meta AI and the open-source community for their efforts in providing the datasets and base models.

## Citation
```
@misc{Athene2024,
  title = {Athene-70B: Redefining the Boundaries of Post-Training for Open Models},
  url = {},
  author = {Frick, Evan and Jin, Peter and Li, Tianle and Ganesan, Karthik and Zhang, Jian and Jiao, Jiantao and Zhu, Banghua},
  month = {July},
  year = {2024}
}
```