AstroMLab
/

astrollama-3-8b-chat_summary

Text Generation

text-generation-inference

Model card Files Files and versions Community

tingyuansen commited on Nov 16, 2024

Commit

47adcb8

·

verified ·

1 Parent(s): 7f89965

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -38,7 +38,7 @@ AstroLLaMA-3-8B-Chat_Summary is a specialized chat model for astronomy, develope
   - Warmup ratio: 0.03
   - Cosine decay schedule for learning rate reduction
 - **Primary Use**: Instruction-following and chat-based interactions for astronomy-related queries
-- **Reference**: Pan et al. 2024 [Link to be added]
 ## Using the model for chat
@@ -85,7 +85,7 @@ print(f"Assistant: {response}")
 This model used the summarized content for training, which has led to improved performance compared to the AIC (Abstract, Introduction, Conclusion) version. The summarization process allows for the inclusion of more comprehensive information from each paper while maintaining a manageable token count.
-Here's a performance comparison chart based upon the astronomical benchmarking Q&A as described in [Ting et al. 2024](https://arxiv.org/abs/2407.11194), and Pan et al. 2024:
 | Model | Score (%) |
 |-------|-----------|

   - Warmup ratio: 0.03
   - Cosine decay schedule for learning rate reduction
 - **Primary Use**: Instruction-following and chat-based interactions for astronomy-related queries
+- **Reference**: [Pan et al. 2024](https://arxiv.org/abs/2409.19750)
 ## Using the model for chat
 This model used the summarized content for training, which has led to improved performance compared to the AIC (Abstract, Introduction, Conclusion) version. The summarization process allows for the inclusion of more comprehensive information from each paper while maintaining a manageable token count.
+Here's a performance comparison chart based upon the astronomical benchmarking Q&A as described in [Ting et al. 2024](https://arxiv.org/abs/2407.11194):
 | Model | Score (%) |
 |-------|-----------|