---
license: cc-by-nc-3.0
datasets:
- passing2961/stark-summary
language:
- en
base_model:
- meta-llama/Llama-3.2-1B-Instruct
tags:
- conversational ai
- conversation summarization
---

# Ultron-Summarizer-1B Model Card

[🏠 Homepage](https://stark-dataset.github.io/) | [💻 Github](https://github.com/passing2961/Stark) | [📄 Arxiv](https://arxiv.org/abs/2407.03958) | [📑 PDF](https://arxiv.org/pdf/2407.03958)

## List of Provided Model Series

- **Ultron-Summarizer-Series:** [🤗 Ultron-Summarizer-1B](https://huggingface.co/passing2961/Ultron-Summarizer-1B) | [🤗 Ultron-Summarizer-3B](https://huggingface.co/passing2961/Ultron-Summarizer-3B) | [🤗 Ultron-Summarizer-8B](https://huggingface.co/passing2961/Ultron-Summarizer-8B)
- **Ultron 7B**: [🤗 Ultron-7B](https://huggingface.co/passing2961/Ultron-7B)

> 🚨 Disclaimer: All models and datasets are intended for research purposes only.

## Model Description

- **Repository:** [Code](https://github.com/passing2961/Thanos)
- **Paper:** Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model
- **Point of Contact:** [Young-Jun Lee](mailto:[email protected])

## Model Details

- **Model**: Ultron-Summarizer-1B is a fully open-source conversation summarizer that generates a summary of a given long-term conversation.
- **Date**: Ultron-Summarizer-1B was trained in 2024.
- **Training Dataset**: [Stark-Summary](https://huggingface.co/datasets/passing2961/stark-summary)
- **Architecture**: Ultron-Summarizer-1B was trained on top of [LLaMA-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct).

## How to Use

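A minimal generation sketch using the Hugging Face `transformers` library. The model card does not specify an official prompt template, so the instruction wording and the `build_summary_messages` helper below are illustrative assumptions, not the training-time format:

```python
MODEL_ID = "passing2961/Ultron-Summarizer-1B"


def build_summary_messages(dialogue: str) -> list:
    """Wrap a long-term conversation in a chat-style message list.

    NOTE: the instruction text here is an assumption for illustration;
    the exact prompt the model was trained with may differ.
    """
    return [
        {
            "role": "user",
            "content": f"Summarize the following conversation:\n\n{dialogue}",
        }
    ]


def summarize(dialogue: str, max_new_tokens: int = 256) -> str:
    # Imported here so the prompt helper above stays usable without
    # torch/transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

    # Render the chat messages with the tokenizer's chat template and
    # append the assistant generation prompt.
    input_ids = tokenizer.apply_chat_template(
        build_summary_messages(dialogue),
        add_generation_prompt=True,
        return_tensors="pt",
    ).to(model.device)

    output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, skipping the prompt.
    return tokenizer.decode(
        output[0][input_ids.shape[-1]:], skip_special_tokens=True
    )


if __name__ == "__main__":
    dialogue = "Speaker A: I finally adopted the puppy!\nSpeaker B: That's wonderful news."
    print(summarize(dialogue))
```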
## License and Recommendations

🚨 Ultron-Summarizer-1B is intended to be used for research purposes only.

## Acknowledgement

This work was supported by a grant from the KAIST-KT joint research project through AI Tech Lab, Institute of Convergence Technology, funded by KT [Project No. G01230605, Development of Task-oriented Persona-based Dialogue Generation Combining Multi-modal Interaction and Knowledge Modeling].

## Citation

If you find the resources in this repository useful, please cite our work:

```
@article{lee2024stark,
  title={Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge},
  author={Lee, Young-Jun and Lee, Dokyong and Youn, Junyoung and Oh, Kyeongjin and Ko, Byungsoo and Hyeon, Jonghwan and Choi, Ho-Jin},
  journal={arXiv preprint arXiv:2407.03958},
  year={2024}
}
```