prosa-text/indobert-nusa


tisorlawan committed on Feb 22, 2024

Commit 8532443 · 1 Parent(s): 79cbaf4

fix: improve doc

Files changed (1)

README.md +4 -15


@@ -6,6 +6,7 @@ language:
 - ban
 - bug
 - id
+pretty_name: IndoBERTNusa
 tags:
 - generated_from_trainer
 datasets:
@@ -14,15 +15,15 @@ datasets:
 pipeline_tag: fill-mask
 ---
-# IndoBERT-nusa (IndoBERT Adapted for Balinese, Buginese, and Minangkabau)
+# IndoBERTNusa (IndoBERT Adapted for Balinese, Buginese, and Minangkabau)
 This repository contains a language adaptation and fine-tuning of the Indobenchmark IndoBERT language model for three specific languages: Balinese, Buginese, and Minangkabau.
-The adaptation was performed using nusa-st data.
+The adaptation was performed using [nusa-translation](https://huggingface.co/datasets/prosa-text/nusa-translation) dataset.
 ## Model Details
 - **Base Model**: [indobenchmark/indobert-large-p2](https://huggingface.co/indobenchmark/indobert-large-p2)
-- **Adaptation Data**: nusa-st
+- **Adaptation Data**:[nusa-translation](https://huggingface.co/datasets/prosa-text/nusa-translation)
 ## Performance Comparison / Benchmark
@@ -77,18 +78,6 @@ The following hyperparameters were used during training:
 The dataset is released under the terms of **CC-BY-SA 4.0**.
 By using this model, you are also bound to the respective Terms of Use and License of the dataset.
-### Citation Information
-```bibtex
-@article{purwarianti2023nusadialogue,
-  title={NusaDialogue: Dialogue Summarization and Generation for Underrepresented and Extremely Low-Resource Languages},
-  author={Purwarianti, Ayu and Adhista, Dea and Baptiso, Agung and Mahfuzh, Miftahul and Yusrina Sabila and Cahyawijaya, Samuel and Aji, Alham Fikri},
-  journal={arXiv preprint arXiv:(coming soon)},
-  url={https://huggingface.co/datasets/prosa-text/nusa-dialogue},
-  year={2023}
-}
-```
 ### Acknowledgement
 This research work is funded and supported by The Deutsche Gesellschaft für Internationale Zusammenarbeit (GIZ) GmbH and FAIR Forward - Artificial Intelligence for all. We thank Direktorat Jenderal Pendidikan Tinggi, Riset, dan Teknologi Kementerian Pendidikan, Kebudayaan, Riset, dan Teknologi (Ditjen DIKTI) for providing the computing resources for this project.
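
The README metadata in this diff declares `pipeline_tag: fill-mask`, so the model is presumably meant to be queried through a masked-token pipeline. The following is a minimal sketch of that usage, not part of the commit. It assumes the repository id `prosa-text/indobert-nusa` (taken from the page header) resolves to loadable weights and that the `transformers` package is installed. The helper names `mask_word` and `top_predictions` and the example sentence are hypothetical.

```python
def mask_word(sentence: str, target: str, mask_token: str = "[MASK]") -> str:
    """Replace the first occurrence of `target` in `sentence` with the mask token."""
    if target not in sentence:
        raise ValueError(f"{target!r} does not occur in the sentence")
    return sentence.replace(target, mask_token, 1)


def top_predictions(sentence: str, target: str,
                    model_id: str = "prosa-text/indobert-nusa",  # assumed repo id
                    top_k: int = 5):
    """Mask `target` and return (token, score) pairs proposed by the model."""
    # Imported lazily so that mask_word can be used without transformers installed.
    from transformers import pipeline

    fill = pipeline("fill-mask", model=model_id)
    # Read the mask token from the tokenizer instead of hard-coding it.
    masked = mask_word(sentence, target, fill.tokenizer.mask_token)
    return [(p["token_str"], p["score"]) for p in fill(masked, top_k=top_k)]


if __name__ == "__main__":
    # Hypothetical Indonesian example sentence; downloads the model on first run.
    for token, score in top_predictions("Saya suka makan nasi goreng.", "nasi"):
        print(f"{token}\t{score:.4f}")
```

Taking the mask token from the tokenizer keeps the sketch valid even if the checkpoint uses a token other than `[MASK]`.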