richardcsuwandi
/

llama2-javanese

Model card Files Files and versions Community

richardcsuwandi commited on Oct 19, 2024

Commit

118fa06

·

verified ·

1 Parent(s): d167ea8

Update README.md

Files changed (1) hide show

README.md +1 -13

README.md CHANGED Viewed

@@ -13,19 +13,7 @@ This model is a fine-tuned adaptation of [Llama-2-7b-chat-hf](https://huggingfac
 ## Training
-The model was fine-tuned on a dataset translated into Javanese using the [NLLB](https://ai.meta.com/research/no-language-left-behind/) model. This dataset includes texts from both [OASST1](https://huggingface.co/datasets/OpenAssistant/oasst1) and [OASST2](https://huggingface.co/datasets/OpenAssistant/oasst2), covering a wide range of conversational scenarios. The training process employed [TRL](https://github.com/huggingface/trl) and [PEFT](https://github.com/huggingface/peft) to facilitate efficient and rapid fine-tuning.
-The following `bitsandbytes` quantization settings were applied during training:
-- quant_method: bitsandbytes
-- load_in_8bit: False
-- load_in_4bit: True
-- llm_int8_threshold: 6.0
-- llm_int8_skip_modules: None
-- llm_int8_enable_fp32_cpu_offload: False
-- llm_int8_has_fp16_weight: False
-- bnb_4bit_quant_type: fp4
-- bnb_4bit_use_double_quant: False
-- bnb_4bit_compute_dtype: float32
 ## Usage

 ## Training
+The model was fine-tuned on a dataset translated into Javanese using the [NLLB](https://ai.meta.com/research/no-language-left-behind/) model. This dataset includes texts from both [OASST1](https://huggingface.co/datasets/OpenAssistant/oasst1) and [OASST2](https://huggingface.co/datasets/OpenAssistant/oasst2), covering a wide range of conversational scenarios. The training process employed [PEFT](https://github.com/huggingface/peft) and [TRL](https://github.com/huggingface/trl) to facilitate efficient and rapid fine-tuning.
 ## Usage