DLingo
/

qwen2-2b-instruct-trl-sft-mrg

Generated from Trainer

Model card Files Files and versions Community

qwen2-2b-instruct-trl-sft-mrg / README.md

DLingo's picture

Model save

4037c2b verified 17 days ago

|

history blame contribute delete

3.09 kB

	---
	base_model: Qwen/Qwen2-VL-2B-Instruct
	library_name: peft
	license: apache-2.0
	tags:
	- trl
	- sft
	- generated_from_trainer
	model-index:
	- name: qwen2-2b-instruct-trl-sft-mrg
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# qwen2-2b-instruct-trl-sft-mrg

	This model is a fine-tuned version of [Qwen/Qwen2-VL-2B-Instruct](https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct) on an unknown dataset.
	It achieves the following results on the evaluation set:
	- Loss: 1.2568

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 0.0001
	- train_batch_size: 4
	- eval_batch_size: 4
	- seed: 42
	- gradient_accumulation_steps: 8
	- total_train_batch_size: 32
	- optimizer: Use adamw_torch_fused with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
	- lr_scheduler_type: cosine
	- lr_scheduler_warmup_ratio: 0.1
	- num_epochs: 15

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \|
	\|:-------------:\|:-------:\|:----:\|:---------------:\|
	\| 3.0236 \| 0.4942 \| 32 \| 2.9521 \|
	\| 2.6642 \| 0.9884 \| 64 \| 2.4341 \|
	\| 2.156 \| 1.4846 \| 96 \| 1.9677 \|
	\| 1.9011 \| 1.9788 \| 128 \| 1.7312 \|
	\| 1.6955 \| 2.4749 \| 160 \| 1.6093 \|
	\| 1.5552 \| 2.9691 \| 192 \| 1.5437 \|
	\| 1.5361 \| 3.4653 \| 224 \| 1.4991 \|
	\| 1.4831 \| 3.9595 \| 256 \| 1.4554 \|
	\| 1.5036 \| 4.4556 \| 288 \| 1.4261 \|
	\| 1.3815 \| 4.9498 \| 320 \| 1.3991 \|
	\| 1.3762 \| 5.4459 \| 352 \| 1.3760 \|
	\| 1.3636 \| 5.9402 \| 384 \| 1.3562 \|
	\| 1.2826 \| 6.4363 \| 416 \| 1.3424 \|
	\| 1.3178 \| 6.9305 \| 448 \| 1.3256 \|
	\| 1.2689 \| 7.4266 \| 480 \| 1.3123 \|
	\| 1.2163 \| 7.9208 \| 512 \| 1.3019 \|
	\| 1.284 \| 8.4170 \| 544 \| 1.2920 \|
	\| 1.3356 \| 8.9112 \| 576 \| 1.2862 \|
	\| 1.2359 \| 9.4073 \| 608 \| 1.2820 \|
	\| 1.2157 \| 9.9015 \| 640 \| 1.2746 \|
	\| 1.1936 \| 10.3977 \| 672 \| 1.2709 \|
	\| 1.3181 \| 10.8919 \| 704 \| 1.2659 \|
	\| 1.2266 \| 11.3880 \| 736 \| 1.2641 \|
	\| 1.213 \| 11.8822 \| 768 \| 1.2605 \|
	\| 1.1997 \| 12.3784 \| 800 \| 1.2603 \|
	\| 1.2584 \| 12.8726 \| 832 \| 1.2577 \|
	\| 1.2547 \| 13.3687 \| 864 \| 1.2576 \|
	\| 1.2544 \| 13.8629 \| 896 \| 1.2574 \|
	\| 1.203 \| 14.3591 \| 928 \| 1.2569 \|
	\| 1.1467 \| 14.8533 \| 960 \| 1.2568 \|


	### Framework versions

	- PEFT 0.11.1
	- Transformers 4.46.3
	- Pytorch 2.3.1+cu121
	- Datasets 3.0.1
	- Tokenizers 0.20.0