lora-midm-7b-nsmc-understanding

This model is a fine-tuned version of KT-AI/midm-bitext-S-7B-inst-v1 on the nsmc dataset.

Model description

A LoRA adapter for KT-AI/midm-bitext-S-7B-inst-v1, fine-tuned for binary sentiment classification (positive/negative) of Korean movie reviews from the NSMC dataset.

Intended uses & limitations

Intended for sentiment classification of Korean movie-review text. The model was trained on only 2,000 NSMC examples and evaluated on 1,000, so performance on other domains or larger test sets is untested.

Training and evaluation data

The top 2,000 examples of the NSMC train split were used for training, and the top 1,000 examples of the test split for evaluation (see the Modifications section below).

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 1
  • eval_batch_size: 1
  • seed: 42
  • gradient_accumulation_steps: 2
  • total_train_batch_size: 2
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: cosine
  • lr_scheduler_warmup_ratio: 0.03
  • training_steps: 1500
  • mixed_precision_training: Native AMP
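
As a rough illustration, these values map onto `transformers` `TrainingArguments` as in the sketch below; the `output_dir` is a hypothetical name, and `optim`/`fp16` are inferred from the optimizer and Native AMP entries above rather than taken from the original script.

```python
from transformers import TrainingArguments

# Hedged sketch: TrainingArguments mirroring the hyperparameters listed above.
training_args = TrainingArguments(
    output_dir="lora-midm-7b-nsmc-understanding",  # hypothetical output path
    learning_rate=1e-4,
    per_device_train_batch_size=1,
    per_device_eval_batch_size=1,
    gradient_accumulation_steps=2,   # total_train_batch_size = 1 * 2 = 2
    max_steps=1500,                  # training_steps
    lr_scheduler_type="cosine",
    warmup_ratio=0.03,
    optim="adamw_torch",             # Adam, betas=(0.9, 0.999), eps=1e-8 (library defaults)
    fp16=True,                       # Native AMP mixed precision
    seed=42,
)
```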

Training results

See the Test accuracy section below for confusion matrices and accuracy on the NSMC test subset.

Framework versions

  • Transformers 4.35.2
  • Pytorch 2.1.0+cu118
  • Datasets 2.15.0
  • Tokenizers 0.15.0

Test accuracy

kt-ai-midm

  • Confusion Matrix:

    |          | Predicted 0 | Predicted 1 |
    |----------|-------------|-------------|
    | Actual 0 | 443         | 49          |
    | Actual 1 | 46          | 462         |

  • Accuracy: 0.905

llama-2

  • Confusion Matrix:

    |          | Predicted 0 | Predicted 1 |
    |----------|-------------|-------------|
    | Actual 0 | 450         | 42          |
    | Actual 1 | 56          | 452         |

  • Accuracy: 0.902

Modifications

  • Data loading (sketched after this list)
    • prepare_sample_text(): changed the system message and set the prompt format
    • create_datasets(): selected the top 2,000 examples of the train split
  • Loading the model for fine-tuning (sketched after this list)
    • script_args: set the dataset name to nsmc and the model name to KT-AI/midm-bitext-S-7B-inst-v1
    • max_steps: set the maximum number of training steps to 1,500 (raising it 300 -> 1000 -> 1500 yielded higher accuracy)
    • save: parameters for saving checkpoints
  • Pushed to the Hugging Face Hub with push_to_hub
  • Inference test
    • modified the prompt template and changed the system message
    • valid_dataset: selected the top 1,000 examples of the test split
  • Testing after loading the fine-tuned model
    • eval_dic: collects the fine-tuned model's outputs on valid_dataset
  • Accuracy (sketched after this list)
    • accuracy analysis using the model's predictions on valid_dataset and the true_labels
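
The data-loading step might look roughly like the sketch below. `prepare_sample_text()` and `create_datasets()` are the function names from the list above; the English prompt wording is an illustrative assumption, not the original template.

```python
from datasets import load_dataset

def prepare_sample_text(example):
    # System message and prompt format for NSMC sentiment classification.
    # The exact wording here is an assumption; the original template differs.
    return (
        "Classify the sentiment of the following movie review as positive or negative.\n"
        f"Review: {example['document']}\n"
        f"Sentiment: {example['label']}"
    )

def create_datasets():
    dataset = load_dataset("nsmc")
    train_data = dataset["train"].select(range(2000))  # top 2,000 train examples
    valid_data = dataset["test"].select(range(1000))   # top 1,000 test examples
    return train_data, valid_data
```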
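
A minimal sketch of the model-loading step, assuming a standard PEFT LoRA workflow; the LoRA rank, alpha, and dropout are illustrative assumptions, since the card does not list them.

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "KT-AI/midm-bitext-S-7B-inst-v1"
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.float16,
    trust_remote_code=True,  # Midm ships custom modeling code
)

lora_config = LoraConfig(
    r=8,                # assumed rank
    lora_alpha=16,      # assumed scaling
    lora_dropout=0.05,  # assumed dropout
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

After training, the adapter can be uploaded with `model.push_to_hub("ehekaanldk/lora-midm-7b-nsmc-understanding")`, matching the push_to_hub step above.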
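
The inference and accuracy steps could be sketched as follows. `parse_sentiment()` is a hypothetical helper for mapping generated text back to a 0/1 label (the original script reportedly collects outputs in `eval_dic`), and the snippet assumes `model`, `tokenizer`, and `valid_data` from the sketches above.

```python
from sklearn.metrics import accuracy_score, confusion_matrix

def parse_sentiment(text):
    # Hypothetical mapping from generated text to an NSMC label (1 = positive).
    return 1 if "positive" in text.lower() else 0

predictions, true_labels = [], []
for example in valid_data:  # top 1,000 test examples
    prompt = (
        "Classify the sentiment of the following movie review as positive or negative.\n"
        f"Review: {example['document']}\n"
        "Sentiment:"
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output = model.generate(**inputs, max_new_tokens=8)
    completion = tokenizer.decode(
        output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    predictions.append(parse_sentiment(completion))
    true_labels.append(example["label"])

print(confusion_matrix(true_labels, predictions))
print("Accuracy:", accuracy_score(true_labels, predictions))
```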