Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Serega6678
/
M7Bv01_SFT_50pct_LORA_BF16_FA_2H100
like
0
PEFT
Safetensors
HuggingFaceH4/ultrachat_200k
mistral
alignment-handbook
Generated from Trainer
trl
sft
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
b13dd47
M7Bv01_SFT_50pct_LORA_BF16_FA_2H100
1 contributor
History:
10 commits
Serega6678
Training in progress, step 1000
b13dd47
verified
10 months ago
.gitattributes
1.52 kB
initial commit
10 months ago
adapter_config.json
655 Bytes
Training in progress, step 100
10 months ago
adapter_model.safetensors
83.9 MB
LFS
Training in progress, step 1000
10 months ago
special_tokens_map.json
437 Bytes
Training in progress, step 100
10 months ago
tokenizer.json
1.8 MB
Training in progress, step 100
10 months ago
tokenizer.model
493 kB
LFS
Training in progress, step 100
10 months ago
tokenizer_config.json
1.39 kB
Training in progress, step 100
10 months ago
training_args.bin
Unsafe
pickle
Detected Pickle imports (9)
"torch.device"
,
"accelerate.utils.dataclasses.DistributedType"
,
"alignment.configs.SFTConfig"
,
"transformers.training_args.OptimizerNames"
,
"transformers.trainer_utils.IntervalStrategy"
,
"transformers.trainer_utils.HubStrategy"
,
"accelerate.state.PartialState"
,
"__builtin__.getattr"
,
"transformers.trainer_utils.SchedulerType"
How to fix it?
4.86 kB
LFS
Training in progress, step 100
10 months ago