IrwinD/log_sage_reward_model
Text Classification
Transformers
Safetensors
hdfs_rlhf_log_summary_dataset
distilbert
trl
reward-trainer
Generated from Trainer
Eval Results
License: apache-2.0
Branch: main
1 contributor
History: 12 commits
Latest commit: IrwinD, "End of training", 4b9c9d6 (verified), 11 months ago
File                      Scan    Size       Storage  Last commit      Updated
.gitattributes            Safe    1.52 kB    -        initial commit   12 months ago
README.md                 Safe    5.65 kB    -        End of training  11 months ago
config.json               Safe    655 Bytes  -        End of training  12 months ago
model.safetensors         Safe    268 MB     LFS      End of training  11 months ago
special_tokens_map.json   Safe    125 Bytes  -        Model save       12 months ago
tokenizer.json            Safe    711 kB     -        Model save       12 months ago
tokenizer_config.json     Safe    1.25 kB    -        End of training  12 months ago
training_args.bin         pickle  4.98 kB    LFS      End of training  11 months ago
vocab.txt                 Safe    232 kB     -        Model save       12 months ago

Detected Pickle imports (9) in training_args.bin:
"accelerate.state.PartialState"
"transformers.trainer_utils.IntervalStrategy"
"transformers.training_args.OptimizerNames"
"accelerate.utils.dataclasses.DistributedType"
"transformers.trainer_pt_utils.AcceleratorConfig"
"transformers.trainer_utils.SchedulerType"
"torch.device"
"trl.trainer.reward_config.RewardConfig"
"transformers.trainer_utils.HubStrategy"
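
The pickle imports reported for training_args.bin can, in principle, be listed locally without unpickling (and therefore without executing) the file. The following is a minimal sketch using only the Python standard library, not the Hub's actual scanner. The function names `list_pickle_imports` and `list_checkpoint_imports` are hypothetical, the handling of protocol 4+ pickles is a heuristic, and the zip branch assumes the file was written by a recent `torch.save`, which is believed to store the pickle as a `.pkl` member inside a zip archive.

```python
import pickletools
import zipfile

# Opcodes that push a text string onto the pickle stack.
_STRING_OPS = {"SHORT_BINUNICODE", "BINUNICODE", "BINUNICODE8", "UNICODE"}


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals a pickle stream references.

    The stream is only disassembled with pickletools; nothing is unpickled.
    """
    imports = set()
    strings = []  # text strings seen so far, in push order
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name in _STRING_OPS:
            strings.append(arg)
        elif opcode.name == "GLOBAL":
            # Protocols 0-3: the argument is "module name" in one string.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL" and len(strings) >= 2:
            # Protocol 4+: module and name are usually the two most recently
            # pushed strings. This is a heuristic; names re-read from the
            # memo (BINGET and similar) are not tracked here.
            imports.add(f"{strings[-2]}.{strings[-1]}")
    return imports


def list_checkpoint_imports(path: str) -> set:
    """Scan a raw pickle file, or every .pkl member of a zip archive.

    Assumption: a file saved by torch.save is a zip archive holding a
    data.pkl member; otherwise the file is treated as a raw pickle stream.
    """
    if zipfile.is_zipfile(path):
        found = set()
        with zipfile.ZipFile(path) as archive:
            for member in archive.namelist():
                if member.endswith(".pkl"):
                    found |= list_pickle_imports(archive.read(member))
        return found
    with open(path, "rb") as handle:
        return list_pickle_imports(handle.read())
```

Calling `list_checkpoint_imports("training_args.bin")` on a local copy would be expected to return names similar to the nine listed above, though the exact set may differ because of the memo heuristic.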