Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
distily
/
distily_miles_projector_experiment
like
0
Follow
Distily Project
2
TensorBoard
Safetensors
wikimedia/wikipedia
Distily
gpt2
bitnet
1.58b
Generated from Trainer
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
main
distily_miles_projector_experiment
/
logs
1 contributor
History:
3 commits
lapp0
End of training
4ad37fb
verified
5 months ago
attn_loss_fn=raw_mse, attn_weight=25.0, layer_mapper=all, projector=miles
End of training
5 months ago
attn_loss_fn=raw_mse, attn_weight=5, layer_mapper=all, projector=miles
Training in progress, step 61875
5 months ago