pkbiswas/Phi-3-Detoxified-PPO-LoRa
Tags: Reinforcement Learning, Transformers, PyTorch, Safetensors, trl, ppo, Inference Endpoints
License: apache-2.0
Commit History (main)
66a811a (verified): Push model using huggingface_hub. pkbiswas committed on May 18, 2024
d569e95 (verified): Upload tokenizer. pkbiswas committed on May 2, 2024
03b471c (verified): Upload Phi3ForCausalLM. pkbiswas committed on May 2, 2024
f1fcb3e (verified): Upload tokenizer. pkbiswas committed on May 2, 2024
421110d (verified): Upload Phi3ForCausalLM. pkbiswas committed on May 2, 2024
bfb7444 (verified): Push model using huggingface_hub. pkbiswas committed on May 2, 2024
4a22299 (verified): Initial commit. pkbiswas committed on May 2, 2024