pkbiswas
/

Phi-3-Detoxified-PPO-LoRa

Reinforcement Learning

Model card Files Files and versions Community

Phi-3-Detoxified-PPO-LoRa

Ctrl+K

Ctrl+K

1 contributor

History: 4 commits

pkbiswas's picture

Upload tokenizer

f1fcb3e verified over 1 year ago

.gitattributes

1.52 kB

initial commit over 1 year ago
README.md

1.27 kB

Push model using huggingface_hub. over 1 year ago
adapter_config.json

699 Bytes

Push model using huggingface_hub. over 1 year ago
adapter_model.safetensors

50.4 MB
LFS

Upload Phi3ForCausalLM over 1 year ago
added_tokens.json

293 Bytes

Push model using huggingface_hub. over 1 year ago
config.json

1.25 kB

Push model using huggingface_hub. over 1 year ago
generation_config.json

172 Bytes

Upload Phi3ForCausalLM over 1 year ago
pytorch_model.bin
Detected Pickle imports (3)
- "torch._utils._rebuild_tensor_v2",
- "torch.FloatStorage",
- "collections.OrderedDict"
What is a pickle import?
13.8 kB
LFS

Push model using huggingface_hub. over 1 year ago
special_tokens_map.json

569 Bytes

Push model using huggingface_hub. over 1 year ago
tokenizer.json

1.84 MB

Push model using huggingface_hub. over 1 year ago
tokenizer.model

500 kB
LFS

Push model using huggingface_hub. over 1 year ago
tokenizer_config.json

3.18 kB

Push model using huggingface_hub. over 1 year ago