pkbiswas/Llama-2-7b-Detoxified-PPO-QLoRa

Reinforcement Learning

Inference Endpoints

1 contributor

History: 2 commits

Latest commit: Push model using huggingface_hub. (9b5294d, verified, 7 months ago)

File                       Size       LFS  Last commit                        Updated
.gitattributes             1.52 kB         initial commit                     7 months ago
README.md                  1.29 kB         Push model using huggingface_hub.  7 months ago
adapter_config.json        727 Bytes       Push model using huggingface_hub.  7 months ago
adapter_model.safetensors  160 MB     LFS  Push model using huggingface_hub.  7 months ago
config.json                1.24 kB         Push model using huggingface_hub.  7 months ago
pytorch_model.bin          17.9 kB    LFS  Push model using huggingface_hub.  7 months ago
special_tokens_map.json    437 Bytes       Push model using huggingface_hub.  7 months ago
tokenizer.json             1.84 MB         Push model using huggingface_hub.  7 months ago
tokenizer.model            500 kB     LFS  Push model using huggingface_hub.  7 months ago
tokenizer_config.json      919 Bytes       Push model using huggingface_hub.  7 months ago

pytorch_model.bin: Detected Pickle imports (3)
- collections.OrderedDict
- torch._utils._rebuild_tensor_v2
- torch.FloatStorage
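
The listing reports three pickle imports for pytorch_model.bin but does not say how they were detected. As a minimal sketch, assuming a raw pickle byte stream rather than the actual checkpoint file, Python's standard pickletools module can enumerate the globals a pickle would import without executing it. The function name list_pickle_imports and the sample payload are illustrative only; they are not part of this repository or of the Hub's scanner.

```python
import pickle
import pickletools
from collections import OrderedDict


def list_pickle_imports(data: bytes) -> set:
    """Return the 'module.name' globals referenced by a pickle stream.

    The stream is only parsed opcode by opcode; it is never unpickled.
    """
    imports = set()
    strings = []  # string operands seen so far, used for STACK_GLOBAL
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # Older protocols: arg is "module name" joined by one space.
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
        elif opcode.name == "STACK_GLOBAL":
            # Protocol 4+: approximation that takes the two most recent
            # string operands as module and name. Streams that fetch
            # these names from the memo are not handled by this sketch.
            if len(strings) >= 2:
                imports.add(f"{strings[-2]}.{strings[-1]}")
        elif isinstance(arg, str):
            strings.append(arg)
    return imports


# Hypothetical payload standing in for a real checkpoint's pickle data.
payload = pickle.dumps(OrderedDict(a=1))
found = list_pickle_imports(payload)
print(sorted(found))
```

Comparing the returned set against an allow-list (for example, the three names in the listing) is one way to decide whether a file is safe to load; a PyTorch checkpoint saved as a zip archive would first need its embedded pickle extracted.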