Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
mkahari
/
RL_testing
like
0
Reinforcement Learning
Transformers
Taxi-v3
q-learning
custom-implementation
Eval Results
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
RL_testing
/
mk_ppo_lunar
1 contributor
History:
1 commit
mkahari
PPO LunarLander-v2 model
898f026
almost 2 years ago
_stable_baselines3_version
Safe
5 Bytes
PPO LunarLander-v2 model
almost 2 years ago
data
Safe
14.7 kB
PPO LunarLander-v2 model
almost 2 years ago
policy.optimizer.pth
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
88.1 kB
LFS
PPO LunarLander-v2 model
almost 2 years ago
policy.pth
Safe
pickle
Detected Pickle imports (3)
"torch._utils._rebuild_tensor_v2"
,
"collections.OrderedDict"
,
"torch.FloatStorage"
What is a pickle import?
43.2 kB
LFS
PPO LunarLander-v2 model
almost 2 years ago
pytorch_variables.pth
Safe
pickle
Pickle imports
No problematic imports detected
What is a pickle import?
431 Bytes
LFS
PPO LunarLander-v2 model
almost 2 years ago
system_info.txt
Safe
184 Bytes
PPO LunarLander-v2 model
almost 2 years ago