rlucasz93 committed
Commit ee511ed · 1 Parent(s): a4aab6c

Pyramid pushed

Pyramids-499902.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e2b0ee5743deaccdfbfa3f2761d1e6909ab250d8708a74c8ff3aa1695a9f613a
+ size 1418184
Pyramids-499902.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fbd069481d9134a53c39992ebca92696bd077c87f66e8299e6f536b3ea7843fc
+ size 8650990
Pyramids-500030.onnx ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:e2b0ee5743deaccdfbfa3f2761d1e6909ab250d8708a74c8ff3aa1695a9f613a
+ size 1418184
Pyramids-500030.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3024dbffa86b6752cf9b46de0b4656de852b1a01e7a9054fb99e9e58ceaf3b1e
+ size 8650990
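Each model file above is stored as a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object id, and the size in bytes. As a minimal sketch (the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or any library), a pointer could be parsed and checked against downloaded bytes roughly like this:

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the size and digest the pointer records."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["digest"]


# Pointer text copied from the Pyramids-500030.pt entry in this commit.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:3024dbffa86b6752cf9b46de0b4656de852b1a01e7a9054fb99e9e58ceaf3b1e\n"
    "size 8650990\n"
)
```

With the real file on disk, `matches_pointer(open(path, "rb").read(), pointer)` would confirm that the download corresponds to this commit; the path is whatever location you saved the file to.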
README.md ADDED
@@ -0,0 +1,29 @@
+ ---
+ library_name: ml-agents
+ tags:
+ - Pyramids
+ - deep-reinforcement-learning
+ - reinforcement-learning
+ - ML-Agents-Pyramids
+ ---
+
+ # **ppo** Agent playing **Pyramids**
+ This is a trained model of a **ppo** agent playing **Pyramids** using the [Unity ML-Agents Library](https://github.com/Unity-Technologies/ml-agents).
+
+ ## Usage (with ML-Agents)
+ The Documentation: https://github.com/huggingface/ml-agents#get-started
+ We wrote a complete tutorial to learn to train your first agent using ML-Agents and publish it to the Hub:
+
+
+ ### Resume the training
+ ```
+ mlagents-learn <your_configuration_file_path.yaml> --run-id=<run_id> --resume
+ ```
+ ### Watch your Agent play
+ You can watch your agent **playing directly in your browser:**.
+
+ 1. Go to https://huggingface.co/spaces/unity/ML-Agents-Pyramids
+ 2. Step 1: Find your model_id: rlucasz93/ppo-Pyramid
+ 3. Step 2: Select your *.nn /*.onnx file
+ 4. Click on Watch the agent play 👀
+
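The resume command shown in the README can also be assembled programmatically. The sketch below only builds the argument list and does not launch training; the config path and run id are placeholder values chosen for illustration, and `build_resume_command` is a hypothetical helper rather than anything ML-Agents provides.

```python
import shlex


def build_resume_command(config_path, run_id):
    """Return the argv list for resuming a run, using the flags shown in the README."""
    return ["mlagents-learn", config_path, f"--run-id={run_id}", "--resume"]


# Placeholder values; substitute your own configuration file and run id.
argv = build_resume_command("configuration.yaml", "my-run")
print(shlex.join(argv))
```

If `mlagents-learn` is installed, the same list could be handed to `subprocess.run(argv, check=True)` to start the resumed run.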
checkpoint.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3024dbffa86b6752cf9b46de0b4656de852b1a01e7a9054fb99e9e58ceaf3b1e
+ size 8650990
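Comparing the object ids of the pointer files above shows which of them have byte-identical content. A small sketch, with the oids copied from this diff (shortened to their first 12 hex characters for readability) and a hypothetical helper `group_by_oid`:

```python
from collections import defaultdict

# Oid prefixes copied from the LFS pointers in this commit.
OIDS = {
    "Pyramids-499902.onnx": "e2b0ee5743de",
    "Pyramids-499902.pt": "fbd069481d91",
    "Pyramids-500030.onnx": "e2b0ee5743de",
    "Pyramids-500030.pt": "3024dbffa86b",
    "checkpoint.pt": "3024dbffa86b",
}


def group_by_oid(oids):
    """Return only the oids shared by more than one file, with the file names."""
    groups = defaultdict(list)
    for name, oid in oids.items():
        groups[oid].append(name)
    return {oid: names for oid, names in groups.items() if len(names) > 1}


duplicates = group_by_oid(OIDS)
for oid, names in duplicates.items():
    print(oid, names)
```

On these values the two `.onnx` exports share one oid, and `checkpoint.pt` shares its oid with `Pyramids-500030.pt`, so each pair has identical content under different names.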
config.json ADDED
@@ -0,0 +1 @@
+ {"behaviors": {"Pyramids": {"trainer_type": "ppo", "hyperparameters": {"batch_size": 128, "buffer_size": 2048, "learning_rate": 0.0003, "beta": 0.01, "epsilon": 0.2, "lambd": 0.95, "num_epoch": 3, "learning_rate_schedule": "linear"}, "network_settings": {"normalize": false, "hidden_units": 512, "num_layers": 2, "vis_encode_type": "simple"}, "reward_signals": {"extrinsic": {"gamma": 0.99, "strength": 1.0}, "rnd": {"gamma": 0.99, "strength": 0.01, "network_settings": {"hidden_units": 64, "num_layers": 3}, "learning_rate": 0.0001}}, "keep_checkpoints": 5, "max_steps": 500000, "time_horizon": 128, "summary_freq": 30000}}}
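The hyperparameters in `config.json` imply how much optimization happens each time the experience buffer fills. The sketch below is a back-of-the-envelope calculation, assuming (this is not stated in the commit) that the PPO trainer splits each buffer into `buffer_size // batch_size` minibatches and passes over them `num_epoch` times. `updates_per_buffer` is a hypothetical helper, and only the three relevant fields are copied from the config.

```python
import json

# Subset of the hyperparameters from config.json in this commit.
CONFIG_SUBSET = '{"batch_size": 128, "buffer_size": 2048, "num_epoch": 3}'


def updates_per_buffer(hp):
    """Return (minibatches per epoch, gradient updates per filled buffer).

    Assumes one gradient update per minibatch and num_epoch passes per buffer.
    """
    minibatches = hp["buffer_size"] // hp["batch_size"]
    return minibatches, minibatches * hp["num_epoch"]


hp = json.loads(CONFIG_SUBSET)
minibatches, updates = updates_per_buffer(hp)
print(minibatches, updates)  # 16 48
```

Under that assumption, each filled buffer of 2048 experiences yields 16 minibatches per epoch and 48 gradient updates in total.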
configuration.yaml ADDED
@@ -0,0 +1,32 @@
+ behaviors:
+   Pyramids:
+     trainer_type: ppo
+     hyperparameters:
+       batch_size: 128
+       buffer_size: 2048
+       learning_rate: 0.0003
+       beta: 0.01
+       epsilon: 0.2
+       lambd: 0.95
+       num_epoch: 3
+       learning_rate_schedule: linear
+     network_settings:
+       normalize: false
+       hidden_units: 512
+       num_layers: 2
+       vis_encode_type: simple
+     reward_signals:
+       extrinsic:
+         gamma: 0.99
+         strength: 1.0
+       rnd:
+         gamma: 0.99
+         strength: 0.01
+         network_settings:
+           hidden_units: 64
+           num_layers: 3
+         learning_rate: 0.0001
+     keep_checkpoints: 5
+     max_steps: 500000
+     time_horizon: 128
+     summary_freq: 30000
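The configuration pairs `learning_rate: 0.0003` with `learning_rate_schedule: linear` and `max_steps: 500000`. A minimal sketch of what a linear decay looks like under those values, assuming the rate falls from its initial value to zero at `max_steps`; the exact floor and clamping used inside ML-Agents are not reproduced here, and `linear_lr` is a hypothetical function:

```python
def linear_lr(step, initial_lr=0.0003, max_steps=500_000):
    """Linearly decay the learning rate from initial_lr at step 0 to 0 at max_steps.

    Illustrative only: the step is clamped to [0, max_steps], so the rate
    never goes negative or rises above its initial value.
    """
    fraction_done = min(max(step / max_steps, 0.0), 1.0)
    return initial_lr * (1.0 - fraction_done)


# Sample the schedule at the start, midpoint, and end of training.
for step in (0, 250_000, 500_000):
    print(step, linear_lr(step))
```

Under this assumption the rate is halved by step 250000, which is why late checkpoints such as the ones in this commit change the policy far less per update than early ones.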
events.out.tfevents.1681249406.701fd5495228.1858.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ec3d7a531d5d01246a4ef31929b845aa6ca249fa05ce9c98b402490c6e6ae990
+ size 138399