Andrei Cozma committed
Commit 8a49a12
1 Parent(s): fee7b36
Updates
README.md
CHANGED
@@ -11,10 +11,12 @@ pinned: true
 
 # CS581 Final Project - Dynamic Programming & Monte-Carlo RL Methods
 
-
+Authors: Andrei Cozma and Landon Harris
 
 Evolution of Reinforcement Learning methods from pure Dynamic Programming-based methods to Monte Carlo methods + Bellman Optimization Comparison
 
+[Google Slides](https://docs.google.com/presentation/d/1v4WwBQKoPnGiyCMXgUs-pCCJ8IwZqM3thUf-Ky00eTQ/edit?usp=sharing)
+
 # 1. Requirements
 
 Python 3.6+ with the following major dependencies:
demo.py
CHANGED
@@ -347,7 +347,7 @@ with gr.Blocks(title="CS581 Demo") as demo:
     all_policies = []
 
     gr.components.HTML(
-        "<h1>CS581 Final Project Demo - Dynamic Programming & Monte-Carlo RL Methods (<a href='https://huggingface.co/spaces/acozma/CS581-Algos-Demo'>HF Space</a>)</h1>"
+        "<h1>CS581 Final Project Demo - Dynamic Programming & Monte-Carlo RL Methods (<a href='https://github.com/andreicozma1/CS581-Algorithms-Project'>GitHub</a>) (<a href='https://huggingface.co/spaces/acozma/CS581-Algos-Demo'>HF Space</a>)</h1>"
     )
 
     localstate = gr.State(RunState())