Commit 0347d5d · 1 Parent(s): 0febb3e
Update README.md
README.md
CHANGED
@@ -20,14 +20,16 @@ Generate synthetic data set for the state that you want, search over the action
 You can bootstrap process with priors still search for the desired state
 
 
-
+## reward
 Reward any trajectory proportionally to a semantically similar state as any state in a run with a victory condition.
 Linear or some function reward curve
 
 
-
+## Sample curve
 Sections of states with more changes in them
 
+## notes
+http://www.incompleteideas.net/IncIdeas/BitterLesson.html
 
 
 # 7/21/23
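
The "reward" note in this change could be sketched roughly as follows. This is only an illustration under assumptions the README does not state: states are taken to be plain embedding vectors, "semantically similar" is approximated with cosine similarity, and the names `cosine_similarity`, `linear_curve`, and `trajectory_reward` are hypothetical.

```python
import math
from typing import Callable, Sequence

Vector = Sequence[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def linear_curve(similarity: float) -> float:
    """Linear reward curve: reward equals similarity, clipped at zero."""
    return max(0.0, similarity)


def trajectory_reward(
    trajectory: Sequence[Vector],
    winning_states: Sequence[Vector],
    curve: Callable[[float], float] = linear_curve,
) -> float:
    """Reward a trajectory by its closest match to any state from a winning run.

    Assumption: the best state-to-state similarity stands in for the whole
    trajectory; averaging over states would be an equally plausible choice.
    """
    if not trajectory or not winning_states:
        return 0.0
    best = max(
        cosine_similarity(state, win)
        for state in trajectory
        for win in winning_states
    )
    return curve(best)
```

Passing a different `curve`, for example `lambda s: max(0.0, s) ** 2`, covers the "or some function" part of the note without changing the rest of the sketch.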