Commit · 32b0d27
1 Parent(s): cb9c6a8
Update README.md
README.md
CHANGED
@@ -12,23 +12,42 @@ pinned: true
|
|
12 |
|
13 |

|
14 |
|
15 |
-
# 7/27/23
|
16 |
-
Rogue GPT is an attempt to not only instruction Tune LLM powered agents ( treating llms as reasoning engines) for tasks and the mini hack environment, but explore the use of reinforcement learning and continuous learning for embodied agents inside environments using only llms so that Lessons Learned can be abstracted to other modalities.
|
17 |
|
18 |
-
|
19 |
|
20 |
-
|
21 |
|
22 |
-
|
23 |
|
24 |
-
Tiny
|
25 |
|
26 |
-
|
27 |
|
28 |
-
|
29 |
|
30 |
-
|
31 |
|
32 |
# 7/25/23
|
33 |
https://astralcodexten.substack.com/p/were-not-platonists-weve-just-learned
|
34 |
intelligence explosion
|
|
|
12 |
|
13 |

|
14 |
|
15 |
+
# 7/27/23 RogueGPT
|
|
|
16 |
|
17 |
+
## Rogue GPT
|
18 |
|
19 |
+
Rogue GPT is an attempt not only to instruction-tune LLM-powered agents (treating LLMs as reasoning engines) for tasks in the MiniHack environment, but also to explore the use of reinforcement learning and continuous learning for embodied agents inside environments, using only LLMs, so that the lessons learned can be abstracted to other modalities.
|
20 |
|
21 |
+
## Justifications for the Datasets
|
22 |
|
23 |
+
### Tiny Stories Dataset
|
24 |
|
25 |
+
I want to give the model a basic understanding of the English language so that it can hopefully comprehend the wiki text and the game messages that NetHack produces. [^1^]
|
26 |
|
27 |
+
### Trajectory Dataset
|
28 |
|
29 |
+
This carefully formatted dataset will be used to structure how the agent behaves and how I parse out the states, actions, and other information I'm interested in. [^2^]
|
30 |
|
31 |
+
### Subset of the NetHack Wiki
|
32 |
+
|
33 |
+
I will be creating a subset of the wiki containing the categories I think would be most useful to an agent that needs information about things inside the game. [^3^]
|
34 |
+
|
35 |
+
## Papers I'm Interested In
|
36 |
+
|
37 |
+
- Work in Progress Paper 1 [^4^]
|
38 |
+
- Work in Progress Paper 2 [^5^]
|
39 |
+
- Work in Progress Paper 3 [^6^]
|
40 |
+
|
41 |
+
## References
|
42 |
+
|
43 |
+
[^1^]: [Link to Paper 1]
|
44 |
+
[^2^]: [Link to Paper 2]
|
45 |
+
[^3^]: [Link to Paper 3]
|
46 |
+
[^4^]: [Link to Paper 4]
|
47 |
+
[^5^]: [Link to Paper 5]
|
48 |
+
[^6^]: [Link to Paper 6]
|
49 |
+
|
50 |
+
|
51 |
# 7/25/23
|
52 |
https://astralcodexten.substack.com/p/were-not-platonists-weve-just-learned
|
53 |
intelligence explosion
|