Spaces:
Paused
Paused
Commit
·
cb9c6a8
1
Parent(s):
680f0fc
Update README.md
Browse files
README.md
CHANGED
@@ -19,6 +19,14 @@ I want to use small llms and a focused data set so I can really get a good idea
|
|
19 |
|
20 |
My rough strat is to use the tiny stories data set, a trajectory based data set only using the human Monk trajectories, and select categories from the Nat hack Wiki. I plan to do some ablations to see which data sets are critical. And once I have the basic instruction tuning up so we can follow basic small instructions, I will then attempt to implement some combination of ideas of some papers that I've been interested in.
|
21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
22 |
I am carefully formatting my trajectory data set for two reasons: one I want to make parsing trivial. Two on I am assuming that regularity and format of state's presented to the llm will allow it to generate output that I desire more easily. This is just an assumption.
|
23 |
|
24 |
# 7/25/23
|
|
|
19 |
|
20 |
My rough strat is to use the tiny stories data set, a trajectory based data set only using the human Monk trajectories, and select categories from the Nat hack Wiki. I plan to do some ablations to see which data sets are critical. And once I have the basic instruction tuning up so we can follow basic small instructions, I will then attempt to implement some combination of ideas of some papers that I've been interested in.
|
21 |
|
22 |
+
My justifications for my justifications for the data sets:
|
23 |
+
|
24 |
+
Tiny stories: I want to give the model a basic understanding of the English language so that it can hopefully understand what's happening in the Wikipedia or any of the game messages that nah hack produces.
|
25 |
+
|
26 |
+
Trajectory data set: this carefully formatted data set will be used to structure how the agent behaves and how I parse out the states and actions and other various information I'm interested in.
|
27 |
+
|
28 |
+
Subset of the nattack wiki: I will be making a subset data set that contains categories that I think would be most useful to an agent who should have information on things inside the game.
|
29 |
+
|
30 |
I am carefully formatting my trajectory data set for two reasons: one I want to make parsing trivial. Two on I am assuming that regularity and format of state's presented to the llm will allow it to generate output that I desire more easily. This is just an assumption.
|
31 |
|
32 |
# 7/25/23
|