Commit · cb9c6a8
1 Parent(s): 680f0fc
Update README.md
README.md CHANGED
@@ -19,6 +19,14 @@ I want to use small llms and a focused data set so I can really get a good idea
My rough strategy is to use the TinyStories data set, a trajectory data set built only from the human Monk trajectories, and select categories from the NetHack Wiki. I plan to run some ablations to see which data sets are critical. Once basic instruction tuning is working, so that the model can follow small, simple instructions, I will try to implement some combination of ideas from papers that I've been interested in.

+My justifications for the data sets:
+
+TinyStories: I want to give the model a basic understanding of English so that it can, hopefully, understand the wiki text and the in-game messages that NetHack produces.
+
+Trajectory data set: this carefully formatted data set will be used to structure how the agent behaves and how I parse out the states, actions, and other information I'm interested in.
+
+Subset of the NetHack wiki: I will build a subset that contains the categories I think would be most useful to an agent that needs information about things inside the game.
+
I am carefully formatting my trajectory data set for two reasons. One, I want to make parsing trivial. Two, I am assuming that a regular, consistent format for the states presented to the LLM will make it easier for it to generate the output I want. This is just an assumption.

# 7/25/23