pcalhoun commited on
Commit
a16f7d7
·
1 Parent(s): e0640f9

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -7,7 +7,9 @@ Finetuned on this dataset: https://github.com/calhounpaul/modernized-jonathan-sw
7
 
8
  The format is as follows:
9
 
10
- <|\_| BEGIN PRECEDING CONTEXT |\_| Context goes here. |\_| END PRECEDING CONTEXT |\_|> <|\_| BEGIN TEXT TO RESTATE |\_| Text to swiftify goes here. |\_| RESTATE AS JON SWIFT |\_|
 
 
11
 
12
  Then it should generate until the end of the Swifted statement and then output:
13
 
 
7
 
8
  The format is as follows:
9
 
10
+ <|\_| BEGIN PRECEDING CONTEXT |\_| Context goes here. |\_| END PRECEDING CONTEXT |\_|> <|\_| BEGIN TEXT TO RESTATE |\_| Text to swiftify goes here. |\_| RESTATE AS JON SWIFT |\_|
11
+
12
+ Note: In the training data the above line starts with a space (before the first "<"), and does not end with a space (after the final "|"). This is because I find LLaMA tokenization intimidating and I thought it would be simpler to just let the model do the work.
13
 
14
  Then it should generate until the end of the Swifted statement and then output:
15