Update README.md
README.md CHANGED
@@ -27,4 +27,17 @@ augmxnt/shisa-7b-v1
 * Japanese Law Precedent Dataset
 * Japanese Wikipedia
 * .lg.jp, .go.jp, .ac.jp domain webscrapes from CulturaX (Any documents with same first 25 characters were de-duplicated)
-* English Ultrachat200K-gen (So that it doesn't forget English and chatting ability learned in the base checkpoint)
+* English Ultrachat200K-gen (So that it doesn't forget English and chatting ability learned in the base checkpoint)
+
+# Developed by
+
+### Engineers
+Peter Devine
+Sho Higuchi
+
+### Advisors
+Yuuki Yamanaka
+Atom Sonoda
+
+### Dataset evaluator
+Renju Aoki