Update README.md
Browse files
README.md
CHANGED
|
@@ -1,3 +1,33 @@
|
|
| 1 |
-
---
|
| 2 |
-
license: mit
|
| 3 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: mit
|
| 3 |
+
tags:
|
| 4 |
+
- natural-language-processing
|
| 5 |
+
- code-generation
|
| 6 |
+
- torch
|
| 7 |
+
- lstm
|
| 8 |
+
---
|
| 9 |
+
|
| 10 |
+
This generative text model was trained using [Andrej Karpathy's code](https://github.com/karpathy/char-rnn) on homeworks by Linguistics students' homework assignments for a beginning Python course of HSE University.
|
| 11 |
+
|
| 12 |
+
Model was trained with size 512 and 3 layers, dropout 0.5.
|
| 13 |
+
|
| 14 |
+
## Usage
|
| 15 |
+
|
| 16 |
+
The procedure for installing the required software is described [by Karpathy](https://github.com/karpathy/char-rnn), torch is required, the code is written in lua. Be careful, versions of libraries written many years ago are used.
|
| 17 |
+
|
| 18 |
+
```bash
|
| 19 |
+
th sample.lua lm_lstm_epoch19.14_0.6292.t7 -length 10000 -temperature 0.5 -primetext 'some text'
|
| 20 |
+
```
|
| 21 |
+
|
| 22 |
+
## Train data
|
| 23 |
+
|
| 24 |
+
Train corpus consists of joined programms in to one file inclded in this repository as `input.txt`
|
| 25 |
+
|
| 26 |
+
|
| 27 |
+
## What for?
|
| 28 |
+
|
| 29 |
+
In an era of winning Transformers, ancient RNN models seem archaic. But I see that they still work better than modern architectures with such important categories from the humanities point of view as individual style.
|
| 30 |
+
|
| 31 |
+
This model was created just or fun of students at the end of the course.
|
| 32 |
+
|
| 33 |
+
|