Update README.md
Browse files
README.md
CHANGED
|
@@ -19,4 +19,8 @@ It is not yet a competent model because it does not meet the minimum training re
|
|
| 19 |
|
| 20 |
If you want to try how to use it here is a file of how to use it in [test_gen.py](https://github.com/F4k3r22/LLaDA-from-scratch/blob/main/test_gen.py) Or using this [Google Colab](https://colab.research.google.com/drive/1jPIPu9qHEFMkANzUEkeOxUW6hS3DeVwd?usp=sharing) notebook
|
| 21 |
|
|
|
|
|
|
|
|
|
|
|
|
|
| 22 |
For those who want to train and get the correct format to be able to load it with `transformers`, everything needed is in [`pre_train.py`](https://github.com/F4k3r22/LLaDA-from-scratch/blob/main/pre_train.py) of the project repo
|
|
|
|
| 19 |
|
| 20 |
If you want to try how to use it here is a file of how to use it in [test_gen.py](https://github.com/F4k3r22/LLaDA-from-scratch/blob/main/test_gen.py) Or using this [Google Colab](https://colab.research.google.com/drive/1jPIPu9qHEFMkANzUEkeOxUW6hS3DeVwd?usp=sharing) notebook
|
| 21 |
|
| 22 |
+
Example of the results it gives:
|
| 23 |
+
|
| 24 |
+

|
| 25 |
+
|
| 26 |
For those who want to train and get the correct format to be able to load it with `transformers`, everything needed is in [`pre_train.py`](https://github.com/F4k3r22/LLaDA-from-scratch/blob/main/pre_train.py) of the project repo
|