Update README.md
Browse files
README.md
CHANGED
|
@@ -11,7 +11,7 @@ datasets:
|
|
| 11 |
- hromi/winograd_dpo_basic
|
| 12 |
---
|
| 13 |
|
| 14 |
-

|
| 15 |
|
| 16 |
# UDKai_Garrulus
|
| 17 |
|
|
@@ -49,10 +49,16 @@ But before writing a paper with title "DPO-Contamination with Winogrande increas
|
|
| 49 |
* max_length=1536
|
| 50 |
|
| 51 |
## UDK.ai
|
| 52 |
-
This is the result of the first LLM-optimization experiment running on a hardware of Berlin University of the Arts.
|
|
|
|
|
|
|
|
|
|
|
|
|
| 53 |
|
| 54 |
# Garrulus
|
| 55 |
Originally I planned to call the model "ContaminatedWine" but then I had a nice winter encounter with a very convivial eurasian jay (Garrulus Glandarius in latin), hence the name.
|
| 56 |
|
| 57 |
# Thanks
|
| 58 |
-
Thanks to mlabonne and Cultrix for demonstrating that DPO is not 'rocket science' but within reach of anyone with an idea, a dataset and a GPU
|
|
|
|
|
|
|
|
|
| 11 |
- hromi/winograd_dpo_basic
|
| 12 |
---
|
| 13 |
|
| 14 |
+

|
| 15 |
|
| 16 |
# UDKai_Garrulus
|
| 17 |
|
|
|
|
| 49 |
* max_length=1536
|
| 50 |
|
| 51 |
## UDK.ai
|
| 52 |
+
This is the result of the first LLM-optimization experiment running on a hardware of Berlin University of the Arts (UDK-berlin).
|
| 53 |
+
|
| 54 |
+
DPO took few minutes on a A40.
|
| 55 |
+
|
| 56 |
+
Check [udk.ai](https://udk.ai) from time to time, we plan to make some noise.
|
| 57 |
|
| 58 |
# Garrulus
|
| 59 |
Originally I planned to call the model "ContaminatedWine" but then I had a nice winter encounter with a very convivial eurasian jay (Garrulus Glandarius in latin), hence the name.
|
| 60 |
|
| 61 |
# Thanks
|
| 62 |
+
Thanks to mlabonne and Cultrix for demonstrating that DPO is not 'rocket science' but within reach of anyone with an idea, a dataset and a GPU.
|
| 63 |
+
|
| 64 |
+
And thanks to [unslothai](https://github.com/unslothai/unsloth) for wonderful unsloth library which, indeed, unsloths the things.
|