Update README.md
README.md
CHANGED
@@ -10,8 +10,21 @@ tags:
 license: apache-2.0
 language:
 - en
+datasets:
+- diabolic6045/open-ocra-alpaca-cleaned
+- HashTag766/SMART-Goals-Validation
 ---
 
+# Overview
+#### Fine-tuned Qwen2.5-3B
+#### Fine-tuned to improve instruction following and performance on the datasets listed below.
+#### Training time: 14.5 h
+
+### Datasets
+#### [SMART-Goals-Validation](https://huggingface.co/datasets/HashTag766/SMART-Goals-Validation)
+#### [open-ocra-alpaca-cleaned](https://huggingface.co/datasets/diabolic6045/open-ocra-alpaca-cleaned), only 120000k examples used
+
+
 # Uploaded model
 
 - **Developed by:** HashTag766
@@ -20,4 +33,4 @@ language:
 
 This qwen2 model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
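The card does not state the prompt format used during fine-tuning. Since one of the listed datasets has "alpaca" in its name, a minimal sketch of Alpaca-style prompt formatting might look like the following. The template wording, the field names (`instruction`, `input`, `output`), and the helper `format_example` are assumptions for illustration and are not taken from the repository.

```python
# Assumed Alpaca-style templates; the model card does not confirm this format.
ALPACA_WITH_INPUT = (
    "Below is an instruction that describes a task, paired with an input that "
    "provides further context. Write a response that appropriately completes "
    "the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Response:\n{output}"
)

ALPACA_NO_INPUT = (
    "Below is an instruction that describes a task. Write a response that "
    "appropriately completes the request.\n\n"
    "### Instruction:\n{instruction}\n\n"
    "### Response:\n{output}"
)


def format_example(instruction: str, input_text: str = "", output: str = "") -> str:
    """Render one record as a single prompt string (hypothetical helper).

    Leave `output` empty to build an inference prompt that ends right
    after the response header.
    """
    # Pick the template based on whether the record carries extra context.
    template = ALPACA_WITH_INPUT if input_text.strip() else ALPACA_NO_INPUT
    # str.format ignores the unused `input` keyword for the no-input template.
    return template.format(
        instruction=instruction.strip(),
        input=input_text.strip(),
        output=output.strip(),
    )
```

For inference, calling `format_example("Is this goal SMART?", "Run 5 km by June.")` would give a prompt that ends at the response header, leaving the model to generate the answer. If the model was actually trained with a different template (for example a chat template), that template should be used instead.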