File size: 1,734 Bytes
c00cc75
2e9e6ad
dd4d3d2
 
c00cc75
dd4d3d2
9398174
dd4d3d2
b87b061
dd4d3d2
c00cc75
 
dd4d3d2
c00cc75
dd4d3d2
 
c00cc75
dd4d3d2
c00cc75
dd4d3d2
 
c00cc75
dd4d3d2
 
 
 
 
c00cc75
dd4d3d2
2e9e6ad
dd4d3d2
d6fe8b3
2e9e6ad
dd4d3d2
2e9e6ad
dd4d3d2
923d2ac
dd4d3d2
 
 
 
 
 
 
d6fe8b3
 
c00cc75
dd4d3d2
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
---
base_model: unsloth/Qwen3.5-9B
library_name: transformers
model_name: LayLaLLM-SimpleText2026
tags:
- generated_from_trainer
- trl
- sft
- unsloth
licence: license
---

# Model Card for LayLaLLM-SimpleText2026

This model is a fine-tuned version of [unsloth/Qwen3.5-9B](https://huggingface.co/unsloth/Qwen3.5-9B).
It has been trained using [TRL](https://github.com/huggingface/trl).

## Quick start

```python
from transformers import pipeline

question = "If you had a time machine, but could only go to the past or the future once and never return, which would you choose and why?"
generator = pipeline("text-generation", model="IoakeimE/LayLaLLM-SimpleText2026", device="cuda")
output = generator([{"role": "user", "content": question}], max_new_tokens=128, return_full_text=False)[0]
print(output["generated_text"])
```

## Training procedure

[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="150" height="24"/>](https://wandb.ai/ioakeime-aristotle-university-of-thessaloniki/LayLaLLM-SimpleText2026/runs/abi6oo5b) 


This model was trained with SFT.

### Framework versions

- TRL: 0.23.0
- Transformers: 5.8.0
- Pytorch: 2.10.0
- Datasets: 4.3.0
- Tokenizers: 0.22.2

## Citations



Cite TRL as:
    
```bibtex
@misc{vonwerra2022trl,
	title        = {{TRL: Transformer Reinforcement Learning}},
	author       = {Leandro von Werra and Younes Belkada and Lewis Tunstall and Edward Beeching and Tristan Thrush and Nathan Lambert and Shengyi Huang and Kashif Rasul and Quentin Gallou{\'e}dec},
	year         = 2020,
	journal      = {GitHub repository},
	publisher    = {GitHub},
	howpublished = {\url{https://github.com/huggingface/trl}}
}
```