# No-GPU-Execution Attestation

This repository contains a complete Nomen-AI training and deployment pipeline, but the actual SFT/DPO adapter weights were not produced in the agent environment.

## Reason

Requests to run on a GPU or through HF Jobs were repeatedly rejected by the user/tool approval layer during this session. As a result, the following commands were prepared but never ran successfully in this environment:

```bash
python scripts/preflight_gpu.py
python scripts/smoke_test.py
python scripts/train_sft.py
python scripts/train_dpo.py
```

## Current adapter state

- `krystv/nomen-ai-sft-lora`: initialized, no `adapter_model.safetensors` yet.
- `krystv/nomen-ai-dpo-lora`: initialized, no `adapter_model.safetensors` yet.
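
The adapter state listed above can be re-checked after an external run. The following is a minimal sketch, not part of the repository's artifact checker: the helper names `has_adapter_weights` and `check_repo` are hypothetical, and it assumes the weights are uploaded to the repository root under the standard PEFT file name. It uses `list_repo_files` from `huggingface_hub`, which may need a token if the repositories are private.

```python
from typing import Iterable

# Standard PEFT file name for LoRA adapter weights (assumed to sit at the repo root).
ADAPTER_WEIGHTS = "adapter_model.safetensors"


def has_adapter_weights(repo_files: Iterable[str]) -> bool:
    """Return True if a repo file listing contains the adapter weights file."""
    return ADAPTER_WEIGHTS in set(repo_files)


def check_repo(repo_id: str) -> bool:
    """Hypothetical helper: query the Hub and report whether weights exist.

    huggingface_hub is imported lazily so has_adapter_weights stays usable
    without the library installed or network access.
    """
    from huggingface_hub import list_repo_files

    return has_adapter_weights(list_repo_files(repo_id))


# Example usage (requires network access):
#   for repo_id in ("krystv/nomen-ai-sft-lora", "krystv/nomen-ai-dpo-lora"):
#       print(repo_id, "trained" if check_repo(repo_id) else "weights missing")
```

Until training has been run externally, both repositories are expected to report missing weights.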

## What is complete

- Codebase
- Datasets
- Training scripts
- DPO scripts
- Validation scripts
- Colab notebooks
- Docker GPU path
- Demo Space
- Artifact checker
- Research/citation/license docs

## Required external run

Run one of the following on a GPU machine with a valid Hugging Face token:

```bash
bash scripts/train_all_colab.sh
```

or:

```bash
docker compose up --build
```
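
Before starting either run, it can help to confirm that the two stated prerequisites are in place. The sketch below is illustrative only and is not the repository's `scripts/preflight_gpu.py`. It assumes the token is supplied through the `HF_TOKEN` environment variable (a token cached by a CLI login would not be detected) and that the GPU is an NVIDIA device with `nvidia-smi` on `PATH`. The function names are hypothetical.

```python
import os
import shutil
from typing import List, Mapping, Optional


def preflight_problems(env: Mapping[str, str], gpu_tool: Optional[str]) -> List[str]:
    """Return human-readable problems; an empty list means both checks passed.

    env      -- environment mapping to inspect (assumption: token is in HF_TOKEN)
    gpu_tool -- resolved path to nvidia-smi, or None if it was not found
    """
    problems: List[str] = []
    if not env.get("HF_TOKEN"):
        problems.append("HF_TOKEN is not set")
    if gpu_tool is None:
        problems.append("nvidia-smi not found on PATH")
    return problems


def run_preflight() -> List[str]:
    """Check the current process environment and PATH."""
    return preflight_problems(os.environ, shutil.which("nvidia-smi"))
```

Calling `run_preflight()` on the target machine returns an empty list when both checks pass. Finding `nvidia-smi` only suggests that a driver is installed; it does not prove that a usable GPU is attached.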