File size: 518 Bytes
6f8d8d9
 
 
 
 
bc02199
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
# Training Data

Reserved for SFT or LoRA training data.

Target: 200-500 generated samples, with at least 50 manually selected high-quality samples.

## Current Preview

Generate deterministic mock preview data with:

```bash
.venv/bin/python -B scripts/generate_dataset.py
```

Default output:

```text
data/train/objectverse_sft_preview.jsonl
```

This preview is synthetic and mock-generated. It is useful for validating schema, curation workflow, and training script assumptions, but it is not the final dataset.