File size: 1,073 Bytes
cdf485e
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
# Data Directory

This directory holds the preference datasets used by PreferenceLab.

On first run, if these files are absent, the environment falls back to
built-in synthetic examples (defined in `server/environment.py`).

## File Format

### pairwise_data.json
```json
[
  {
    "prompt": "...",
    "response_a": "...",
    "response_b": "...",
    "gold_label": "A",
    "source": "hh-rlhf"
  }
]
```

### likert_data.json
```json
[
  {
    "prompt": "...",
    "response": "...",
    "rubric": "...",
    "gold_scores": {
      "helpfulness": 4,
      "honesty": 5,
      "harmlessness": 5,
      "instruction_following": 4
    },
    "source": "ultrafeedback"
  }
]
```

### consistency_data.json
```json
[
  {
    "prompt": "...",
    "response_a": "...",
    "response_b": "...",
    "response_c": "...",
    "response_d": "...",
    "gold_ranking": ["C", "A", "B", "D"],
    "source": "stanford-shp"
  }
]
```

## Loading Real Datasets

Run `python scripts/prepare_datasets.py` to download and convert
HH-RLHF, UltraFeedback, and Stanford SHP into these formats.