---

title: README
emoji: "📊"
colorFrom: blue
colorTo: purple
sdk: static
pinned: false
---


# OBay Data

**World-class training data production for frontier AI models.**

We build high-quality datasets that power the next generation of AI, from large language models to embodied intelligence.

## What We Do

| Domain | Description |
|--------|-------------|
| 🧠 **Pre-training Data** | Large-scale, curated corpora for foundation model training |
| 🎯 **Post-training Data** | SFT, RLHF, DPO datasets for alignment and instruction-following |
| 🤖 **Embodied AI Data** | Robotics trajectories, gameplay recordings, sensor logs for world models |
| 🖼️ **Multimodal Data** | Image editing, composition, style transfer instruction sets |

## Datasets

| Dataset | Description |
|---------|-------------|
| trajectory_demo | Terminal agent trajectories (ATIF format) |
| svg-multimodal-rubrics | SVG code generation + evaluation rubrics |
| image-editing-style-instruction-following | Style transfer + instruction following |
| swe-coding-instruction-following | SWE-bench coding tasks |
| world-model-gameplay-recording | Gameplay recording for world model training |
| multi-image-composition-instruction-following | Multi-image composition with instructions |



## Contact

🌐 [obaydata.com](https://obaydata.com) · 💻 [GitHub](https://github.com/simonsu20000) · ✉️ simon.su@obaydata.com