Add model description, links and citation

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +38 -2
README.md CHANGED
@@ -1,7 +1,43 @@
1
  ---
2
  base_model:
3
  - Qwen/Qwen3-32B
4
- pipeline_tag: text-generation
5
  library_name: transformers
 
6
  arxiv: 2602.11089
7
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  base_model:
3
  - Qwen/Qwen3-32B
 
4
  library_name: transformers
5
+ pipeline_tag: text-generation
6
  arxiv: 2602.11089
7
+ license: other
8
+ ---
9
+
10
+ # DataChef-32B
11
+
12
+ [**HF Models**](https://huggingface.co/yichengchen24/DataChef-32B) | [**HF Demo**](https://huggingface.co/spaces/yichengchen24/DataChef) | [**Paper**](https://arxiv.org/abs/2602.11089) | [**GitHub**](https://github.com/yichengchen24/DataChef)
13
+
14
+ DataChef-32B is a specialized large language model designed for **automated data recipe generation**. It was introduced in the paper [DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning](https://huggingface.co/papers/2602.11089).
15
+
16
+ DataChef-32B facilitates LLM adaptation by generating executable data processing pipelines (data recipes) that transform raw data sources into high-quality training corpora targeted at specific benchmarks.
17
+
18
+ ## Model Description
19
+ DataChef-32B addresses the manual, labor-intensive process of designing data processing pipelines. It was trained using online reinforcement learning with a proxy reward system that predicts downstream performance for candidate recipes. Given a target benchmark and available data sources, the model outputs a complete data recipe to adapt a base LLM.
20
+
21
+ ### Performance Highlights
22
+ Across diverse tasks, DataChef-32B produces practical recipes that reach performance comparable to those curated by human experts. Notably, a recipe generated by DataChef-32B was used to adapt Qwen3-1.7B-Base to the math domain, achieving a score of **66.7 on AIME'25**, surpassing the performance of the standard Qwen3-1.7B.
23
+
24
+ ## Installation
25
+ To use the DataChef framework for generating your own data recipes, follow the installation steps from the [GitHub repository](https://github.com/yichengchen24/DataChef):
26
+
27
+ ```bash
28
+ conda create -n datachef python=3.12
29
+ conda activate datachef
30
+ pip install -e .
31
+ ```
32
+
33
+ ## Citation
34
+ If you find this work helpful, please consider citing:
35
+
36
+ ```bibtex
37
+ @article{chen2026datachef,
38
+ title={DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning},
39
+ author={Chen, Yicheng and Ma, Zerun and Xie, Xinchen and Li, Yining and Chen, Kai},
40
+ journal={arXiv preprint arXiv:2602.11089},
41
+ year={2026}
42
+ }
43
+ ```