---
license: cc-by-2.0
---
# World Knowledge Dataset and Model

This model and dataset are the official artifacts for the paper **[Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration](https://huggingface.co/papers/2604.18131)**.

## 📄 Paper Information
- **Title:** Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration
- **Authors:** Qifan Zhang, Dongyang Ma, Tianqing Fang, Jia Li, Jing Tang, Nuo Chen, Haitao Mi, Yan Wang
- **Hugging Face Paper Page:** [https://huggingface.co/papers/2604.18131](https://huggingface.co/papers/2604.18131)
- **arXiv:** [arXiv:2604.18131](https://arxiv.org/abs/2604.18131)

## 💻 Code Repository
The official codebase for the web-agent pipeline, including data generation, preprocessing, and evaluation scripts, can be found on GitHub:
**[Bklight999/world-knowledge](https://github.com/Bklight999/world-knowledge)**


## 📝 Citation
If you use this repo or our code, please cite our paper:

```bibtex
@article{zhang2026training,
  title={Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration},
  author={Zhang, Qifan and Ma, Dongyang and Fang, Tianqing and Li, Jia and Tang, Jing and Chen, Nuo and Mi, Haitao and Wang, Yan},
  journal={arXiv preprint arXiv:2604.18131},
  year={2026}
}
```