metadata
license: cc-by-2.0
World Knowledge Dataset and Model
This training model and dataset are the official artifact for the paper Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration.
π Paper Information
- Title: Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration
- Authors: Qifan Zhang, Dongyang Ma, Tianqing Fang, Jia Li, Jing Tang, Nuo Chen, Haitao Mi, Yan Wang
- Hugging Face Paper Page: https://huggingface.co/papers/2604.18131
- arXiv: arXiv:2604.18131
π» Code Repository
The official codebase for the web-agent pipeline, including data generation, preprocessing, and evaluation scripts, can be found on GitHub: Bklight999/world-knowledge
π Citation
If you use this repo or our code, please cite our paper:
@article{zhang2026training,
title={Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration},
author={Zhang, Qifan and Ma, Dongyang and Fang, Tianqing and Li, Jia and Tang, Jing and Chen, Nuo and Mi, Haitao and Wang, Yan},
journal={arXiv preprint arXiv:2604.18131},
year={2026}
}