Add comprehensive model card for Chain-of-Agents (AFM)

#1
by nielsr HF Staff - opened
Files changed (1)
  1. README.md +111 -0
README.md ADDED
---
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation
tags:
- agents
- foundation-model
- multi-agent
- reinforcement-learning
- code-generation
- web-browsing
- large-language-model
---

# Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

This repository contains an **Agent Foundation Model (AFM)** based on the paper [Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL](https://huggingface.co/papers/2508.13167).

<div align="center">
<a href='https://chain-of-agents-afm.github.io/'><img src='https://img.shields.io/badge/Project-Homepage-blue?logo=github&logoColor=white'></a>
<a href='https://huggingface.co/papers/2508.13167'><img src='https://img.shields.io/badge/Paper-HuggingFace-d63031?logo=huggingface&logoColor=white'></a>
<a href='https://github.com/OPPO-PersonalAI/Agent_Foundation_Models'><img src='https://img.shields.io/badge/Code-GitHub-red?logo=github&logoColor=white'></a>
<a href='https://huggingface.co/collections/PersonalAILab/afm-689200e11d0b21a67c015ba8'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Models-Huggingface-yellow'></a>
<a href='https://huggingface.co/collections/PersonalAILab/afm-datasets-6892140eaad360ea5ccdcde1'><img src='https://img.shields.io/badge/%F0%9F%A4%97%20Datasets-Huggingface-yellow'></a>
</div>

## Overview

Recent advances in large language models (LLMs) and multi-agent systems have demonstrated remarkable capabilities on complex problem-solving tasks. However, most existing multi-agent systems rely on manual prompt and workflow engineering, which makes them computationally inefficient and hard to adapt.

This work introduces **Chain-of-Agents (CoA)**, a novel paradigm of LLM reasoning that enables native end-to-end complex problem-solving by simulating multi-agent collaboration within a single model. The model dynamically activates different tool agents and role-playing agents to carry out multi-turn problem-solving. To elicit these abilities, a multi-agent distillation framework distills state-of-the-art multi-agent systems into chain-of-agents trajectories for agentic supervised fine-tuning, which is then further improved by agentic reinforcement learning on verifiable tasks. The resulting models are called **Agent Foundation Models (AFMs)**.

Empirical studies demonstrate that AFM establishes new state-of-the-art performance across diverse benchmarks in both web agent and code agent settings.

<div align="center">
<img src="https://github.com/OPPO-PersonalAI/Agent_Foundation_Models/raw/main/assets/afm.png" width="85%" height="auto" alt="Chain-of-Agents Overview"/>
</div>

## Key Features

* **Core Paradigm**: Chain-of-Agents (CoA) for end-to-end problem-solving within a single model, simulating multi-agent collaboration via dynamic activation of tool and role-playing agents.
* **Training Framework**: Utilizes a Multi-Agent Distillation pipeline and Agentic Reinforcement Learning, supporting mask fine-tuning for selective learning.
* **Agent Capabilities**: Excels at web interaction (Web Agent), multi-hop question answering (MHQA Agent), and code execution (Code Agent).
* **Tool Integration**: Features web search and crawling servers, a secure code sandbox (via nsjail), and configurable multi-tool collaboration.
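The "mask fine-tuning for selective learning" mentioned above can be sketched in data-preparation terms: only tokens the agent itself generated contribute to the SFT loss, while environment-injected tokens (the prompt, tool results) are masked out. This is a minimal illustration only — the segment roles below are hypothetical, not AFM's actual trajectory schema; the `-100` ignore label follows the standard PyTorch/`transformers` cross-entropy convention.

```python
# Hypothetical sketch of mask fine-tuning: model-generated tokens keep their
# ids as labels, while tokens injected by the user or environment are masked
# with -100, which cross-entropy losses in transformers/PyTorch ignore.
IGNORE_INDEX = -100

def build_labels(segments):
    """segments: list of (role, token_ids); returns (input_ids, labels)."""
    input_ids, labels = [], []
    for role, tokens in segments:
        input_ids.extend(tokens)
        if role in ("user", "tool_result"):   # not produced by the model
            labels.extend([IGNORE_INDEX] * len(tokens))
        else:                                 # model-generated reasoning/calls
            labels.extend(tokens)
    return input_ids, labels

# Illustrative trajectory with made-up token ids
trajectory = [
    ("user",        [101, 102]),        # task prompt
    ("assistant",   [201, 202, 203]),   # reasoning + tool call
    ("tool_result", [301, 302]),        # environment-injected observation
    ("assistant",   [401]),             # final answer
]
ids, labels = build_labels(trajectory)
print(labels)  # prompt and tool-result positions are -100
```

The same idea extends to batched tensors; the key point is that loss gradients flow only through the agent's own actions.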

## Quick Start (with `transformers`)

This section provides general instructions for loading and running an Agent Foundation Model with the Hugging Face `transformers` library. Note that individual models in the AFM collection (e.g., different sizes, or SFT vs. RL checkpoints) may have their own requirements or optimized usage patterns.

First, install the required libraries (`accelerate` is needed for `device_map="auto"` in the example below):
```bash
pip install torch transformers accelerate
```

For detailed installation instructions, environment setup, and advanced usage (including training and evaluation scripts, and specific tool integration), please refer to the [official GitHub repository](https://github.com/OPPO-PersonalAI/Agent_Foundation_Models).

Here's an example of how to load and use a generic AFM for text generation:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Replace 'your-model-name' with the actual model ID you wish to use from the AFM collection,
# e.g., "PersonalAILab/afm-qwen2.5-7b-sft" or "PersonalAILab/afm-qwen2.5-32b-sft"
model_name = "your-model-name"

# Load tokenizer and model
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    torch_dtype=torch.bfloat16,
    device_map="auto",
    trust_remote_code=True,
)

# Example: ask the agent to perform a task or answer a question
prompt = "Act as a web agent. Find the current temperature in Paris, France."

# Prepare inputs
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

# Generate a response.
# Adjust generation parameters (max_new_tokens, temperature, etc.) as needed for your task.
# The model's behavior is agentic, so it may emit tool calls or multi-turn reasoning.
output = model.generate(
    **inputs,
    max_new_tokens=512,
    temperature=0.7,
    do_sample=True,
    eos_token_id=tokenizer.eos_token_id,
)
response = tokenizer.decode(output[0], skip_special_tokens=True)

print(response)
```
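Because the output is agentic, the generated text may interleave reasoning with tool calls that a driver loop must parse and execute before feeding results back to the model. The sketch below shows only the parsing step, assuming a hypothetical `<tool_call>…</tool_call>` JSON wire format — AFM's actual tool-call protocol may differ, so consult the GitHub repository for the real format.

```python
import json
import re

# Hypothetical wire format for illustration only:
# <tool_call>{"name": ..., "args": {...}}</tool_call>
TOOL_CALL_RE = re.compile(r"<tool_call>(.*?)</tool_call>", re.DOTALL)

def extract_tool_calls(text):
    """Return the parsed JSON payloads of any tool calls found in `text`."""
    return [json.loads(payload) for payload in TOOL_CALL_RE.findall(text)]

# Example model output containing one (made-up) tool call
response = (
    "I will look this up. "
    '<tool_call>{"name": "web_search", "args": {"query": "temperature Paris"}}</tool_call>'
)
calls = extract_tool_calls(response)
print(calls[0]["name"])  # web_search
```

In a full driver loop, each extracted call would be dispatched to the corresponding tool server and its result appended to the conversation before the next `generate` step.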

## Citation

If you find `AFM` useful in your research or applications, we would appreciate it if you could cite our work:

```bibtex
@misc{li2025chainofagentsendtoendagentfoundation,
      title={Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL},
      author={Weizhen Li and Jianbo Lin and Zhuosong Jiang and Jingyi Cao and Xinpeng Liu and Jiayu Zhang and Zhenqiang Huang and Qianben Chen and Weichen Sun and Qiexiang Wang and Hongxuan Lu and Tianrui Qin and Chenghao Zhu and Yi Yao and Shuying Fan and Xiaowan Li and Tiannan Wang and Pai Liu and King Zhu and He Zhu and Dingfeng Shi and Piaohong Wang and Yeyi Guan and Xiangru Tang and Minghao Liu and Yuchen Eleanor Jiang and Jian Yang and Jiaheng Liu and Ge Zhang and Wangchunshu Zhou},
      year={2025},
      eprint={2508.13167},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2508.13167},
}
```