nielsr HF Staff commited on
Commit
47bef89
·
verified ·
1 Parent(s): 437dfa0

Add pipeline tag, library name, and sample usage

Browse files

This PR improves the model card by adding:
- `pipeline_tag: text-generation` and `library_name: transformers` to the metadata for better discoverability and to enable the code snippet widget.
- A link to the paper on the Hugging Face Hub.
- A "Sample Usage" section based on the instructions in the official GitHub repository for serving the model and running the agent demo.
- Cleaned up the README by removing technical file metadata.

Files changed (1) hide show
  1. README.md +29 -11
README.md CHANGED
@@ -1,14 +1,16 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
  - Qwen/Qwen3-4B
5
  language:
6
- - en
 
7
  tags:
8
- - agent
9
- - tool-use
10
- - reinforcement-learning
11
- - mcp
 
 
12
  ---
13
 
14
  <h1 align="center">Arctic-AWM-4B</h1>
@@ -29,15 +31,31 @@ tags:
29
  <sup>1</sup>UNC-Chapel Hill &nbsp; <sup>2</sup>Snowflake AI Research &nbsp;
30
  </p>
31
 
32
-
33
-
34
  # Overview
35
 
36
- **Arctic-AWM-4B** is a multi-turn tool-use agent model trained with agentic reinforcement learning on [Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B), using the fully synthetic environments from [AgentWorldModel-1K](https://huggingface.co/datasets/Snowflake/AgentWorldModel-1K).
37
 
38
  The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.
39
 
40
- For detailed usage of the model, please visit [https://github.com/Snowflake-Labs/agent-world-model](https://github.com/Snowflake-Labs/agent-world-model).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
41
 
42
  # Resources
43
 
@@ -66,4 +84,4 @@ If you find this resource useful, please kindly cite:
66
  primaryClass={cs.AI},
67
  url={https://arxiv.org/abs/2602.10090},
68
  }
69
- ```
 
1
  ---
 
2
  base_model:
3
  - Qwen/Qwen3-4B
4
  language:
5
+ - en
6
+ license: apache-2.0
7
  tags:
8
+ - agent
9
+ - tool-use
10
+ - reinforcement-learning
11
+ - mcp
12
+ pipeline_tag: text-generation
13
+ library_name: transformers
14
  ---
15
 
16
  <h1 align="center">Arctic-AWM-4B</h1>
 
31
  <sup>1</sup>UNC-Chapel Hill &nbsp; <sup>2</sup>Snowflake AI Research &nbsp;
32
  </p>
33
 
 
 
34
  # Overview
35
 
36
+ **Arctic-AWM-4B** is a multi-turn tool-use agent model trained with agentic reinforcement learning on [Qwen3-4B](https://huggingface.co/Qwen/Qwen3-4B), using the fully synthetic environments from [AgentWorldModel-1K](https://huggingface.co/datasets/Snowflake/AgentWorldModel-1K). It was introduced in the paper [Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning](https://huggingface.co/papers/2602.10090).
37
 
38
  The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.
39
 
40
+ # Sample Usage
41
+
42
+ To use the model for agentic tasks, you can serve it using [vLLM](https://github.com/vllm-project/vllm) and interact with it using the `awm` CLI tool.
43
+
44
+ ### Serve the model
45
+ ```bash
46
+ vllm serve Snowflake/Arctic-AWM-4B --host 127.0.0.1 --port 8000
47
+ ```
48
+
49
+ ### Run the Agent Demo
50
+ After starting an MCP environment (see the [GitHub repository](https://github.com/Snowflake-Labs/agent-world-model) for environment setup), you can run the agent:
51
+
52
+ ```bash
53
+ awm agent \
54
+ --task "show me the top 10 most expensive products" \
55
+ --mcp_url http://localhost:8001/mcp \
56
+ --vllm_url http://localhost:8000/v1 \
57
+ --model Snowflake/Arctic-AWM-4B
58
+ ```
59
 
60
  # Resources
61
 
 
84
  primaryClass={cs.AI},
85
  url={https://arxiv.org/abs/2602.10090},
86
  }
87
+ ```