Add pipeline tag, library name, correct base model and add sample usage

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +30 -11
README.md CHANGED
@@ -1,14 +1,16 @@
1
  ---
2
- license: apache-2.0
3
  base_model:
4
- - Qwen/Qwen3-4B
5
  language:
6
- - en
 
7
  tags:
8
- - agent
9
- - tool-use
10
- - reinforcement-learning
11
- - mcp
 
 
12
  ---
13
 
14
  <h1 align="center">Arctic-AWM-14B</h1>
@@ -29,11 +31,9 @@ tags:
29
  <sup>1</sup>UNC-Chapel Hill &nbsp; <sup>2</sup>Snowflake AI Research &nbsp;
30
  </p>
31
 
32
-
33
-
34
  # Overview
35
 
36
- **Arctic-AWM-14B** is a multi-turn tool-use agent model trained with agentic reinforcement learning on [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B), using the fully synthetic environments from [AgentWorldModel-1K](https://huggingface.co/datasets/Snowflake/AgentWorldModel-1K).
37
 
38
  The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.
39
 
@@ -52,6 +52,25 @@ Related resources are also available, please check:
52
  | πŸ€– Arctic-AWM-8B | [πŸ€— Snowflake/Arctic-AWM-8B](https://huggingface.co/Snowflake/Arctic-AWM-8B) |
53
  | πŸ€– Arctic-AWM-14B | [πŸ€— Snowflake/Arctic-AWM-14B](https://huggingface.co/Snowflake/Arctic-AWM-14B) |
54
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
55
  # Citation
56
 
57
  If you find this resource useful, please kindly cite:
@@ -66,4 +85,4 @@ If you find this resource useful, please kindly cite:
66
  primaryClass={cs.AI},
67
  url={https://arxiv.org/abs/2602.10090},
68
  }
69
- ```
 
1
  ---
 
2
  base_model:
3
+ - Qwen/Qwen3-14B
4
  language:
5
+ - en
6
+ license: apache-2.0
7
  tags:
8
+ - agent
9
+ - tool-use
10
+ - reinforcement-learning
11
+ - mcp
12
+ pipeline_tag: text-generation
13
+ library_name: transformers
14
  ---
15
 
16
  <h1 align="center">Arctic-AWM-14B</h1>
 
31
  <sup>1</sup>UNC-Chapel Hill &nbsp; <sup>2</sup>Snowflake AI Research &nbsp;
32
  </p>
33
 
 
 
34
  # Overview
35
 
36
+ **Arctic-AWM-14B** is a multi-turn tool-use agent model trained with agentic reinforcement learning on [Qwen3-14B](https://huggingface.co/Qwen/Qwen3-14B), using the fully synthetic environments from [AgentWorldModel-1K](https://huggingface.co/datasets/Snowflake/AgentWorldModel-1K). It was introduced in the paper [Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning](https://huggingface.co/papers/2602.10090).
37
 
38
  The model is trained to interact with tool-use environments exposed via a unified MCP (Model Context Protocol) interface, enabling strong multi-turn agentic capabilities.
39
 
 
52
  | πŸ€– Arctic-AWM-8B | [πŸ€— Snowflake/Arctic-AWM-8B](https://huggingface.co/Snowflake/Arctic-AWM-8B) |
53
  | πŸ€– Arctic-AWM-14B | [πŸ€— Snowflake/Arctic-AWM-14B](https://huggingface.co/Snowflake/Arctic-AWM-14B) |
54
 
55
+ # Sample Usage
56
+
57
+ You can use [vLLM](https://github.com/vllm-project/vllm) to serve the model and interact with it using the `awm` CLI provided in the [official repository](https://github.com/Snowflake-Labs/agent-world-model).
58
+
59
+ ```bash
60
+ # serve the model
61
+ vllm serve Snowflake/Arctic-AWM-14B --host 127.0.0.1 --port 8000
62
+
63
+ # start the environment (example scenario)
64
+ awm env start --scenario e_commerce_33 --envs_load_path outputs/gen_envs.jsonl --port 8001
65
+
66
+ # run the agent
67
+ awm agent \
68
+ --task "show me the top 10 most expensive products" \
69
+ --mcp_url http://localhost:8001/mcp \
70
+ --vllm_url http://localhost:8000/v1 \
71
+ --model Snowflake/Arctic-AWM-14B
72
+ ```
73
+
74
  # Citation
75
 
76
  If you find this resource useful, please kindly cite:
 
85
  primaryClass={cs.AI},
86
  url={https://arxiv.org/abs/2602.10090},
87
  }
88
+ ```