add: readme file
README.md
## AgentEvol-7B
<p align="center">
<a href="TODO" target="_blank">Paper</a> • <a href="https://agentgym.github.io/" target="_blank">Project Page</a> • 💻 <a href="https://github.com/WooooDyy/AgentGym" target="_blank">[GitHub Repo]</a> • <a href="https://huggingface.co/datasets/AgentGym/AgentTraj-L" target="_blank">[Trajectory Dataset]</a> • <a href="https://huggingface.co/datasets/AgentGym/AgentEval" target="_blank">[Eval Benchmark]</a> • 🤗 <a href="https://huggingface.co/AgentGym/AgentEvol-7B" target="_blank">Model (AgentEvol-7B)</a><br>
</p>

**AgentEvol** is a novel method for evolving generally capable LLM-based agents across multiple environments. It first trains a base generally capable agent with behavioral cloning, equipping it with basic abilities and prior knowledge. The agent then explores and learns across a variety of tasks and environments.

**AgentEvol-7B** is trained with the AgentEvol algorithm on top of Llama-2-Chat-7B. The model is first trained on the AgentTraj set with behavioral cloning. It then explores and learns from a broader set of instructions. After evolution, it outperforms SOTA models on many tasks.
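
The two stages described above (behavioral cloning, then exploration and learning) can be illustrated with a toy sketch. This is not the AgentGym implementation: the tabular policy, the `reward` rule, the choice to keep only successful attempts, and the names `fit_policy`, `act`, `evolve`, and `success_rate` are all assumptions made up for illustration.

```python
import random
from collections import Counter, defaultdict

# Hypothetical toy task (not part of AgentGym): the agent sees an integer
# state and succeeds only if it picks the action equal to state % 3.
ACTIONS = [0, 1, 2]

def reward(state, action):
    return 1.0 if action == state % 3 else 0.0

def fit_policy(pairs):
    """Behavioral cloning for a tabular policy: count (state, action) pairs."""
    counts = defaultdict(Counter)
    for state, action in pairs:
        counts[state][action] += 1
    return counts

def act(policy, state, rng):
    """Sample an action from the policy; unseen states fall back to uniform."""
    seen = policy.get(state)
    if not seen:
        return rng.choice(ACTIONS)
    actions, weights = zip(*seen.items())
    return rng.choices(actions, weights=weights, k=1)[0]

def evolve(expert_data, instructions, iterations=5, samples_per_state=8, seed=0):
    rng = random.Random(seed)
    policy = fit_policy(expert_data)           # stage 1: behavioral cloning
    for _ in range(iterations):                # stage 2: explore, then learn
        explored = []
        for state in instructions:
            for _ in range(samples_per_state):
                action = act(policy, state, rng)
                if reward(state, action) > 0:  # assumption: keep successes only
                    explored.append((state, action))
        # Refit on expert data plus the successful explored attempts.
        policy = fit_policy(expert_data + explored)
    return policy

def success_rate(policy, states):
    """Greedy evaluation: take the most frequent action seen for each state."""
    hits = 0.0
    for state in states:
        seen = policy.get(state)
        action = seen.most_common(1)[0][0] if seen else ACTIONS[0]
        hits += reward(state, action)
    return hits / len(states)

expert = [(0, 0), (1, 1), (2, 2)]   # small demonstration set
tasks = list(range(10))             # broader instruction set
bc_only = fit_policy(expert)
evolved = evolve(expert, tasks)
print(success_rate(bc_only, tasks), success_rate(evolved, tasks))
```

In this sketch the cloned policy only covers the three demonstrated states, while the evolved policy also picks up behavior for states it met during exploration. A real agent would replace the count table with an LLM fine-tuning step and the `reward` rule with environment feedback.
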

## Citation

- TODO