ShawLiu commited on
Commit
ecf8851
1 Parent(s): b464d8e

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +44 -0
README.md ADDED
@@ -0,0 +1,44 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: other
3
+ language:
4
+ - en
5
+ base_model:
6
+ - meta/Llama-3.1-8B
7
+ tags:
8
+ - webrl
9
+ - llama3.1
10
+ - webarena-lite
11
+ - llm
12
+ - agent
13
+ ---
14
+
15
+ # WebRL-Llama-3.1-8B
16
+
17
+ ## Model Introduction
18
+ WebRL-Llama-3.1-8B is the open-source version of WebRL in Llama-3.1-8B released by Zhipu AI.
19
+ It has the ability to complete web operations on five websites in WebArena: OpenStreetMap (Map), Reddit, GitLab, online store content management system (CMS) and OneStopShop (OSS).
20
+
21
+ ## Evaluation Results
22
+
23
+ We evaluated the WebRL-GLM-4-9B model on WebArena-Lite and obtained the following results:
24
+
25
+ | Model | Reddit | Gitlab | CMS | Map | OSS | Avg.SR |
26
+ |:--------------------|:------:|:------:|:------:|:------:|:------:|:--------:|
27
+ | Llama-3.1-8B-Instruct | 0.0 | 3.3 | 2.9 | 3.3 | 11.1 | 4.8 |
28
+ | GLM-4-9B-Chat | 5.3 | 10.0 | 6.7 | 3.3 | 6.7 | 6.1 |
29
+ | WebRL-Llama-3.1-8B | 63.2 | 46.7 | 54.3 | 36.7 | 31.1 | 42.4 |
30
+
31
+ **For more inference code and requirements, please visit our [github page](GitHub - THUDM/WebRL).**
32
+
33
+ ## Citations
34
+
35
+ If you find our work useful, please consider citing the following paper.
36
+
37
+ ```
38
+ @artical{qi2024webrl,
39
+ title={WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning},
40
+ author={Zehan Qi and Xiao Liu and Iat Long Iong and Hanyu Lai and Xueqiao Sun and Xinyue Yang and Jiadai Sun and Yu Yang and Shuntian Yao and Tianjie Zhang and Wei Xu and Jie Tang and Yuxiao Dong},
41
+ journal={arXiv preprint arXiv:2411.02337},
42
+ year={2024},
43
+ }
44
+ ```