1 parent f405fc7 commit 892367a
.gitignore
@@ -18,3 +18,8 @@
 utils/reload/*.js
 utils/reload/injections/*.js
 public/manifest.json
+
+# benchmarking
+/results
+tasks_test.jsonl
+webwand_test_log.txt
benchmark.py
@@ -1,3 +1,10 @@
+"""
+To run benchmark testing, copy and paste corresponding test dataset from https://github.com/MinorJerry/WebVoyager/blob/main/data/WebVoyager_data.jsonl
+to the tasks_test.jsonl file.
+Then in terminal, first run "pnpm dev" to start the webwand server.
+Then run "python benchmark.py"
+"""
+
 import base64
 import os
 import json
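
The docstring above asks for the test dataset to be pasted into tasks_test.jsonl before running the benchmark. As a rough sketch of how such a file could be read, assuming only that it holds one JSON object per line, the snippet below parses it into a list of dictionaries. The names `parse_jsonl` and `load_tasks` are hypothetical and do not come from benchmark.py, and nothing is assumed about the fields inside each task.

```python
import json
from pathlib import Path
from typing import Iterable


def parse_jsonl(lines: Iterable[str]) -> list[dict]:
    """Parse JSON Lines text: one JSON object per non-blank line."""
    tasks = []
    for number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines left over from copy and paste
        try:
            tasks.append(json.loads(line))
        except json.JSONDecodeError as exc:
            # Report the offending line so a bad paste is easy to locate.
            raise ValueError(f"line {number} is not valid JSON: {exc}") from exc
    return tasks


def load_tasks(path: str = "tasks_test.jsonl") -> list[dict]:
    """Read the tasks file named in the docstring (hypothetical helper)."""
    with Path(path).open(encoding="utf-8") as handle:
        return parse_jsonl(handle)
```

Calling `load_tasks()` from the directory that contains tasks_test.jsonl would return the parsed tasks. A malformed line raises `ValueError` with its line number rather than being skipped silently.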