Commit Graph

257 Commits

Author SHA1 Message Date
Magnus Müller
a0ffb70b01 remove test self hosted 2025-09-03 10:52:36 -07:00
Magnus Müller
6bba023d38 Reduce timeout for GitHub Actions tests from 15 to 10 minutes to prevent hanging. 2025-09-02 00:24:09 -07:00
Magnus Müller
172951b8cd Fix GitHub Actions workflow hanging: add timeouts and force fresh checkout
- Add timeout-minutes to prevent jobs from hanging indefinitely
- Force fresh checkout with fetch-depth: 1 to avoid cache issues
- Add file existence check to handle renamed/deleted tests gracefully
- Add debug output to track test discovery process
2025-09-02 00:20:54 -07:00
Magnus Müller
dd6a187fc0 Type 2025-09-02 00:14:20 -07:00
Magnus Müller
616c81e435 Skip none existing tests 2025-09-02 00:14:06 -07:00
Magnus Müller
723c68c20c Test fails if 0% 2025-08-29 18:38:24 -07:00
Saurav Panda
dda797693a feat: added a stale bot 2025-08-28 12:41:09 -07:00
Magnus Müller
9210c6c79c Pip test 2025-08-26 17:33:43 -07:00
Magnus Müller
657ea42efd Remove Playwright/Patchright dependencies - use uvx playwright install for browser only
- Updated test.yaml to use uvx playwright install chromium --with-deps --no-shell
- Removed Chrome stable installation and patchright dependencies
- Updated Dockerfile to use uvx playwright temporarily for browser installation
- Updated chromium Dockerfile to use uvx playwright without permanent dependency
- Simplified CI pipeline while maintaining browser functionality
2025-08-26 17:27:41 -07:00
Magnus Müller
caa0e7ef1b Rename controller to tools instances 2025-08-26 11:30:39 -07:00
Magnus Müller
13dd73b23a Update Tool imports in docs, readme, and docstrings 2025-08-26 11:13:27 -07:00
Nick Sweeting
2a0f4bd93a fix publish script 2025-08-20 16:24:09 -07:00
Nick Sweeting
226c992fdb try self-hosted runner for packaging workflow 2025-08-20 19:21:55 -04:00
Magnus Müller
d96b6f4b23 update pip workflow file 2025-08-18 17:28:37 -07:00
Nick Sweeting
f5925b1080 fix browser-use extensions dir to use .config/browseruse like everything else and cache in actions 2025-07-29 13:04:44 -07:00
Nick Sweeting
845e83099b Update claude.yml 2025-07-19 02:03:30 -04:00
Nick Sweeting
2c9eaf700a Update claude.yml 2025-07-15 18:42:37 -04:00
Nick Sweeting
f46fbc6ce5 Update claude.yml to use opus 2025-07-15 17:51:33 -04:00
Nick Sweeting
464a51512f more claude code permissions tweaks and set up python env 2025-07-15 17:42:52 -04:00
Nick Sweeting
520fc3abc4 Give claude code action more bash permissions 2025-07-15 17:29:09 -04:00
Alexander Yue
4a2e52ab36 Merge branch 'main' into move-eval 2025-07-11 22:36:41 -07:00
Alezander9
883c47bb04 move eval workflow 2025-07-11 22:35:47 -07:00
Nick Sweeting
fd07360f57 simpler page title fetching 2025-07-10 18:35:24 -07:00
Nick Sweeting
cdca6339f6 try all browsers for evals 2025-07-10 16:24:47 -07:00
Nick Sweeting
cb4f6be2a8 Merge branch 'main' into cdp-loading 2025-07-10 05:55:01 -07:00
Gregor Žunič
1eac01dfab openrouter evals support 2025-07-10 12:51:52 +02:00
Nick Sweeting
4b4b93f6cc tweak chrome used for test.yaml evaluate_Tasks 2025-07-09 15:32:10 -07:00
Nick Sweeting
3bd76cea98 tweak chrome used for test.yaml evaluate_Tasks 2025-07-09 15:26:07 -07:00
Magnus Müller
bc5ff33b09 eval remove laminar args 2025-07-09 20:02:00 +02:00
Magnus Müller
f5763bdf49 add browser use logging 2025-07-09 18:01:09 +02:00
Magnus Müller
db3fa28442 eval laminar key name 2025-07-09 17:59:47 +02:00
Nick Sweeting
435426fc9a bump cache action version 2025-07-08 18:40:36 -07:00
Nick Sweeting
b4a8776fec speed up chrome install in CI 2025-07-08 18:19:08 -07:00
Nick Sweeting
1fa7fee4f6 fix cache key for tests 2025-07-08 18:10:14 -07:00
Nick Sweeting
7cf6a26664 fix flipped order 2025-07-08 18:07:43 -07:00
Nick Sweeting
1c6b510f07 use sudo for curl to update 2025-07-08 18:05:20 -07:00
Nick Sweeting
28f0d4d401 use runner arch in cache key 2025-07-08 18:03:11 -07:00
Nick Sweeting
b206db41a1 use consistent bin name 2025-07-08 18:01:24 -07:00
Nick Sweeting
32e5430b62 only cache actual binary 2025-07-08 18:00:55 -07:00
Nick Sweeting
14030006db fix missing sudo 2025-07-08 17:57:18 -07:00
Nick Sweeting
4599f815f2 try to cache chrome apt package 2025-07-08 17:55:44 -07:00
Aitor
e409c36fd7 feat: forward unikraft secrets to the eval workflow .yaml 2025-07-08 17:40:59 +02:00
Nick Sweeting
3f84d1c460 set in_docker in evals 2025-07-08 06:09:30 -07:00
Nick Sweeting
4d8bdb3dbf install all browser versions for evals and tests 2025-07-08 06:05:20 -07:00
Nick Sweeting
7403c33be3 fix user-data-dir matching 2025-07-08 05:05:49 -07:00
Magnus Müller
8ed1f6cb88 Update failing test 2025-07-08 13:43:12 +02:00
Nick Sweeting
fdba54fb34 add pyright to pre-commit hooks 2025-07-07 18:03:55 -07:00
reformedot
b7fa04d336 feat: add parameters to remove images and css in the eval.yaml 2025-07-07 16:26:54 +02:00
Aitor
d032a1ec61 fix: update eval.yaml to use full HD screen resolution 2025-07-07 09:44:50 +02:00
reformedot
9de712d702 feat: added browser settings to browser profile 2025-07-06 20:00:06 +02:00