Commit Graph

358 Commits

Author SHA1 Message Date
Magnus Müller
aef11dc6f1 Parallel tests 2025-10-25 09:56:33 -07:00
Magnus Müller
4db51f9e24 Ci cd test update 2025-10-25 09:12:15 -07:00
Magnus Müller
96d27cc646 Template 2025-10-24 13:28:19 -07:00
Magnus Müller
9d2e379af7 Api key 2025-10-24 02:40:35 -07:00
Magnus Müller
c1abe88048 Higher timeout + weekly chrome cache 2025-10-24 02:30:29 -07:00
Magnus Müller
9f711b4281 Reduce timeout 2025-10-24 01:41:22 -07:00
Magnus Müller
46e727f7bd Enhance CI workflow by adding conditional caching for Chromium installation
This update introduces a caching mechanism for Chromium binaries in the GitHub Actions workflow. The installation of Chromium will now only occur if it is not already cached, reducing unnecessary downloads and speeding up the CI process. This change aims to optimize the workflow efficiency, particularly during parallel test runs.
2025-10-24 01:28:39 -07:00
Magnus Müller
e6cb8e7587 Add explicit uv package caching to speed up CI
The built-in astral-sh/setup-uv cache wasn't working properly,
causing 2m+ downloads of Python packages on every test run.

Added explicit ~/.cache/uv caching keyed on uv.lock hash.

Before: 2m 9s downloading packages (numpy, oci, imageio-ffmpeg, etc)
After: ~5s restoring from cache

Saves ~2 minutes per test job × 40 parallel jobs = 80 runner-minutes saved
2025-10-24 01:25:03 -07:00
Magnus Müller
b31c3ab0b4 no sync pip 2025-10-24 01:24:41 -07:00
Magnus Müller
7b0b92991c Optimize lint workflow by skipping redundant uv sync checks
Add --no-sync flag to uv run commands that come after uv sync.

Before:
- uv sync (50s)
- uv run pre-commit (50s sync check + 8s run = 58s)
- Total: 108s

After:
- uv sync (50s)
- uv run --no-sync pre-commit (8s run)
- Total: 58s

Saves 50 seconds per lint job (46% faster)
2025-10-24 01:23:46 -07:00
Magnus Müller
2c7d3372e1 Add setup-chromium job to pre-cache chromium before parallel test runs
Prevents 40 parallel runners from racing to install chromium simultaneously on cache miss.

Before: 40 runners × 2min = 80 runner-minutes wasted on first run
After: 1 runner installs (2min), then 40 runners use cached version (10s each)

Savings on cache miss: ~78 runner-minutes per workflow run
2025-10-24 01:19:06 -07:00
Magnus Müller
a2b0f084be Refactor GitHub workflows to remove Playwright version detection and update cache keys for Chromium binaries based on uv.lock file. 2025-10-24 01:07:39 -07:00
Magnus Müller
2ca0ba38ec Bug playwright not found 2025-10-24 01:03:36 -07:00
Magnus Müller
633fb53b1a Enhance GitHub workflows by adding Playwright version detection and updating cache keys for Chromium binaries based on the installed Playwright version. 2025-10-24 01:00:26 -07:00
Magnus Müller
a6d080d73f Update GitHub workflows to include branch and tag triggers for Docker and package workflows 2025-10-24 00:58:22 -07:00
Saurav Panda
85a2386c91 feat: added cloud_browser feature 2025-09-16 22:27:45 -07:00
Magnus Müller
5c9d7c9a90 Test cancel 2025-09-08 08:54:01 -07:00
Magnus Müller
4bc50f37e1 Test cancel 2025-09-08 08:53:08 -07:00
Magnus Müller
aba7e2cbae cancel tests on new commit 2025-09-08 08:52:25 -07:00
Magnus Müller
a0ffb70b01 remove test self hosted 2025-09-03 10:52:36 -07:00
Magnus Müller
6bba023d38 Reduce timeout for GitHub Actions tests from 15 to 10 minutes to prevent hanging. 2025-09-02 00:24:09 -07:00
Magnus Müller
172951b8cd Fix GitHub Actions workflow hanging: add timeouts and force fresh checkout
- Add timeout-minutes to prevent jobs from hanging indefinitely
- Force fresh checkout with fetch-depth: 1 to avoid cache issues
- Add file existence check to handle renamed/deleted tests gracefully
- Add debug output to track test discovery process
2025-09-02 00:20:54 -07:00
Magnus Müller
dd6a187fc0 Type 2025-09-02 00:14:20 -07:00
Magnus Müller
616c81e435 Skip none existing tests 2025-09-02 00:14:06 -07:00
Magnus Müller
723c68c20c Test fails if 0% 2025-08-29 18:38:24 -07:00
Saurav Panda
dda797693a feat: added a stale bot 2025-08-28 12:41:09 -07:00
Magnus Müller
82b241ddbb Linter new line 2025-08-26 18:13:11 -07:00
Magnus Müller
9210c6c79c Pip test 2025-08-26 17:33:43 -07:00
Magnus Müller
657ea42efd Remove Playwright/Patchright dependencies - use uvx playwright install for browser only
- Updated test.yaml to use uvx playwright install chromium --with-deps --no-shell
- Removed Chrome stable installation and patchright dependencies
- Updated Dockerfile to use uvx playwright temporarily for browser installation
- Updated chromium Dockerfile to use uvx playwright without permanent dependency
- Simplified CI pipeline while maintaining browser functionality
2025-08-26 17:27:41 -07:00
Magnus Müller
147ed30d02 Remove planner llm 2025-08-26 14:50:14 -07:00
Magnus Müller
caa0e7ef1b Rename controller to tools instances 2025-08-26 11:30:39 -07:00
Magnus Müller
dbfaffa8d1 Agent service update tools with backward compatible 2025-08-26 11:24:06 -07:00
Magnus Müller
13dd73b23a Update Tool imports in docs, readme, and docstrings 2025-08-26 11:13:27 -07:00
Nick Sweeting
2a0f4bd93a fix publish script 2025-08-20 16:24:09 -07:00
Nick Sweeting
226c992fdb try self-hosted runner for packaging workflow 2025-08-20 19:21:55 -04:00
Magnus Müller
d96b6f4b23 update pip workflow file 2025-08-18 17:28:37 -07:00
Nick Sweeting
f5925b1080 fix browser-use extensions dir to use .config/browseruse like everything else and cache in actions 2025-07-29 13:04:44 -07:00
Nick Sweeting
845e83099b Update claude.yml 2025-07-19 02:03:30 -04:00
Nick Sweeting
2c9eaf700a Update claude.yml 2025-07-15 18:42:37 -04:00
Nick Sweeting
f46fbc6ce5 Update claude.yml to use opus 2025-07-15 17:51:33 -04:00
Nick Sweeting
464a51512f more claude code permissions tweaks and set up python env 2025-07-15 17:42:52 -04:00
Nick Sweeting
520fc3abc4 Give claude code action more bash permissions 2025-07-15 17:29:09 -04:00
Alexander Yue
4a2e52ab36 Merge branch 'main' into move-eval 2025-07-11 22:36:41 -07:00
Alezander9
883c47bb04 move eval workflow 2025-07-11 22:35:47 -07:00
Nick Sweeting
fd07360f57 simpler page title fetching 2025-07-10 18:35:24 -07:00
Nick Sweeting
cdca6339f6 try all browsers for evals 2025-07-10 16:24:47 -07:00
Nick Sweeting
cb4f6be2a8 Merge branch 'main' into cdp-loading 2025-07-10 05:55:01 -07:00
Gregor Žunič
1eac01dfab openrouter evals support 2025-07-10 12:51:52 +02:00
Nick Sweeting
4b4b93f6cc tweak chrome used for test.yaml evaluate_Tasks 2025-07-09 15:32:10 -07:00
Nick Sweeting
3bd76cea98 tweak chrome used for test.yaml evaluate_Tasks 2025-07-09 15:26:07 -07:00