Magnus Müller
aef11dc6f1
Parallel tests
2025-10-25 09:56:33 -07:00
Magnus Müller
4db51f9e24
Ci cd test update
2025-10-25 09:12:15 -07:00
Magnus Müller
9d2e379af7
Api key
2025-10-24 02:40:35 -07:00
Magnus Müller
c1abe88048
Higher timeout + weekly chrome cache
2025-10-24 02:30:29 -07:00
Magnus Müller
9f711b4281
Reduce timeout
2025-10-24 01:41:22 -07:00
Magnus Müller
46e727f7bd
Enhance CI workflow by adding conditional caching for Chromium installation
...
This update introduces a caching mechanism for Chromium binaries in the GitHub Actions workflow. The installation of Chromium will now only occur if it is not already cached, reducing unnecessary downloads and speeding up the CI process. This change aims to optimize the workflow efficiency, particularly during parallel test runs.
2025-10-24 01:28:39 -07:00
Magnus Müller
e6cb8e7587
Add explicit uv package caching to speed up CI
...
The built-in astral-sh/setup-uv cache wasn't working properly,
causing 2m+ downloads of Python packages on every test run.
Added explicit ~/.cache/uv caching keyed on uv.lock hash.
Before: 2m 9s downloading packages (numpy, oci, imageio-ffmpeg, etc)
After: ~5s restoring from cache
Saves ~2 minutes per test job × 40 parallel jobs = 80 runner-minutes saved
2025-10-24 01:25:03 -07:00
Magnus Müller
b31c3ab0b4
no sync pip
2025-10-24 01:24:41 -07:00
Magnus Müller
7b0b92991c
Optimize lint workflow by skipping redundant uv sync checks
...
Add --no-sync flag to uv run commands that come after uv sync.
Before:
- uv sync (50s)
- uv run pre-commit (50s sync check + 8s run = 58s)
- Total: 108s
After:
- uv sync (50s)
- uv run --no-sync pre-commit (8s run)
- Total: 58s
Saves 50 seconds per lint job (46% faster)
2025-10-24 01:23:46 -07:00
Magnus Müller
2c7d3372e1
Add setup-chromium job to pre-cache chromium before parallel test runs
...
Prevents 40 parallel runners from racing to install chromium simultaneously on cache miss.
Before: 40 runners × 2min = 80 runner-minutes wasted on first run
After: 1 runner installs (2min), then 40 runners use cached version (10s each)
Savings on cache miss: ~78 runner-minutes per workflow run
2025-10-24 01:19:06 -07:00
Magnus Müller
a2b0f084be
Refactor GitHub workflows to remove Playwright version detection and update cache keys for Chromium binaries based on uv.lock file.
2025-10-24 01:07:39 -07:00
Magnus Müller
2ca0ba38ec
Bug playwright not found
2025-10-24 01:03:36 -07:00
Magnus Müller
633fb53b1a
Enhance GitHub workflows by adding Playwright version detection and updating cache keys for Chromium binaries based on the installed Playwright version.
2025-10-24 01:00:26 -07:00
Magnus Müller
a6d080d73f
Update GitHub workflows to include branch and tag triggers for Docker and package workflows
2025-10-24 00:58:22 -07:00
Magnus Müller
5c9d7c9a90
Test cancel
2025-09-08 08:54:01 -07:00
Magnus Müller
4bc50f37e1
Test cancel
2025-09-08 08:53:08 -07:00
Magnus Müller
aba7e2cbae
cancel tests on new commit
2025-09-08 08:52:25 -07:00
Magnus Müller
a0ffb70b01
remove test self hosted
2025-09-03 10:52:36 -07:00
Magnus Müller
6bba023d38
Reduce timeout for GitHub Actions tests from 15 to 10 minutes to prevent hanging.
2025-09-02 00:24:09 -07:00
Magnus Müller
172951b8cd
Fix GitHub Actions workflow hanging: add timeouts and force fresh checkout
...
- Add timeout-minutes to prevent jobs from hanging indefinitely
- Force fresh checkout with fetch-depth: 1 to avoid cache issues
- Add file existence check to handle renamed/deleted tests gracefully
- Add debug output to track test discovery process
2025-09-02 00:20:54 -07:00
Magnus Müller
dd6a187fc0
Type
2025-09-02 00:14:20 -07:00
Magnus Müller
616c81e435
Skip none existing tests
2025-09-02 00:14:06 -07:00
Magnus Müller
723c68c20c
Test fails if 0%
2025-08-29 18:38:24 -07:00
Saurav Panda
dda797693a
feat: added a stale bot
2025-08-28 12:41:09 -07:00
Magnus Müller
9210c6c79c
Pip test
2025-08-26 17:33:43 -07:00
Magnus Müller
657ea42efd
Remove Playwright/Patchright dependencies - use uvx playwright install for browser only
...
- Updated test.yaml to use uvx playwright install chromium --with-deps --no-shell
- Removed Chrome stable installation and patchright dependencies
- Updated Dockerfile to use uvx playwright temporarily for browser installation
- Updated chromium Dockerfile to use uvx playwright without permanent dependency
- Simplified CI pipeline while maintaining browser functionality
2025-08-26 17:27:41 -07:00
Magnus Müller
caa0e7ef1b
Rename controller to tools instances
2025-08-26 11:30:39 -07:00
Magnus Müller
13dd73b23a
Update Tool imports in docs, readme, and docstrings
2025-08-26 11:13:27 -07:00
Nick Sweeting
2a0f4bd93a
fix publish script
2025-08-20 16:24:09 -07:00
Nick Sweeting
226c992fdb
try self-hosted runner for packaging workflow
2025-08-20 19:21:55 -04:00
Magnus Müller
d96b6f4b23
update pip workflow file
2025-08-18 17:28:37 -07:00
Nick Sweeting
f5925b1080
fix browser-use extensions dir to use .config/browseruse like everything else and cache in actions
2025-07-29 13:04:44 -07:00
Nick Sweeting
845e83099b
Update claude.yml
2025-07-19 02:03:30 -04:00
Nick Sweeting
2c9eaf700a
Update claude.yml
2025-07-15 18:42:37 -04:00
Nick Sweeting
f46fbc6ce5
Update claude.yml to use opus
2025-07-15 17:51:33 -04:00
Nick Sweeting
464a51512f
more claude code permissions tweaks and set up python env
2025-07-15 17:42:52 -04:00
Nick Sweeting
520fc3abc4
Give claude code action more bash permissions
2025-07-15 17:29:09 -04:00
Alexander Yue
4a2e52ab36
Merge branch 'main' into move-eval
2025-07-11 22:36:41 -07:00
Alezander9
883c47bb04
move eval workflow
2025-07-11 22:35:47 -07:00
Nick Sweeting
fd07360f57
simpler page title fetching
2025-07-10 18:35:24 -07:00
Nick Sweeting
cdca6339f6
try all browsers for evals
2025-07-10 16:24:47 -07:00
Nick Sweeting
cb4f6be2a8
Merge branch 'main' into cdp-loading
2025-07-10 05:55:01 -07:00
Gregor Žunič
1eac01dfab
openrouter evals support
2025-07-10 12:51:52 +02:00
Nick Sweeting
4b4b93f6cc
tweak chrome used for test.yaml evaluate_Tasks
2025-07-09 15:32:10 -07:00
Nick Sweeting
3bd76cea98
tweak chrome used for test.yaml evaluate_Tasks
2025-07-09 15:26:07 -07:00
Magnus Müller
bc5ff33b09
eval remove laminar args
2025-07-09 20:02:00 +02:00
Magnus Müller
f5763bdf49
add browser use logging
2025-07-09 18:01:09 +02:00
Magnus Müller
db3fa28442
eval laminar key name
2025-07-09 17:59:47 +02:00
Nick Sweeting
435426fc9a
bump cache action version
2025-07-08 18:40:36 -07:00
Nick Sweeting
b4a8776fec
speed up chrome install in CI
2025-07-08 18:19:08 -07:00