Commit Graph

444 Commits

Author SHA1 Message Date
Nick Sweeting
b061e9bc22 limit navigation time for tests so they fail faster 2025-07-10 18:40:08 -07:00
Nick Sweeting
848fdf8aad clearer exception handling 2025-07-10 18:26:08 -07:00
Nick Sweeting
4f55a0e916 bump sleep time 2025-07-10 18:02:54 -07:00
Nick Sweeting
c96af9db7f allow more time in recovery 2025-07-10 18:02:30 -07:00
Nick Sweeting
f2091a86d5 make screenshot tests parallel more lax 2025-07-10 17:57:09 -07:00
Nick Sweeting
651ae9101b force page list refresh after force CDP close of page 2025-07-10 17:29:19 -07:00
Nick Sweeting
b05da141b9 lax timeouts for pageload for easier tests 2025-07-10 17:19:14 -07:00
Nick Sweeting
c321656ffe use 1px white png for about:blank screenshots 2025-07-10 16:58:24 -07:00
Nick Sweeting
317b2d1fe3 allow agent to attempt to use crashed pages 2025-07-10 16:30:58 -07:00
Nick Sweeting
cdca6339f6 try all browsers for evals 2025-07-10 16:24:47 -07:00
Nick Sweeting
d66cf32615 tweak nav error handling and logging 2025-07-10 16:22:18 -07:00
Nick Sweeting
e461380998 switch evaluate_tasks back to patchright+chrome 2025-07-09 14:51:42 -07:00
mertunsall
13aa27b84e fix tests 2025-07-09 16:20:07 +02:00
Nick Sweeting
4d8bdb3dbf install all browser versions for evals and tests 2025-07-08 06:05:20 -07:00
Nick Sweeting
80327bb63e dont explode all tests when a single screenshot fails 2025-07-08 05:40:04 -07:00
Nick Sweeting
54a413d255 tweak screenshot timing 2025-07-08 05:28:50 -07:00
Nick Sweeting
fc0af9e1e9 resolve pytest-httpserver hanging on stop after test cleanup 2025-07-08 03:56:02 -07:00
Nick Sweeting
a7649fb197 sequential_agents test fix 2025-07-08 03:28:00 -07:00
Nick Sweeting
48086d0cdd use retry decorator from bubus 2025-07-08 03:16:22 -07:00
Nick Sweeting
58f374e010 Merge branch 'main' into browser-subprocess 2025-07-08 02:46:20 -07:00
Nick Sweeting
932953683a fix trustedTypes html and fallback test 2025-07-08 02:45:48 -07:00
Nick Sweeting
af0697f12b fix tests and other improvements 2025-07-08 02:41:48 -07:00
Magnus Müller
2a36c8003e Update example and test 2025-07-08 11:16:00 +02:00
Nick Sweeting
9b552ee8ac test and telemetry fixes 2025-07-08 01:37:47 -07:00
Nick Sweeting
48d88bff33 fix sensitive_data tests 2025-07-08 01:16:17 -07:00
Nick Sweeting
5171916bb4 fix har recording and logging tests 2025-07-08 01:12:10 -07:00
Nick Sweeting
1b9f7253f0 fix tests and simplify pid detection 2025-07-08 00:56:25 -07:00
Nick Sweeting
45bf9815a5 fix cookies errors in tests and launch pid detection 2025-07-08 00:37:46 -07:00
Nick Sweeting
ab36c9a301 document multiple mcp servers connected to client 2025-07-07 18:04:32 -07:00
Nick Sweeting
ecdae5ca4c add unified tree iframe test 2025-07-07 17:18:15 -07:00
Nick Sweeting
983696aed1 Merge branch 'main' into mcp-server 2025-07-07 17:07:53 -07:00
Nick Sweeting
6dcf479fc6 add readme MCP section and fix lint errors 2025-07-07 17:07:06 -07:00
Daniel T.
c5a3c0cf2e Refines dynamic page state calculations
Replaces static page statistics with dynamic calculations for
viewport and scroll position metrics. Simplifies code logic in
scroll handling by removing unnecessary defaults for page
scrolling. Improves readability and reliability of page
scroll operations by requiring explicit num_pages parameter.

Enhances page-view consistency and user interaction handling.
2025-07-07 21:22:12 +02:00
Daniel T.
55c392a8a3 Merge remote-tracking branch 'upstream/main' into enhancement/improved-scroll-action 2025-07-07 19:04:35 +02:00
Daniel T.
539274a7d4 Updates scroll functionality to use page units
Replaces pixel-based scrolling with page unit determination, allowing for more intuitive scroll actions by specifying the number of pages. Adjusts related documentation, examples, and tests to reflect this change for improved code consistency and user experience.
2025-07-07 18:21:37 +02:00
Daniel T.
eb7c7fa2bc Enhances scroll action testing
Removes unnecessary Field description in ScrollAction to streamline code.

Implements additional test cases for varying scroll amounts, improving test coverage and validation of scrolling behavior in the integration tests.
2025-07-07 17:34:57 +02:00
Magnus Müller
718fe9f679 Fix controller tests 2025-07-07 16:06:21 +02:00
Magnus Müller
d3496874db Update test 2025-07-07 16:01:09 +02:00
Magnus Müller
ea481972b1 Merge branch 'main' into better-error-handling-and-lmnr-inclusion 2025-07-07 15:33:08 +02:00
Magnus Müller
1a7d5bf5ac Remove retry test because we dont need to retry tool calls - the llm can just decide how to contine 2025-07-07 15:30:26 +02:00
mertunsall
daf1754d58 move tests 2025-07-07 13:43:03 +02:00
Magnus Müller
d03196c09c Merge remote-tracking branch 'origin/main' into include-lmnr 2025-07-06 23:42:14 +02:00
Alezander9
7b5ea73b4d fix ruff errors 2025-07-05 20:48:37 -07:00
Alezander9
522f55f81e fix-screenshot-test 2025-07-05 20:24:27 -07:00
mertunsall
220f0bc994 update models to gpt-4.1 2025-07-05 10:32:49 +02:00
mertunsall
304b065d59 more security 2025-07-05 02:07:11 +02:00
Magnus Müller
dc43bc2528 Fixed test because we dont use include_in_memory anymore & success=False is not needed 2025-07-04 17:34:29 +02:00
Gregor Žunič
177d5bcb7e Merge branch 'main' into fuckaround/dom-processing-tree 2025-07-02 18:14:40 +02:00
Gregor Žunič
cf03b7a9d4 removed client reacts cache, refactor scripts 2025-06-30 15:21:41 +02:00
Gregor Žunič
3a50681e26 removed PERF_METRICS for nicer code view 2025-06-30 14:56:43 +02:00