Commit Graph

731 Commits

Author SHA1 Message Date
Magnus Müller
ba79f4df88 Linter 2025-10-25 09:59:30 -07:00
Magnus Müller
a991e19d75 Remove some models 2025-10-25 09:58:59 -07:00
Magnus Müller
a8cb8bae12 Remove deepseek 2025-10-25 09:56:38 -07:00
Magnus Müller
19df83f7ab Linter 2025-10-25 09:51:07 -07:00
Magnus Müller
8ac418b17f No api key fails in github 2025-10-25 09:50:59 -07:00
Magnus Müller
c342442a0b Linter 2025-10-25 09:38:28 -07:00
Magnus Müller
8a76853837 Basic model test 2025-10-25 09:36:56 -07:00
Magnus Müller
4db51f9e24 Ci cd test update 2025-10-25 09:12:15 -07:00
Magnus Müller
c1982936c9 Organize tests 2025-10-25 09:09:54 -07:00
Magnus Müller
90e4569125 Remove old 2025-10-25 08:46:40 -07:00
Magnus Müller
22b24723bf Cache 2025-10-24 23:53:56 -07:00
Magnus Müller
17db15320d Cloud browser 2025-10-24 23:53:50 -07:00
Magnus Müller
a00a999bdc Remove sleep 2025-10-24 23:47:36 -07:00
Magnus Müller
8d3f30da80 Patch api key 2025-10-24 23:47:01 -07:00
Magnus Müller
56badd6ca8 Move to interaction tests 2025-10-24 23:45:13 -07:00
Magnus Müller
763a7b325b Remove more 2025-10-24 23:43:17 -07:00
Magnus Müller
01dd419eaf Remove event tests because inside tools 2025-10-24 23:36:41 -07:00
Magnus Müller
b36b85715d Longer wait in actor press 2025-10-24 23:36:27 -07:00
Magnus Müller
daa74ddcfe Tests 2025-10-24 23:29:38 -07:00
Magnus Müller
bb8b801b5a Remove test 2025-10-24 23:23:26 -07:00
Magnus Müller
dd6ec0bf03 Delete files 2025-10-24 23:21:25 -07:00
Magnus Müller
096b29c9f7 Remove tab 2025-10-24 23:15:53 -07:00
Magnus Müller
5a8f30d378 Refactor tests to utilize direct action calls in Tools
- Simplified test implementations by replacing ActionModel usage with direct calls to Tools methods (e.g., navigate, click, upload_file).
- Enhanced readability and maintainability of test code by removing unnecessary model definitions.
- Utilized new helper methods for element lookup by ID, improving efficiency in finding elements during tests.

This change aligns with recent updates to the Tools API, making tests cleaner and more intuitive.
2025-10-24 23:08:21 -07:00
Magnus Müller
8fa6efa0c9 Tests simplified 2025-10-24 23:00:42 -07:00
Magnus Müller
95716350ef Linter 2025-10-24 19:03:42 -07:00
Magnus Müller
36c0405b4a Linter 2025-10-24 19:02:08 -07:00
Magnus Müller
f41da2d3b4 Fix dropdown selection for Vue.js and reactive frameworks (#3415)
Fixes dropdown selections not persisting in Vue.js, React, and other
reactive frameworks. The issue was that only 'change' events were
dispatched, but Vue's v-model requires 'input' events to register changes.

**Root Cause:**
Dropdown selection handler only dispatched 'change' events, missing the
'input' event required by reactive frameworks.

**Solution:**
Enhanced event dispatching sequence in default_action_watchdog.py:
1. Focus element first
2. Set value and selected state
3. Dispatch 'input' event (critical for Vue v-model and React)
4. Dispatch 'change' event (standard for select elements)
5. Blur element for validation

**Tests:**
Added comprehensive integration tests in tests/ci/interaction/:
- test_dropdown_vue_submit.py: Vue.js 3 with v-model
- test_dropdown_react_submit.py: React 18 with controlled components

Both tests verify dropdown selections persist through the reactive
framework's state management and form submission works correctly.

Closes #3415

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-24 18:54:57 -07:00
Magnus Müller
672e749d0f fix evaluate test 2025-10-24 14:17:48 -07:00
Magnus Müller
3cbea6fbf6 Add env key 2025-10-24 02:13:27 -07:00
Magnus Müller
6cf5c54075 Add api key 2025-10-24 02:12:40 -07:00
Magnus Müller
f53ae8c2db Split registry 2025-10-24 02:08:42 -07:00
Magnus Müller
1404935a5c Passed 0% 2025-10-24 01:56:39 -07:00
Magnus Müller
2c7d3372e1 Add setup-chromium job to pre-cache chromium before parallel test runs
Prevents 40 parallel runners from racing to install chromium simultaneously on cache miss.

Before: 40 runners × 2min = 80 runner-minutes wasted on first run
After: 1 runner installs (2min), then 40 runners use cached version (10s each)

Savings on cache miss: ~78 runner-minutes per workflow run
2025-10-24 01:19:06 -07:00
Magnus Müller
16e64ad42b Remove not needed tests 2025-10-24 00:56:10 -07:00
Magnus Müller
affe4a6665 remove test 2025-10-24 00:02:43 -07:00
Magnus Müller
4ac49de9cf fix(code-use): prevent done() execution inside conditional blocks
Previously, done() calls inside if/else/elif blocks would only show a warning
but still execute, potentially causing tasks to never complete if the condition
wasn't met. This changes the behavior to raise a RuntimeError, forcing the LLM
to restructure code properly.

The proper pattern is:
```python
# Validate and set result variables
if condition:
    result = "success"
else:
    result = "failure"

# Then call done() unconditionally
await done(result, success=True)
```

Changes:
- Upgraded warning to RuntimeError in namespace.py
- Added comprehensive test suite with 5 test cases
- Tests cover if/else/elif blocks and verify standalone done() still works

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-22 10:03:10 -07:00
Magnus Müller
490a5b84e1 Enhance test for actor page key presses: increased sleep duration for key presses, added debug output, and improved handling of empty output. 2025-10-21 22:39:28 -07:00
Magnus Müller
db8f533fe3 test 2025-10-21 22:36:28 -07:00
Magnus Müller
ff36e605e1 test 2025-10-21 22:33:23 -07:00
Magnus Müller
f55d789370 Fix tests 2025-10-21 22:28:50 -07:00
Magnus Müller
fd831afeef Test 2025-10-21 22:23:43 -07:00
Magnus Müller
23fbf75215 Formatter 2025-10-21 22:17:26 -07:00
Magnus Müller
afbc0a0913 Test 2025-10-21 22:15:08 -07:00
Magnus Müller
32ce9161be Tests 2025-10-21 22:03:05 -07:00
Magnus Müller
fad8ed06d6 Linter 2025-10-21 21:57:51 -07:00
Magnus Müller
b574c403f3 Fix test 2025-10-21 21:56:42 -07:00
Magnus Müller
9caa248316 Merge code7 branch changes 2025-10-21 18:55:54 -07:00
Reagan Hsu
bc0ce8c226 Merge branch 'main' into fix/keyboard-events-complete 2025-10-13 19:29:16 -07:00
Reagan Hsu
d3ea9a17e6 add new testing for Page.press() for Actors 2025-10-13 18:29:24 -07:00
Magnus Müller
4078b6e265 Remove while_holding_ctrl logic in click 2025-10-12 16:11:09 -07:00