Mirror of https://github.com/browser-use/browser-use (synced 2026-05-06 17:52:15 +02:00)
Refines the system prompt in judge_system.py: expands the context describing the browser-use agent and clarifies the evaluation criteria. Updates the JSON response structure to match the revised task satisfaction and trajectory quality metrics.
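
Because the JSON response structure changed, any code that consumes the judge's output may need to check the reply against the new shape. A minimal sketch of such a check follows. The key names `task_satisfaction`, `trajectory_quality` and `reasoning`, the `JudgeVerdict` class and the `parse_judge_response` helper are all hypothetical and are not taken from judge_system.py.

```python
import json
from dataclasses import dataclass

# Hypothetical key names for illustration only; the real schema is
# defined by the prompt in judge_system.py and is not reproduced here.
REQUIRED_KEYS = ("task_satisfaction", "trajectory_quality", "reasoning")


@dataclass
class JudgeVerdict:
    """Container for one judge reply (assumed shape)."""
    task_satisfaction: str
    trajectory_quality: str
    reasoning: str


def parse_judge_response(raw: str) -> JudgeVerdict:
    """Parse a raw JSON string and fail loudly if expected keys are absent."""
    data = json.loads(raw)
    if not isinstance(data, dict):
        raise ValueError("judge response must be a JSON object")
    missing = [key for key in REQUIRED_KEYS if key not in data]
    if missing:
        raise ValueError(f"judge response is missing keys: {missing}")
    # Coerce values to strings so downstream code sees a uniform type.
    return JudgeVerdict(**{key: str(data[key]) for key in REQUIRED_KEYS})
```

Failing on missing keys, rather than filling in defaults, makes a mismatch between the prompt and the parser visible as soon as the schema changes again.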