mirror of
https://github.com/browser-use/browser-use
synced 2026-05-06 17:52:15 +02:00
- Replaced dataclass ScoreBreakdown with Pydantic's BaseModel for JudgeResult. - Updated scoring guidelines to reflect percentage-based final scores. - Removed unnecessary fields and improved JSON response structure. - Enhanced error handling and logging for evaluation failures. - Simplified parsing logic for structured responses from the model.