## Problem
Single task cloud execution was failing with validation errors because
the evaluation script was generating local run IDs instead of using the
provided Convex run ID.
## Solution
- Modified to respect the provided parameter in single task mode
- Removed fragile local run ID generation that caused database
validation errors
- Now follows the same robust pattern as regular evaluation runs
- Maintains backward compatibility for local single task runs without
server
## Changes
- **service.py**: Use provided run_id when available in single task mode
instead of always generating a local ID
## Testing
- Single task cloud execution now works without validation errors
- Results are properly saved to the correct Convex run
- Local single task runs still work as fallback
Fixes the issue where single task runs were showing success but failing
to save results due to run ID mismatch.
<!-- This is an auto-generated description by cubic. -->
---
## Summary by cubic
Fixed single task cloud execution to use the provided Convex run ID,
preventing validation errors and ensuring results are saved correctly.
- **Bug Fixes**
- Uses the given run ID in single task mode instead of always generating
a local one.
- Keeps local single task runs working as before.
<!-- End of auto-generated description by cubic. -->
- Fixes single task cloud execution to properly use Convex run ID
- Removes fragile local run ID generation that caused validation errors
- Ensures single task results are saved to correct run in database
- Maintains backward compatibility for local single task runs
Auto-generated PR for branch: eval-single-task-run
<!-- This is an auto-generated description by cubic. -->
---
## Summary by cubic
Added support for running a single evaluation task by passing task
details directly, without needing to fetch tasks from the server.
- **New Features**
- Accepts task ID, text, website, and branch as parameters for single
task runs.
- Skips server fetch and uses a local run ID if only a single task is
provided.
- Results are saved locally or to the server if credentials are present.
<!-- End of auto-generated description by cubic. -->
- Eliminated the branch argument from both eval.yaml and service.py for single task mode, simplifying argument parsing.
- Updated related logic to ensure backward compatibility while maintaining functionality for task ID, text, and website.
- Enhanced environment variable loading for improved clarity and consistency.
- Introduced parameters for single task mode in eval.yaml, allowing task ID, text, website, and branch to be specified.
- Updated service.py to handle single task mode, including conditional saving to the server and local run ID generation.
- Enhanced argument parsing to accommodate single task mode, ensuring backward compatibility with existing multi-task functionality.