sure/.github at 94582612496b134b830a7002a6d90b7563c89263

mirror of https://github.com/we-promise/sure synced 2026-04-25 17:15:07 +02:00

Files

Sure Admin (bot) 43460664c4 feat(ci): improve LLM eval visibility in GitHub Actions (#1546)

* feat(ci): improve LLM eval visibility in GitHub Actions

- Add step summary output for each eval run (shows in GH UI)
- Add new 'summarize_evals' job that aggregates results from all matrix runs
- Generate markdown table with accuracy, cost, and duration for all evals
- Add threshold checking (fails workflow if accuracy < 70%)
- Include status icons (✅/❌) for quick visual assessment
- Show overall pass/fail status at the end of summary

* Fix LLM eval workflow summary

---------

Co-authored-by: SureBot <sure-bot@we-promise.com>
Co-authored-by: Juan José Mata <juanjo.mata@gmail.com>

2026-04-24 11:18:45 +02:00
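
The aggregation described in the commit message above (a markdown table with accuracy, cost, and duration, status icons, and a 70% accuracy threshold) could be sketched roughly as follows. This is an illustration only, not the workflow's actual code: `EvalResult`, `passed`, `render_summary`, the column layout, and the use of USD and seconds as units are all hypothetical names and assumptions introduced here.

```python
from dataclasses import dataclass

# Threshold taken from the commit message: the workflow fails if accuracy < 70%.
ACCURACY_THRESHOLD = 70.0


@dataclass
class EvalResult:
    """One eval run (hypothetical shape; the real result format is not shown)."""
    name: str
    accuracy: float    # percent, 0-100
    cost_usd: float    # assumed unit
    duration_s: float  # assumed unit


def passed(result, threshold=ACCURACY_THRESHOLD):
    """An eval passes when its accuracy is at or above the threshold."""
    return result.accuracy >= threshold


def render_summary(results, threshold=ACCURACY_THRESHOLD):
    """Return (markdown_text, all_passed) for a list of EvalResult."""
    lines = [
        "| Eval | Accuracy | Cost | Duration | Status |",
        "| --- | --- | --- | --- | --- |",
    ]
    for r in results:
        icon = "✅" if passed(r, threshold) else "❌"
        lines.append(
            f"| {r.name} | {r.accuracy:.1f}% | ${r.cost_usd:.2f} "
            f"| {r.duration_s:.1f}s | {icon} |"
        )
    all_ok = all(passed(r, threshold) for r in results)
    lines.append("")
    lines.append(f"Overall: {'✅ passed' if all_ok else '❌ failed'}")
    return "\n".join(lines), all_ok
```

In a GitHub Actions job, the returned markdown could be appended to the file named by the `GITHUB_STEP_SUMMARY` environment variable so that it shows in the run's summary page, and the process could exit non-zero when `all_passed` is false.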

DISCUSSION_TEMPLATE

Update feature-requests.yml

2024-05-20 12:16:30 -04:00

ISSUE_TEMPLATE

Remove Intercom integration (#51)

2025-08-01 19:47:48 +02:00

workflows

feat(ci): improve LLM eval visibility in GitHub Actions (#1546)

2026-04-24 11:18:45 +02:00

copilot-instructions.md

Enable selenium service in devcontainer for system tests (#1340)

2026-04-06 14:15:57 +02:00

dependabot.yml

Weekly dependabot checks (#407)

2024-02-09 08:24:34 -06:00