Skip to content

Conversation

@ZackMitchell910
Copy link

Summary

  • add a replay-only RunLedger eval suite (suite/case/schema/cassette + stub agent)
  • add a baseline file for regression gating
  • add a GitHub Actions workflow using runledger/Runledger@v0.1
  • add a small README note + ignore runledger_out/

How to run locally

runledger run evals/runledger --mode replay --baseline baselines/runledger-demo.json

Notes

  • no external calls; replay-only cassette
  • feel free to remove the suite/workflow if it is not desired

@ZackMitchell910
Copy link
Author

Thanks for taking a look! This PR adds a replay-only RunLedger gate. The workflow runs are currently waiting on fork approval (action_required) or have not started yet for forks. If you are open to it, please approve/authorize the workflow run so CI can complete. Happy to adjust anything.

@code-yeongyu
Copy link
Owner

what is this

@ZackMitchell910
Copy link
Author

It adds a small RunLedger check to CI so agent changes don’t break tools. It uses a replayed cassette (no live calls). Totally optional. Repo: https://github.com/runledger/Runledger

@code-yeongyu code-yeongyu force-pushed the master branch 5 times, most recently from 632f77a to bd8c43e Compare December 20, 2025 08:10
@darinkishore
Copy link
Contributor

what is this

lowkey an ad

@code-yeongyu code-yeongyu changed the base branch from master to dev December 21, 2025 09:02
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants