ci: implement multi-model evaluation support by omkargaikwad23 · Pull Request #169 · gemini-cli-extensions/cloud-sql-postgresql

omkargaikwad23 · 2026-05-07T06:02:05Z

Description:

Updated the CI pipeline to support evaluating multiple models dynamically

Key changes include:

Added specific evaluation configurations for Claude (model, dataset, and run configs)
Renamed existing configuration files to be explicitly Gemini-specific
Updated cloudbuild.yaml and substitute_env.py to automatically process and launch evaluations for all run configurations present in the workspace

…configs with dynamic discovery and adding Claude/Gemini-specific configurations.

… config

… evaluation

…N env var

… to 4-7, and restructured run settings

…e references

prernakakkar-google · 2026-05-08T07:10:24Z

Add detailed description before merging

ci: implement multi-model evaluation support by replacing static run …

a9394cc

…configs with dynamic discovery and adding Claude/Gemini-specific configurations.

omkargaikwad23 requested a review from a team as a code owner May 7, 2026 06:02

omkargaikwad23 added the ci:run-evals Manually trigger the evaluation CI pipeline on a PR. label May 7, 2026

omkargaikwad23 requested a review from a team as a code owner May 7, 2026 06:02

github-actions Bot assigned prernakakkar-google May 7, 2026

github-actions Bot requested a review from prernakakkar-google May 7, 2026 06:02

omkargaikwad23 added 4 commits May 7, 2026 06:12

ci: simplify PR label checks and update Claude model version in evals…

9aa8b4f

… config

ci: update model, region, and environment configuration for Cloud SQL…

8273535

… evaluation

ci: update evaluation model to claude-opus-4-1 and set CLOUD_ML_REGIO…

bbb044b

…N env var

ci: update claude-opus model version to 4-6 in eval configuration

abae0d3

omkargaikwad23 commented May 7, 2026

View reviewed changes

Comment thread evals/claude_code_model.yaml Outdated

omkargaikwad23 added 3 commits May 7, 2026 11:45

ci: update evaluation configuration with skill tagging, model upgrade…

5d5fc1c

… to 4-7, and restructured run settings

ci: upgrade claude-code version to 2.1.119

9dfca58

ci: split shared dataset into model-specific configurations and updat…

9b88419

…e references

prernakakkar-google approved these changes May 8, 2026

View reviewed changes

omkargaikwad23 merged commit 24b2db3 into main May 8, 2026
11 checks passed

omkargaikwad23 deleted the claude-ci-evals branch May 8, 2026 07:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: implement multi-model evaluation support #169

ci: implement multi-model evaluation support #169
omkargaikwad23 merged 8 commits intomainfrom
claude-ci-evals

omkargaikwad23 commented May 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

prernakakkar-google commented May 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

omkargaikwad23 commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description:

Uh oh!

Uh oh!

prernakakkar-google commented May 8, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

omkargaikwad23 commented May 7, 2026 •

edited

Loading