Reference Only: Benchmark redesign (1662 Feedback) by rlundeen2 · Pull Request #1664 · microsoft/PyRIT

rlundeen2 · 2026-04-28T17:28:45Z

The fundamental architectural difference: 1662 treats models as a strategy dimension (permuting them into enum
members), requiring two different strategy classes and a _prepare_strategies override to reconcile them.

This PR treats models as a runtime parameter (looping at create-time), keeping the strategy axis purely about technique selection — which is what it was designed for.

Replace static BENCHMARK_TECHNIQUES list with _get_benchmarkable_specs() that filters SCENARIO_TECHNIQUES using two criteria: - _accepts_adversarial(attack_class): technique CAN use adversarial model - adversarial_chat is None: technique does NOT have one baked in New adversarial techniques added to SCENARIO_TECHNIQUES are auto-discovered. Fix test to use _adversarial_chat private attr on AtomicAttack. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

rlundeen2 · 2026-04-28T18:05:08Z

+    ]
+
+
+def _build_benchmark_strategy() -> type[ScenarioStrategy]:


So much of the strategy is shared with rapid response, these two functions could likely use a helper

build_strategy_from_techniques

Victor Valbuena and others added 5 commits April 23, 2026 17:33

notes

0e86b33

draft PR

42d3ab5

tests

f5f1563

Merge branch 'main' into benchmark

d36ced0

redesign

f184e6b

rlundeen2 commented Apr 28, 2026

View reviewed changes

Comment thread pyrit/scenario/scenarios/benchmark/benchmark.py Outdated

rlundeen2 and others added 2 commits April 28, 2026 10:38

redesign

294c5d6

rlundeen2 mentioned this pull request Apr 28, 2026

[DRAFT] FEAT: Benchmark Scenario #1662

Draft

rlundeen2 commented Apr 28, 2026

View reviewed changes

rlundeen2 changed the title ~~Benchmark redesign (1662 Feedback)~~ Reference Only: Benchmark redesign (1662 Feedback) Apr 28, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reference Only: Benchmark redesign (1662 Feedback)#1664

Reference Only: Benchmark redesign (1662 Feedback)#1664
rlundeen2 wants to merge 7 commits intomicrosoft:mainfrom
rlundeen2:benchmark-redesign

rlundeen2 commented Apr 28, 2026

Uh oh!

Uh oh!

rlundeen2 Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

rlundeen2 commented Apr 28, 2026

Uh oh!

Uh oh!

rlundeen2 Apr 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants