refactor(intrinsics): requirement_check_to_bool raises on schema mismatch (Epic #929 Phase 1) by planetf1 · Pull Request #1320 · generative-computing/mellea

planetf1 · 2026-06-23T16:37:30Z

Context

This is one of three parallel Wave 3 issues in Epic #929 Phase 1:

| Issue | PR | File | Status |
|---|---|---|---|
| #1137 | #1321 | `rag.py` migration | ⬜ Draft |
| #1138 | This PR | `requirement_check` migration | This PR |
| #1139 | - | `guardian.py` migration | ⬜ Pending |

Phase 1 is gated on #1136 (PR #1269, merged 2026-06-23), which rewrote call_intrinsic to use the new resolve_adapter() path from the Phase 0 scaffolding. All three Wave 3 issues were unblocked by that merge.

No file overlap with #1321 — that PR touches _util.py and rag.py; this one touches core.py, requirement.py, and tests. They can merge in any order.

Approach note: #1321 introduces IOContract subclasses wired through call_intrinsic. This PR uses inline validation directly in the helper functions. The IOContract-based approach for requirement-check is Phase 2 work; inline validation here achieves the stated goal of #1138 (loud schema mismatch) without depending on Phase 2 being complete.

Problem

PR #1008 changed the requirement-check adapter output schema from {"requirement_likelihood": 0.9} to {"requirement_check": {"score": 0.9}}, but requirement_check_to_bool was never updated to match. The result: every call silently returned False with a log warning — the kind of failure that looks like a working system until someone notices the model is never actually satisfying any requirements.

This issue is also called out in the Epic as the worked example for how adapter output contracts should be enforced going forward: schema drift must raise immediately, not degrade quietly.

What changed

Core fix

requirement_check_to_bool now raises AdapterSchemaMismatchError instead of returning False when the adapter output does not match {"requirement_check": {"score": <float>}}. Covers missing top-level key, missing nested score, and wrong-type score (null, string, etc.).
requirement_check (the higher-level wrapper that calls the adapter directly) now applies the same validation — previously it raised a bare KeyError on bad output.
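
For illustration, a minimal sketch of the kind of inline validation described above. It is not the code merged in this PR: `AdapterSchemaMismatchError` is defined locally as a stand-in for mellea's exception, the function name and signature are hypothetical, the input is assumed to be an already-parsed dict, and the 0.5 threshold is an assumption.

```python
from typing import Any


class AdapterSchemaMismatchError(Exception):
    """Local stand-in for mellea's exception of the same name."""


def requirement_check_to_bool_sketch(output: Any, threshold: float = 0.5) -> bool:
    """Map {"requirement_check": {"score": <float>}} to a bool, raising on drift."""
    # Missing top-level key (for example the old "requirement_likelihood" shape).
    if not isinstance(output, dict) or "requirement_check" not in output:
        raise AdapterSchemaMismatchError(
            f"missing top-level 'requirement_check' key: {output!r}"
        )
    inner = output["requirement_check"]
    # Missing nested "score" key.
    if not isinstance(inner, dict) or "score" not in inner:
        raise AdapterSchemaMismatchError(f"missing nested 'score' key: {output!r}")
    score = inner["score"]
    # Wrong-type score such as None or a JSON string.
    if not isinstance(score, (int, float)):
        raise AdapterSchemaMismatchError(f"'score' is not numeric: {score!r}")
    return score > threshold


print(requirement_check_to_bool_sketch({"requirement_check": {"score": 0.9}}))
for bad in (
    {"requirement_likelihood": 0.9},
    {"requirement_check": {}},
    {"requirement_check": {"score": None}},
):
    try:
        requirement_check_to_bool_sketch(bad)
    except AdapterSchemaMismatchError as exc:
        print("raised:", exc)
```

Each of the three malformed shapes raises instead of collapsing into `False`, so a schema change like the one in PR #1008 would surface on the first call.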

Consistency

check_certainty and find_context_attributions now accept a model_options parameter, bringing all three intrinsic helpers in core.py into line.

Documentation

ALoraRequirement docstring previously said "falls back to LLMaJ on error" without qualification. That fallback only covers generation failures (adapter not found, etc.); output-parsing failures were always a hard raise and are now correctly documented as such.
docs/advanced/intrinsics.md example comment updated from the old {"requirement_likelihood": 1.0} schema to the current {"requirement_check": {"score": 1.0}}.

Breaking change

requirement_check_to_bool no longer returns False on parse failure — it raises AdapterSchemaMismatchError. Any caller treating silent False as "requirement not met" must now also handle this exception. The old behaviour was masking real errors, so the assumption was already wrong.
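
A rough sketch of what handling this on the caller side could look like. Everything here is hypothetical scaffolding rather than mellea's API: `AdapterSchemaMismatchError` is a local stand-in, `to_bool` imitates the new raising behaviour with an assumed 0.5 threshold, and `requirement_status` is an invented wrapper that keeps "requirement not met" (`False`) separate from "adapter output unusable" (`None`).

```python
import logging
from typing import Any, Optional

logger = logging.getLogger(__name__)


class AdapterSchemaMismatchError(Exception):
    """Local stand-in for mellea's exception of the same name."""


def to_bool(output: Any) -> bool:
    """Hypothetical converter that raises on schema mismatch."""
    try:
        score = output["requirement_check"]["score"]
    except (KeyError, TypeError) as exc:
        raise AdapterSchemaMismatchError(f"unexpected output: {output!r}") from exc
    if isinstance(score, bool) or not isinstance(score, (int, float)):
        raise AdapterSchemaMismatchError(f"non-numeric score: {score!r}")
    return score > 0.5


def requirement_status(output: Any) -> Optional[bool]:
    """Return True/False for a valid verdict, None when the output is unusable."""
    try:
        return to_bool(output)
    except AdapterSchemaMismatchError:
        # Log with traceback so the schema drift stays visible.
        logger.exception("adapter output did not match the expected schema")
        return None


print(requirement_status({"requirement_check": {"score": 0.2}}))
print(requirement_status({"requirement_likelihood": 0.9}))
```

Whether to catch the exception at all is the caller's decision; letting it propagate is the option most in line with the "raise immediately" goal stated in this PR.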

Test plan

24 tests pass in test/stdlib/requirements/test_requirement.py (was 19), covering the old requirement_likelihood schema, missing nested key, null score, string score, and an integration test confirming the exception propagates through ALoraRequirement.validate()
uv run ruff format . && uv run ruff check . — clean
uv run mypy mellea/stdlib/requirements/requirement.py mellea/stdlib/components/intrinsic/core.py test/stdlib/requirements/test_requirement.py --ignore-missing-imports — clean
Broader fast suite: 2350 passed, 7 pre-existing failures (optional extras not installed)

Closes #1138

…mputing#1320

- ALoraRequirement docstring now accurately scopes the LLMaJ fallback to generation-time errors only; schema mismatches from output_to_bool propagate intentionally as AdapterSchemaMismatchError
- requirement_check_to_bool: broaden score guard from `is None` to isinstance check, so non-numeric scores (e.g. JSON strings) raise AdapterSchemaMismatchError instead of bare TypeError
- Add tests for null and string-typed score values
- Remove issue number refs from test docstrings per project convention

Assisted-by: Claude Code
Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

psschwei

A couple of minor things that came up on review. Feel free to push back on any.

planetf1 · 2026-06-25T13:23:34Z

Testing summary

Local (macOS, no GPU):

42 unit tests pass — schema validation coverage for requirement_check and requirement_check_to_bool across null, list, bool, NaN, ±Inf, and out-of-range score inputs (test_core_schema.py + test_requirement.py)
Full non-qualitative suite clean (pre-existing failures in test_run_transformers and test_richdocument are MPS/float64 hardware limitations, unrelated to this change)
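
To make the score cases listed above concrete, here is a small sketch of a value guard in the same spirit. `is_valid_score` is a hypothetical helper, not a function from mellea; accepting plain ints and treating the bounds as inclusive are assumptions.

```python
import math


def is_valid_score(score: object) -> bool:
    """Accept only finite numeric scores in [0.0, 1.0]."""
    # bool is a subclass of int, so True/False must be rejected explicitly.
    if isinstance(score, bool) or not isinstance(score, (int, float)):
        return False
    # NaN and +/-Inf are floats that pass the isinstance check above.
    if isinstance(score, float) and not math.isfinite(score):
        return False
    return 0.0 <= score <= 1.0


# Why an explicit guard is needed: comparisons with NaN never raise.
print(float("nan") > 0.5)     # False
print(isinstance(True, int))  # True

for candidate in (0.9, None, [0.9], True, float("nan"), float("inf"), 1.5, -0.1):
    print(repr(candidate), is_valid_score(candidate))
```

Without the guard, a NaN score would be compared against the threshold and quietly read as "requirement not met", which is the same silent-failure pattern this PR removes.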

LSF/Cuda (GPU, granite-4.1-3b):

test_requirement_check passed against a real adapter call — output fits the new schema and score lands in [0.0, 1.0] ✓
Groundedness requirement suite: 13 passed, 1 xfailed (job 1689420)
Core intrinsics suite: 3 passed, 2 xfailed (job 1689643)

psschwei

LGTM
Put a hold in case @jakelorocco wants to take a look, but if not I'll remove the hold/merge tomorrow morning.

AngeloDanducci · 2026-06-26T21:20:59Z

This is next up on my merge conflict list.

psschwei · 2026-06-26T21:38:03Z

@AngeloDanducci feel free to drop the hold label as you see fit

jakelorocco

I think this seems reasonable. Raising an exception during output_to_bool slightly changes the contract that output_to_bool puts forward. I don't know if we should just flag for end users that output_to_bool / validate might raise arbitrary exceptions? Should validate have some means of flagging these exceptions differently from True/False?

AngeloDanducci · 2026-06-29T15:50:16Z

Added documentation for the new raise behavior. I will open an issue to address this the "right" way in the future if it does not end up covered by the IOContract changes elsewhere in the epic.

…atch (Epic generative-computing#929 Phase 1)

requirement_check_to_bool no longer silently returns False when the adapter output does not match the expected contract. Both missing-key paths now raise AdapterSchemaMismatchError so callers learn about schema drift immediately rather than silently treating every call as "requirement not met" (generative-computing#1008).

requirement_check gains a model_options parameter and a declared output contract in its docstring.

Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>
Assisted-by: Claude Code
Closes generative-computing#1138

- requirement.py: rename `likelihood` -> `req_check` (holdover from pre-schema-change field name)
- core.py: requirement_check now raises AdapterSchemaMismatchError on malformed adapter output instead of bare KeyError/TypeError
- core.py: add model_options parameter to check_certainty and find_context_attributions for consistency with requirement_check
- core.py: import AdapterSchemaMismatchError; clean up Returns section in requirement_check docstring
- test: integration test confirming AdapterSchemaMismatchError propagates through ALoraRequirement.validate() (documents deliberate hard-fail)

Assisted-by: Claude Code
Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

Assisted-by: Claude Code Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

…sics.md

Example comment showed {"requirement_likelihood": 1.0} (pre-generative-computing#1008 schema). Updated to {"requirement_check": {"score": 1.0}} and corrected the accompanying description from "likelihood score" to the actual output shape.

Assisted-by: Claude Code
Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

- Tighten ALoraRequirement docstring: the LLMaJ fallback is a pre-generation availability check, not an exception handler around output parsing. Reword to avoid implying schema errors are caught.
- Add cross-reference comments between the two parallel validation blocks in requirement_check_to_bool and requirement_check; Phase 2 will consolidate both via IOContract.

Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>
Assisted-by: Claude Code
Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

…ment_check

NaN, +/-Inf, and scores outside [0.0, 1.0] were passing the existing isinstance guard silently -- nan > 0.5 evaluates False rather than raising. Tighten both guards with math.isfinite + range check. Update Raises: docstrings to cover the broadened condition. Add unit tests for NaN, Inf, above-range, and below-range cases in both test_requirement.py and the new test_core_schema.py.

Assisted-by: Claude Code
Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

Assisted-by: Claude Code Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

Signed-off-by: AngeloDanducci <angelo.danducci.ii@ibm.com>

github-actions Bot added the enhancement label Jun 23, 2026

This was referenced Jun 24, 2026

refactor(intrinsics): migrate rag.py to new Adapter types (Epic #929 Phase 1 Wave 3) #1321

Merged

Epic: Fix Adapter Function Lifecycle & Consistency in Mellea #929

Open

planetf1 marked this pull request as ready for review June 24, 2026 09:12

planetf1 requested a review from a team as a code owner June 24, 2026 09:12

planetf1 requested review from akihikokuroda, markstur and psschwei June 24, 2026 09:12

planetf1 enabled auto-merge June 24, 2026 10:01

psschwei reviewed Jun 24, 2026

Comment thread mellea/stdlib/components/intrinsic/core.py

Comment thread mellea/stdlib/requirements/requirement.py

Comment thread mellea/stdlib/components/intrinsic/core.py Outdated

Comment thread mellea/stdlib/requirements/requirement.py Outdated

psschwei mentioned this pull request Jun 25, 2026

refactor(intrinsics): guardian.py IOContract subclasses + Adapter constants (Epic #929 Phase 1 follow-up) #1332

Open

6 tasks

psschwei added the do-not-merge/hold label Jun 25, 2026

psschwei approved these changes Jun 25, 2026

jakelorocco reviewed Jun 29, 2026

AngeloDanducci requested a review from nrfulton as a code owner June 29, 2026 15:49

AngeloDanducci removed the do-not-merge/hold label Jun 29, 2026

jakelorocco approved these changes Jun 29, 2026

planetf1 added this pull request to the merge queue Jun 29, 2026

github-merge-queue Bot removed this pull request from the merge queue due to a conflict with the base branch Jun 29, 2026

planetf1 added 6 commits June 29, 2026 15:13

style: ruff format fix for core.py

47a83eb

Assisted-by: Claude Code Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

planetf1 and others added 3 commits June 29, 2026 15:14

style(intrinsics): clarify bool/int subclass guard with inline comment

56bc306

Assisted-by: Claude Code Signed-off-by: Nigel Jones <jonesn@uk.ibm.com>

document raising an exception in output to bool

618528a

Signed-off-by: AngeloDanducci <angelo.danducci.ii@ibm.com>

AngeloDanducci force-pushed the worktree-issue-1138 branch from dd7a1bb to 618528a June 29, 2026 19:15

fix ci

9114709

Signed-off-by: AngeloDanducci <angelo.danducci.ii@ibm.com>

AngeloDanducci added this pull request to the merge queue Jun 29, 2026

Merged via the queue into generative-computing:main with commit e81333d Jun 29, 2026
9 checks passed

AngeloDanducci merged 10 commits into generative-computing:main from planetf1:worktree-issue-1138

4 participants
