Implement end-to-end hypothesis generation and testing workflow for issue #14 #236

Merged
Josephrp merged 4 commits into DeepCritical:dev from taivu1998:tdv/issue-14-hypothesis-testing on May 3, 2026

Conversation

@taivu1998

Summary

This draft PR implements the end-to-end hypothesis workflow for issue #14 on top of the current main branch snapshot.

It adds a shared hypothesis engine that supports both hypothesis generation and hypothesis testing, wires that engine into the application and orchestration layers, and adds the supporting configs, scripts, and focused tests needed to exercise the flow.
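
As a rough illustration of what "one shared engine, two flows" can look like, here is a minimal sketch. Every name in it (`Mode`, `Hypothesis`, `HypothesisEngine`, `run`) is hypothetical and chosen for this example only; it is not the API added by this PR, and the generation and testing steps are stubs.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class Mode(Enum):
    GENERATE = "generate"
    TEST = "test"


@dataclass
class Hypothesis:
    statement: str
    verdict: Optional[str] = None  # set only by the testing flow


class HypothesisEngine:
    """Hypothetical shared engine; names are illustrative, not the PR's API."""

    def __init__(self, max_hypotheses: int = 3) -> None:
        self.max_hypotheses = max_hypotheses

    def run(self, question: str, mode: Mode,
            provided: Optional[List[str]] = None) -> List[Hypothesis]:
        if mode is Mode.GENERATE:
            return self._generate(question)
        # Testing evaluates caller-provided hypotheses when given,
        # and otherwise falls back to generating candidates first.
        if provided:
            candidates = [Hypothesis(s) for s in provided]
        else:
            candidates = self._generate(question)
        return [self._test(h) for h in candidates]

    def _generate(self, question: str) -> List[Hypothesis]:
        # Stub: a real engine would call an agent or model here.
        return [Hypothesis(f"Candidate {i + 1} for: {question}")
                for i in range(self.max_hypotheses)]

    def _test(self, hypothesis: Hypothesis) -> Hypothesis:
        # Stub: a real engine would run an experiment or tool chain here.
        return Hypothesis(hypothesis.statement, verdict="inconclusive")
```

Under these assumptions, both flows share the same generation step, so a caller that supplies its own hypotheses and a caller that supplies none go through the same testing code path.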

What Changed

  • add shared hypothesis datatypes, prompts, agents, tools, and a dedicated hypothesis workflow state machine
  • add a dedicated DeepResearch.hypothesis_app entrypoint and CLI wiring for hypothesis runs
  • route explicit hypothesis generation and testing flows through DeepResearch.app and the primary workflow orchestrator
  • add hypothesis config groups, workflow compatibility shims, and runnable example configs
  • harden adjacent integration surfaces uncovered during implementation, including optional tool input validation and lazy tool loading
  • add focused import, workflow, tool, and example tests, plus a smoke script for the new workflow
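
For readers unfamiliar with the two hardening items in the list above, the following sketch shows one common way to do lazy tool loading and optional tool input validation. It is an assumption-laden example: `LazyToolRegistry`, `validate_inputs`, and the `"module:attr"` target format are hypothetical and are not taken from this PR's code.

```python
import importlib
from typing import Any, Callable, Dict, Iterable, Optional


class LazyToolRegistry:
    """Hypothetical registry that imports a tool only on first use."""

    def __init__(self) -> None:
        self._targets: Dict[str, str] = {}
        self._loaded: Dict[str, Callable[..., Any]] = {}

    def register(self, name: str, target: str) -> None:
        # target is a "module:attr" string; nothing is imported yet.
        self._targets[name] = target

    def is_loaded(self, name: str) -> bool:
        return name in self._loaded

    def get(self, name: str) -> Callable[..., Any]:
        if name not in self._loaded:
            module_name, _, attr = self._targets[name].partition(":")
            module = importlib.import_module(module_name)
            self._loaded[name] = getattr(module, attr)
        return self._loaded[name]


def validate_inputs(params: Dict[str, Any], required: Iterable[str],
                    optional: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Reject missing or unknown keys and fill defaults for optional ones."""
    required = list(required)
    defaults = dict(optional or {})
    missing = [key for key in required if key not in params]
    if missing:
        raise ValueError(f"missing required inputs: {missing}")
    unknown = sorted(set(params) - set(required) - set(defaults))
    if unknown:
        raise ValueError(f"unknown inputs: {unknown}")
    # Caller-supplied values override the optional defaults.
    return {**defaults, **params}


# Usage with a standard-library function standing in for a real tool.
registry = LazyToolRegistry()
registry.register("to_json", "json:dumps")
args = validate_inputs({"obj": {"h": 1}}, required=["obj"],
                       optional={"sort_keys": True})
print(registry.get("to_json")(**args))
```

Deferring the import until `get` is called keeps module import time low and lets a missing optional dependency fail only for the tool that needs it; whether the PR uses this exact mechanism is not stated in the description.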

Why

Issue #14 calls for comprehensive hypothesis testing support. On current main, the hypothesis path was still effectively placeholder-backed: the workflow was not fully wired end to end, testing-environment generation was incomplete, and several integration edges around provided hypotheses, routing, and validation still needed real implementation.

This PR closes that gap with a single shared workflow that keeps the implementation cohesive without adding unnecessary orchestration layers.

Validation

  • UV_CACHE_DIR=/tmp/uv-cache uv run pytest tests/test_statemachines/test_hypothesis_workflow.py tests/test_tools/test_hypothesis_tools.py tests/test_examples/test_hypothesis_engine.py tests/imports/test_app_imports.py tests/imports/test_datatypes_imports.py tests/imports/test_tools_imports.py -q
  • UV_CACHE_DIR=/tmp/uv-cache uv run python scripts/test/run_hypothesis_smoke.py
  • UV_CACHE_DIR=/tmp/uv-cache uv run pytest -q

Issue

Addresses #14.

@codecov

codecov Bot commented Apr 19, 2026

@taivu1998 taivu1998 marked this pull request as ready for review April 20, 2026 00:07
@Josephrp Josephrp enabled auto-merge April 25, 2026 13:29
@Josephrp Josephrp disabled auto-merge April 25, 2026 13:29
@Josephrp Josephrp enabled auto-merge April 25, 2026 13:29
@Josephrp Josephrp disabled auto-merge April 25, 2026 14:51
@Josephrp
Collaborator

@taivu1998 please retarget the PR from main to dev :-) Otherwise it's 10/10, with all my thanks.

Josephrp previously approved these changes Apr 25, 2026
Collaborator

@Josephrp Josephrp left a comment

LGTM 🏆

@taivu1998 taivu1998 changed the base branch from main to dev April 26, 2026 07:20
@taivu1998 taivu1998 dismissed Josephrp’s stale review April 26, 2026 07:20

The base branch was changed.

@taivu1998 taivu1998 force-pushed the tdv/issue-14-hypothesis-testing branch from 8bad2f3 to e397fbd April 26, 2026 09:12
@taivu1998 taivu1998 requested a review from Josephrp April 26, 2026 09:36
@taivu1998
Author

Thanks @Josephrp, I just updated. Could you review and merge?

@Josephrp Josephrp merged commit 759fe4a into DeepCritical:dev May 3, 2026
3 of 4 checks passed