FIX BED-8687: Replace fields that use type `Any` with fixed types to fix DLT schema validation errors by d3vzer0 · Pull Request #14 · SpecterOps/openhound-github

d3vzer0 · 2026-06-18T10:12:04Z

Replaced the Any field type with stable types to prevent DLT schema validation issues (when different field types are collected during the same run) for the following models:

WorkflowJob
WorkflowStepDefinition
WorkflowJobDefinition

Copilot

Pull request overview

This PR updates the GitHub Actions workflow-related models to replace Any-typed fields with more stable, explicitly typed structures in order to reduce DLT schema validation errors caused by type variance across extracted rows.

Changes:

Tightened workflow YAML parsing models (WorkflowStepDefinition, WorkflowJobDefinition, WorkflowDocument) by replacing several Any fields with explicit unions / mappings.
Introduced a Container model and updated job/workflow permission typing.
Updated WorkflowJob to normalize runs_on into a stable string when emitting graph node properties.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 6 comments.

File	Description
`src/openhound_github/models/workflow.py`	Replaces `Any` in workflow YAML parsing models and adds a `Container` model.
`src/openhound_github/models/workflow_job.py`	Narrows `runs_on` typing and adds a normalization property used when building graph nodes.

Comments suppressed due to low confidence (1)

src/openhound_github/models/workflow.py:59

with_/env are now typed as dict[str, str], but the shared validator still returns dict[str, Any] and doesn't coerce values. If YAML contains non-string scalars (or other types), this can either fail validation or reintroduce mixed DLT types. Consider normalizing keys/values to strings in the validator and updating the return type accordingly.

    env: dict[str, str] = Field(default_factory=dict)

    @field_validator("with_", "env", mode="before")
    @classmethod
    def dict_or_empty(cls, value: Any) -> dict[str, Any]:

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

src/openhound_github/models/workflow.py:60

with_/env are now typed as dict[str, str], but this validator still returns the raw mapping (dict[str, Any]) without coercing keys/values. YAML commonly produces non-string scalars (including null for empty values), which can cause Pydantic validation to fail and the whole workflow document to be dropped. Normalize keys/values to str here (consistent with the job-level env/secrets normalization).

    with_: dict[str, str] = Field(default_factory=dict, alias="with")
    env: dict[str, str] = Field(default_factory=dict)

    @field_validator("with_", "env", mode="before")
    @classmethod
    def dict_or_empty(cls, value: Any) -> dict[str, Any]:
        return value if isinstance(value, dict) else {}

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

Comments suppressed due to low confidence (1)

src/openhound_github/models/workflow.py:60

with_/env are now typed as dict[str, str], but this validator still returns dict[str, Any] and doesn't normalize values. If a workflow YAML contains non-string scalars (e.g., null for an empty value), WorkflowDocument.model_validate() can now fail and document becomes None, dropping all workflow-derived rows. Coerce keys/values to strings here (and map YAML null to an empty string) to keep parsing robust and the emitted schema stable.

    @field_validator("with_", "env", mode="before")
    @classmethod
    def dict_or_empty(cls, value: Any) -> dict[str, Any]:
        return value if isinstance(value, dict) else {}

…f RunsOn is a RunsOn model or dict

Copilot

Pull request overview

Copilot reviewed 2 out of 2 changed files in this pull request and generated 1 comment.

Comments suppressed due to low confidence (1)

src/openhound_github/models/workflow.py:60

with_/env are now typed as dict[str, str], but the pre-validator still returns dict[str, Any] without coercion. This can lead to Pydantic validation errors (e.g., numeric YAML scalars) and also defeats the goal of keeping the extracted schema stable.

Consider normalizing both keys and values to strings here (mirroring the WorkflowJobDefinition normalizer).

    @field_validator("with_", "env", mode="before")
    @classmethod
    def dict_or_empty(cls, value: Any) -> dict[str, Any]:
        return value if isinstance(value, dict) else {}

d3vzer0 added 3 commits June 18, 2026 11:41

Fix type for WorkFLowJob runs_on field

c7aa4f8

Fix type for WorkFLowJob runs_on field

36ed53d

Replace "Any" field types with fixed types for Workflow model fields

ddd04eb

d3vzer0 marked this pull request as draft June 18, 2026 10:13

d3vzer0 requested a review from Copilot June 18, 2026 11:57

Copilot started reviewing on behalf of d3vzer0 June 18, 2026 11:57 View session

Copilot AI reviewed Jun 18, 2026

View reviewed changes

d3vzer0 added 8 commits June 18, 2026 14:05

Create a list of runs_on instead of single-value string.

ed087ee

Make WorkflowJobDefinition fields optional

75f1a7f

Container can be a single-value string as well

89259d3

Simplify returning container image name

4185d2b

Add None as allowed value for permissions in WorkflowDocument

83acaca

Also normalize the secrets map when loading the model

5bcc84c

Combine secret/env normalization in single validator

39568ba

Combine runs_on/permissions normalization in single validator

c6dbfda

d3vzer0 requested a review from Copilot June 18, 2026 12:58

Copilot started reviewing on behalf of d3vzer0 June 18, 2026 12:59 View session

Copilot AI reviewed Jun 18, 2026

View reviewed changes

Comment thread src/openhound_github/models/workflow.py Outdated

d3vzer0 added 2 commits June 18, 2026 15:34

Add additional types for runs_on

ef304a6

Split runs_on normalizer due to more complex field types

da4e318

d3vzer0 requested a review from Copilot June 18, 2026 13:39

Copilot started reviewing on behalf of d3vzer0 June 18, 2026 13:40 View session

Copilot AI reviewed Jun 18, 2026

View reviewed changes

Comment thread src/openhound_github/models/workflow.py

Comment thread src/openhound_github/models/workflow.py

d3vzer0 added 2 commits June 18, 2026 16:07

Remove is_self_hosted from top Workflow model

c961e31

Add is_self_hosted to sub WorkflowJob model + added safety to check i…

93c84bf

…f RunsOn is a RunsOn model or dict

d3vzer0 requested a review from Copilot June 18, 2026 14:12

Copilot started reviewing on behalf of d3vzer0 June 18, 2026 14:12 View session

Copilot AI reviewed Jun 18, 2026

View reviewed changes

Comment thread src/openhound_github/models/workflow_job.py

Add typehint

2318a02

d3vzer0 marked this pull request as ready for review June 18, 2026 14:29

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FIX BED-8687: Replace fields that use type `Any` with fixed types to fix DLT schema validation errors #14

FIX BED-8687: Replace fields that use type `Any` with fixed types to fix DLT schema validation errors #14
d3vzer0 wants to merge 16 commits into
mainfrom
fix/BED-8687

d3vzer0 commented Jun 18, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

d3vzer0 commented Jun 18, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants