Add azure-di-financial-haystack integration #504

Open
zavera wants to merge 2 commits into deepset-ai:main from zavera:add-azure-di-financial-haystack
zavera commented Jun 8, 2026

Adds an integration listing for azure-di-financial-haystack — structured KV extraction and delta reconciliation for financial PDFs using Azure Document Intelligence.

Components:

  • AzureDiExtractor — 4-stage PDF recovery chain, multi-endpoint round-robin pool
  • KvNormalizer — maps verbose Azure DI labels to canonical field names, handles punctuation/whitespace/newline variants
  • DeltaCalculator — compares extracted values against reference values, scores HIGH/MEDIUM/LOW
  • build_pipeline() — pre-wired convenience pipeline
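
For reviewers who want a feel for what the normalization and delta-scoring steps involve, here is a minimal, stdlib-only sketch. It is not the package's implementation or API: the names `ALIASES`, `normalize_label`, and `score_delta` are hypothetical, the 1% and 5% thresholds are assumptions, and it assumes HIGH means close agreement with the reference value.

```python
import re

# Hypothetical alias table: cleaned label -> canonical field name.
# The real KvNormalizer mapping is not shown in this PR.
ALIASES = {
    "total revenue": "total_revenue",
    "net income loss": "net_income",
}


def normalize_label(label: str) -> str:
    """Map a verbose label to a canonical field name.

    Lowercases the label, turns every run of punctuation, whitespace,
    or newlines into a single space, then looks the result up in ALIASES.
    Unknown labels fall back to a snake_case version of the cleaned text.
    """
    cleaned = re.sub(r"[^a-z0-9]+", " ", label.lower()).strip()
    return ALIASES.get(cleaned, cleaned.replace(" ", "_"))


def score_delta(extracted: float, reference: float,
                high: float = 0.01, medium: float = 0.05) -> str:
    """Bucket the relative difference between two values.

    Thresholds are assumed for illustration: within 1% is HIGH,
    within 5% is MEDIUM, anything larger is LOW.
    """
    if reference == 0:
        # Relative difference is undefined; only an exact match agrees.
        return "HIGH" if extracted == 0 else "LOW"
    relative = abs(extracted - reference) / abs(reference)
    if relative <= high:
        return "HIGH"
    if relative <= medium:
        return "MEDIUM"
    return "LOW"
```

With these assumptions, `normalize_label("Total\nRevenue:")` and `normalize_label("total revenue")` resolve to the same field name, so punctuation and newline variants of a label end up compared against the same reference value.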

Tests: 76 unit tests pass on Python 3.10–3.13 with no Azure credentials required. Live Azure integration tests can be run with pytest -m integration.

PyPI: https://pypi.org/project/azure-di-financial-haystack
Repo: https://github.com/zavera/haystack-financial-doc-extractor

zavera requested a review from a team as a code owner on June 8, 2026, 17:56
vercel Bot commented Jun 8, 2026

@azaver1 is attempting to deploy a commit to the deepset Team on Vercel.

A member of the Team first needs to authorize it.

zavera force-pushed the add-azure-di-financial-haystack branch from 61016fd to 910b911 on June 8, 2026, 17:59
zavera force-pushed the add-azure-di-financial-haystack branch from 910b911 to 2610a70 on June 8, 2026, 17:59
Comment thread integrations/azure-di-financial-haystack.md
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
zavera commented Jun 9, 2026

The package is now live on PyPI: https://pypi.org/project/azure-di-financial-haystack/ (v0.1.1). Also fixed the LinkedIn URL in the latest commit. Please take another look when you get a chance!

zavera requested a review from kacperlukawski on June 11, 2026, 17:41