feat: add weighted confidence scoring for extraction validation by Arijit429 · Pull Request #424 · fireform-core/FireForm

Arijit429 · 2026-04-12T07:30:51Z

Closes #60
Closes #450

🚀 Summary

This PR enhances the extraction validation workflow by introducing field-level weighted confidence scoring instead of using a flat deduction model.

The goal is to make confidence evaluation more representative of real-world data importance and improve the reliability of the requires_review decision.

✨ What Changed

Updated the ExtractionValidator logic to use weighted importance for each field:

FIELD_WEIGHTS = {
    "location": 30,
    "time": 20,
    "severity": 30,
    "description": 20
}

Each missing field now reduces confidence based on its relative importance instead of applying a uniform deduction.

💡 Why This Helps

In the previous approach, all missing fields contributed equally to confidence reduction.

This could under-represent critical missing information.

For example:

missing location should have higher impact
missing severity should strongly affect review decision
missing description may be comparatively less critical

The new weighted model makes confidence scoring more realistic and production-aligned.

🔍 Example

Previous behavior

all fields = equal impact

New behavior

location → 30
severity → 30
time → 20
description → 20

This allows the validator to make smarter review decisions.

🧪 Testing

Added dedicated unit test coverage for weighted confidence scoring.

Validated scenarios include:

multiple high-weight missing fields
partial extraction completeness
review threshold correctness
confidence floor protection

Executed locally using:

PYTHONPATH=. pytest test/test_extraction_validator.py -q

and verified successful execution.

🎯 Impact

This improves:

confidence score realism
human review routing accuracy
downstream PDF fill reliability
production readiness of extraction workflow

Arijit429 · 2026-04-12T07:31:08Z

Hi maintainers, this update improves the review workflow by making confidence scoring field-aware and more aligned with real extraction importance—happy to refine the weighting logic based on project needs.

This was referenced Apr 17, 2026

[FEATURE] Introduce Schema Validation and Confidence Scoring Layer for LLM Extraction Reliability #450

Closed

[UPDATE] Post-proposal contribution summary — Arijit Deb #456

Closed

Arijit429 added 9 commits April 20, 2026 00:52

Fixing error handling message when PDF generation fails

5d0d7bc

Replace print statements with logging for better observability

afb3d43

Improve README with clearer local setup steps

1224ad0

Add requires_review flag for incomplete LLM extraction validation

d9e3560

feat: add structured extraction flow with safe fallback

2c3f7c4

feat: add extraction validation and review pipeline

8d0f5cb

fix: auto-create database tables on application startup

24d4861

test: add extraction validator unit tests

db5f3a7

feat: add weighted confidence scoring for extraction validation

ab7f503

Arijit429 force-pushed the weighted-confidence-scoring branch from cf4ec51 to ab7f503 Compare April 19, 2026 19:25

marcvergees added the confidence-score label Jun 17, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add weighted confidence scoring for extraction validation#424

feat: add weighted confidence scoring for extraction validation#424
Arijit429 wants to merge 9 commits into
fireform-core:mainfrom
Arijit429:weighted-confidence-scoring

Arijit429 commented Apr 12, 2026 •

edited

Loading

Uh oh!

Arijit429 commented Apr 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Arijit429 commented Apr 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🚀 Summary

✨ What Changed

💡 Why This Helps

🔍 Example

Previous behavior

New behavior

🧪 Testing

🎯 Impact

Uh oh!

Arijit429 commented Apr 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Arijit429 commented Apr 12, 2026 •

edited

Loading