Pydantic v2 native
Type-safe nested models, rich validators, and full mypy/pyright support. Compose schemas the way you compose Python.
A typed schema catalog that turns images, videos, and documents into validated JSON. Pin a version, compose them, run them with any VLM. Batteries included.
Every schema is Pydantic-native, validated against real industry use cases, and model-agnostic. Use it with Orion, GPT, Claude, or Gemini without rewriting prompts.
Type-safe nested models, rich validators, and full mypy/pyright support. Compose schemas the way you compose Python.
Invoices, resumes, ID cards, medical records, receipts, W-2s, and more. Thirty-plus production schemas, ready to import.
Catch bad extractions before they hit your pipeline. Pydantic validators clean, coerce, and reject at the model edge.
Works with OpenAI structured outputs, Claude, Gemini, and open-source VLMs via JSON mode. One schema, every backend.
Reuse fragments across schemas. Lift a Address or LineItem block into a larger document without rewriting it.
Browse the public YAML catalog of schemas. Open a PR with yours; we review and merge weekly.
Schema catalog
Pre-built Pydantic schemas across documents, identity, healthcare, finance, retail, and media. Pin a version and ship.
document.invoiceCustomer details, line items, and totals.
finance · work
document.bank-statementAccount, balances, and transaction history.
finance · banking
document.resumeContact, education, work history, and skills.
hr · work
document.us-passportPassport number, name, and biometric fields.
identity · travel
document.business-cardName, role, company, and contact details.
work
document.request-for-proposalScope, vendor terms, and submission details.
work
document.india.pan-cardPAN number, name, and DOB fields.
identity
healthcare.hipaa-releasePatient, recipients, and authorized release scope.
healthcare · hipaa
accounting.form-w2Employee, employer, wages, and tax withholding.
finance · hr
finance.balance-sheetAssets, liabilities, equity, and totals.
finance
retail.product-catalogSKU, attributes, and merchandising fields.
retail
media.tv-newsHeadlines, anchors, and on-screen text.
media · news
media.nfl-game-stateDown, distance, score, and play context.
media · sports
logistics.bill-of-ladingShipper, consignee, items, and carrier.
logistics
social.twitter-cardAuthor, body, engagement, and media.
social · media
document.invoiceCustomer details, line items, and totals.
finance · work
document.bank-statementAccount, balances, and transaction history.
finance · banking
document.resumeContact, education, work history, and skills.
hr · work
document.us-passportPassport number, name, and biometric fields.
identity · travel
document.business-cardName, role, company, and contact details.
work
document.request-for-proposalScope, vendor terms, and submission details.
work
document.india.pan-cardPAN number, name, and DOB fields.
identity
healthcare.hipaa-releasePatient, recipients, and authorized release scope.
healthcare · hipaa
accounting.form-w2Employee, employer, wages, and tax withholding.
finance · hr
finance.balance-sheetAssets, liabilities, equity, and totals.
finance
retail.product-catalogSKU, attributes, and merchandising fields.
retail
media.tv-newsHeadlines, anchors, and on-screen text.
media · news
media.nfl-game-stateDown, distance, score, and play context.
media · sports
logistics.bill-of-ladingShipper, consignee, items, and carrier.
logistics
social.twitter-cardAuthor, body, engagement, and media.
social · media
document.receiptMerchant, items, and totals from any receipt.
finance · work
document.bank-checkRouting, account, payee, and amount.
finance · banking
document.us-drivers-licenseName, address, DOB, and license details.
identity
document.utility-billAccount, billing period, charges, and dues.
energy
document.insurance-claimClaimant, policy, incident, and amounts.
insurance
document.india.aadhaar-cardAadhaar number, name, address, and DOB.
identity
healthcare.medical-insurance-cardMember, coverage, group, and provider info.
healthcare · insurance
healthcare.pathology-reportSpecimen, diagnosis, and pathologist findings.
healthcare · medical
accounting.form-payslipEarnings, deductions, and YTD totals.
finance · hr
retail.ecommerce-product-captionVisual description, attributes, and delivery info.
retail · ecommerce
food.nutrition-facts-labelServing size, calories, macros, and ingredients.
retail · food
media.nba-game-stateScore, quarter, possession, and player stats.
media · sports
aerospace.remote-sensingLand features, infrastructure, and visibility.
aerospace
real-estate.lease-agreementParties, term, rent, and lease clauses.
real-estate
document.receiptMerchant, items, and totals from any receipt.
finance · work
document.bank-checkRouting, account, payee, and amount.
finance · banking
document.us-drivers-licenseName, address, DOB, and license details.
identity
document.utility-billAccount, billing period, charges, and dues.
energy
document.insurance-claimClaimant, policy, incident, and amounts.
insurance
document.india.aadhaar-cardAadhaar number, name, address, and DOB.
identity
healthcare.medical-insurance-cardMember, coverage, group, and provider info.
healthcare · insurance
healthcare.pathology-reportSpecimen, diagnosis, and pathologist findings.
healthcare · medical
accounting.form-payslipEarnings, deductions, and YTD totals.
finance · hr
retail.ecommerce-product-captionVisual description, attributes, and delivery info.
retail · ecommerce
food.nutrition-facts-labelServing size, calories, macros, and ingredients.
retail · food
media.nba-game-stateScore, quarter, possession, and player stats.
media · sports
aerospace.remote-sensingLand features, infrastructure, and visibility.
aerospace
real-estate.lease-agreementParties, term, rent, and lease clauses.
real-estate
Quick Start
Import a schema, call any VLM, get validated structured data. No prompts to engineer, no regex, no parsing.