Medici Land Governance Inc. logo
Medici Land Governance Inc. Verified
GovTech, Blockchain, Land Management, Fintech

Lead AI/ML Engineer

United StatesOnsiteFull TimeLeadPosted 2 months agoVisa sponsorship available

Is this role right for you?

Upload your resume and get a skill-by-skill breakdown — see exactly where you match, where you're close, and what to highlight. Not a mystery percentage.

Get a tailored resume highlighting what this role needs.

Role summary

We are seeking a hands-on Lead AI/ML Engineer to take end-to-end ownership of our document intelligence pipeline, which processes complex scanned court and property records. This role requires a technical leader who can architect, build, and improve systems for document intake, quality assessment, image preprocessing, OCR, segmentation, classification, and structured data extraction. You will focus on reducing hallucinations, improving accuracy, and establishing measurable quality standards for messy, real-world data. The position involves providing technical direction and hands-on leadership to a small AI engineering team, driving execution in ambiguous environments, and collaborating with stakeholders to define operational workflows and acceptable accuracy levels.

About the Company

We are hiring a hands-on AI/ML technical leader to own our document intelligence pipeline end to end. Our platform processes court documents, property records, and other complex scanned files. Some are clean new recordings. Many are historical records with poor image quality, degraded text, inconsistent structure, and difficult extraction conditions. Today, accuracy is not where it needs to be, and the pipeline can hallucinate. We need someone who can take ownership, make the technical calls, and drive execution.

About the Role

This is not a passive support role. This is for someone who wants to lead from the front, solve hard applied AI problems, and build production systems that work on messy real-world data.

Responsibilities

You will own the full document processing pipeline, including:

  • document intake and quality assessment
  • image preprocessing and enhancement
  • OCR and full-text extraction
  • document segmentation, classification, and routing
  • structured data extraction and indexing
  • validation, confidence scoring, and hallucination reduction
  • evaluation frameworks, error analysis, and production accuracy improvement

You will lead the path forward technically and operationally. We already have AI engineers on the team. What we need is someone who can establish direction, create accountability, and turn an ambiguous initiative into a shipping roadmap.

  • Architect and improve document AI pipelines for court records, property records, and other challenging document sets
  • Build systems that assess document quality and complexity on arrival and determine the right processing path
  • Improve preprocessing for skew, blur, noise, low contrast, bleed-through, rotation, cropping, and scan artifacts
  • Improve OCR and extraction quality across both high-quality modern documents and poor-quality historical records
  • Reduce hallucinations through validation layers, schema enforcement, confidence-based routing, and post-processing guardrails
  • Evaluate and select the right mix of models, OCR engines, document AI vendors, and custom components
  • Build measurable quality standards and benchmarks for extraction accuracy and indexing quality
  • Lead root-cause analysis on pipeline failures and drive iterative improvement
  • Work closely with product and business stakeholders to define acceptable accuracy and operational workflows
  • Provide technical leadership to a small AI engineering team while remaining deeply hands-on

Qualifications

  • Strong experience building production ML systems in document AI, OCR, computer vision, or information extraction
  • Experience with scanned documents, PDFs, legal records, forms, or other unstructured document workflows
  • Deep familiarity with tools and approaches such as OpenCV, PyTorch, Tesseract, PaddleOCR, LayoutLM, Donut, Azure Document Intelligence, Google Document AI, Textract, or similar
  • Strong understanding of image preprocessing, OCR optimization, extraction reliability, and layout-aware document pipelines
  • Experience reducing hallucinations and unreliable extraction outputs in production systems
  • Strong Python skills and comfort working across CV, NLP, and applied ML infrastructure
  • Proven ability to define technical direction and execute in ambiguous environments
  • High ownership, high urgency, and strong judgment

Required Skills

  • Experience with court documents, title/property records, legal documents, or public records
  • Experience with historical document digitization or degraded scans
  • Experience with indexing, retrieval, search, and downstream document intelligence workflows
  • Experience leading a small team or acting as the technical owner of a pipeline

Preferred Skills

  • Someone who wants real ownership, not just tickets
  • Someone who can make decisions and move
  • Someone who likes messy data and hard production problems
  • Someone who can lead engineers without hiding behind process
  • Someone who cares about accuracy, robustness, and shipping

Pay range and compensation package

Competitive and based on experience.

Equal Opportunity Statement

We are committed to diversity and inclusivity.

Ready to apply?
You'll be redirected to Medici Land Governance Inc.'s application page.

Similar roles