The Fact Pilot

AI-assisted factual investigation, built for insurance and legal work.

A multi-tenant SaaS that turns raw interview audio into court-ready statements, chronologies, and reports. Australian data residency and per-firm database isolation from day one.

3 months
Next.js · Node · MongoDB · Vertex AI (Gemini 2.5) · AWS
Launched Sep 2025
The Brief

Give factual investigators a single, secure workspace.

The context

An Australian factual-investigation firm working across insurance and legal matters needed to move off fragmented tooling. Interview recordings, transcripts, statements, and reports were spread across disconnected apps, with compliance and data-residency risk on every handoff.

The mandate

Ship a production SaaS that investigators trust: on-shore data, per-firm isolation, and AI that drafts structured witness statements and investigation reports without fabricating facts.

The bar

Build the pipeline investigators use every day: resilient uploads, diarised transcripts, AI extractions bound to a pre-agreed structure per participant type, and artefact exports that hold up in review.

Multi-tenant SaaS · AI extraction · Australian data residency · Insurance & legal
Audio handled
60+ min
per recording, end-to-end
Data residency
AU region
Pinned to Australian regions end-to-end
Tenant isolation
1 DB / firm
Database-per-tenant by design
How we shipped it

Three months, production-ready.

What we did

  • Multi-tenant architecture with database-per-firm isolation
  • End-to-end interview pipeline: upload → transcribe → extract → draft → export
  • Role-based workspaces for investigators, reviewers, and firm admins
  • Artefact generation (PDF + DOCX) with firm branding
  • Tenant-level usage controls and audit trails
  • Australian-region deployment with MFA-protected access
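
The pipeline arrow in the list above implies a strict stage order. A minimal TypeScript sketch of that ordering follows; the stage names are lifted from the arrow, while the `InterviewJob` shape and both helper functions are assumptions made for illustration, not the product's code.

```typescript
// Stage names come from the pipeline arrow; everything else is hypothetical.
const STAGES = ["upload", "transcribe", "extract", "draft", "export"] as const;
type Stage = (typeof STAGES)[number];

interface InterviewJob {
  id: string;
  completed: Stage[]; // stages finished so far, in pipeline order
}

// Returns the next stage to run, or null once the job has been exported.
function nextStage(job: InterviewJob): Stage | null {
  const next: Stage | undefined = STAGES[job.completed.length];
  return next === undefined ? null : next;
}

// Marks a stage as complete, rejecting anything that arrives out of order.
function completeStage(job: InterviewJob, stage: Stage): InterviewJob {
  const expected = nextStage(job);
  if (expected !== stage) {
    throw new Error(`Expected stage "${expected ?? "none"}" but got "${stage}"`);
  }
  return { ...job, completed: [...job.completed, stage] };
}
```

Keeping the order in one constant means a job cannot reach export without passing through extraction and drafting first.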

Our process

Discovery & architecture
Weeks 1-2

Mapped the investigator workflow end-to-end, nailed down data-residency and isolation requirements, and designed the tenant model before writing production code.

Core pipeline
Weeks 3-7

Built the upload → transcription → AI extraction loop against a pre-agreed output shape per participant type (Claimant, Insured, Witness), so outputs stay structured and reviewable.
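
To illustrate what a pre-agreed output shape per participant type can look like in practice, here is a small TypeScript validator sketch. The write-up does not say whether the shape is enforced through the model's structured-output settings, a post-hoc validator, or both; this shows only the validator side. The field names are invented for the example, since the real shapes are not published here.

```typescript
type ParticipantType = "Claimant" | "Insured" | "Witness";

// Hypothetical field lists; the actual pre-agreed shapes are not given in this case study.
const AGREED_FIELDS: Record<ParticipantType, readonly string[]> = {
  Claimant: ["fullName", "incidentDate", "accountOfEvents"],
  Insured: ["fullName", "policyNumber", "accountOfEvents"],
  Witness: ["fullName", "relationshipToParties", "accountOfEvents"],
};

interface ValidationResult {
  ok: boolean;
  missing: string[];    // agreed fields the model omitted or left blank
  unexpected: string[]; // fields the model added outside the agreed shape
}

// Checks raw model output against the agreed shape for one participant type.
function validateExtraction(type: ParticipantType, rawModelOutput: string): ValidationResult {
  const agreed = AGREED_FIELDS[type];
  let parsed: unknown;
  try {
    parsed = JSON.parse(rawModelOutput);
  } catch {
    return { ok: false, missing: [...agreed], unexpected: [] };
  }
  if (typeof parsed !== "object" || parsed === null || Array.isArray(parsed)) {
    return { ok: false, missing: [...agreed], unexpected: [] };
  }
  const record = parsed as Record<string, unknown>;
  const missing = agreed.filter((field) => {
    const value = record[field];
    return typeof value !== "string" || value.trim() === "";
  });
  const unexpected = Object.keys(record).filter((key) => !agreed.includes(key));
  return { ok: missing.length === 0 && unexpected.length === 0, missing, unexpected };
}
```

Rejecting unexpected keys as well as missing ones is what stops an improvised field from flowing into a statement; a failed result would typically be retried or routed to a reviewer rather than exported.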

Workspaces & artefacts
Weeks 8-10

Shipped investigator, reviewer, and admin workspaces, firm-branded PDF / DOCX statement and report generation, and usage caps.

Harden & launch
Weeks 11-12

Locked down TOTP MFA, per-tenant usage caps, and audit logging, then shipped the production deployment to the first firm.

Services covered

AI & LLM Systems · SaaS / MVP Engineering · Cloud & DevOps · Product Scaling
Under the hood

A stack chosen for trust, residency, and velocity.

Frontend
  • Next.js (App Router)
  • React
  • Tailwind
API
  • Node.js
  • Express
  • Mongoose
Database
  • MongoDB
  • Database-per-tenant
AI
  • Vertex AI
  • Gemini 2.5 Flash
  • australia-southeast1
Audio & storage
  • AWS Transcribe (diarisation)
  • S3 (ap-southeast-2)
Artefact generation
  • PDFKit
  • docx
  • html-to-docx
Security & runtime
  • TOTP MFA (speakeasy)
  • AWS Elastic Beanstalk

Deployment pipeline

Deploy
AWS Elastic Beanstalk
Region-pinned deployment; PDF and audio runtime dependencies baked into the platform
Configure
Environment-scoped configs • Per-tenant tier
Govern
Usage caps enforced at the tenant level (statements, chat, transcription minutes, reports, cases)
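
The tenant-level caps listed above can be pictured as a check that runs before any metered action. The following TypeScript sketch uses the five metered quantities named in the text; the cap values, the `TenantUsage` shape, and the function names are assumptions for illustration only.

```typescript
// The five metered quantities named above; the data shapes are hypothetical.
type Metric = "statements" | "chat" | "transcriptionMinutes" | "reports" | "cases";
type Counters = Record<Metric, number>;

interface TenantUsage {
  caps: Counters; // limits for the firm's tier
  used: Counters; // consumption so far in the current period
}

interface CapDecision {
  allowed: boolean;
  remaining: number;
}

// Decides whether a firm may consume `amount` more of a metric.
function checkCap(usage: TenantUsage, metric: Metric, amount: number): CapDecision {
  const remaining = Math.max(usage.caps[metric] - usage.used[metric], 0);
  return { allowed: amount > 0 && amount <= remaining, remaining };
}

// Returns updated usage, or throws if the action would exceed the cap.
function recordUsage(usage: TenantUsage, metric: Metric, amount: number): TenantUsage {
  if (!checkCap(usage, metric, amount).allowed) {
    throw new Error(`Usage cap reached for ${metric}`);
  }
  return { ...usage, used: { ...usage.used, [metric]: usage.used[metric] + amount } };
}
```

Checking before the work starts (for example, before a long transcription job is submitted) avoids paying for a job that the tenant's tier would not allow.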

Stack summary

Schema-enforced AI extraction
  • Structured outputs
    Each AI call is bound to a pre-agreed output shape per participant type, so every extraction is structured, auditable, and reviewable before anything reaches a legal artefact.
Database-per-tenant isolation
  • Physical isolation
    Each firm's data lives in its own MongoDB database. One firm's records physically never sit alongside another's; isolation is at the infrastructure layer, not a permission check in app code.
Australian data residency
  • Region-pinned pipeline
    Vertex AI and S3 are both pinned to Australian regions, so interview audio and inference never leave the jurisdiction.
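
The structured-outputs item above says every extraction is reviewable before anything reaches a legal artefact. One way to make that a hard rule rather than a habit is to gate export on an explicit approval. The TypeScript sketch below shows the idea; the status names, the `Draft` shape, and the helper functions are hypothetical and not taken from the product.

```typescript
// Hypothetical review states for an AI-generated draft.
type DraftStatus = "ai_drafted" | "in_review" | "approved";

interface Draft {
  id: string;
  status: DraftStatus;
  approvedBy?: string; // set only when a human reviewer signs off
}

// Moves a fresh AI draft into review.
function submitForReview(draft: Draft): Draft {
  if (draft.status !== "ai_drafted") {
    throw new Error(`Draft ${draft.id} is already ${draft.status}`);
  }
  return { ...draft, status: "in_review" };
}

// Records a named reviewer's approval; only drafts in review can be approved.
function approve(draft: Draft, reviewer: string): Draft {
  if (draft.status !== "in_review") {
    throw new Error(`Draft ${draft.id} must be in review before approval`);
  }
  return { ...draft, status: "approved", approvedBy: reviewer };
}

// Export is allowed only for drafts a human has approved.
function canExport(draft: Draft): boolean {
  return draft.status === "approved" && draft.approvedBy !== undefined;
}
```

With a gate like this, the PDF and DOCX generators would call `canExport` first and refuse anything a reviewer has not signed off.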

Key integrations

AWS Transcribe · Vertex AI (Gemini) · AWS S3 · SMTP email

Built-in safeguards

Australian region pipeline · Database-per-tenant isolation · TOTP MFA on every account · Tenant-level usage caps · Audit-safe PDF + DOCX exports · Human review on every AI draft
Outcome

Launched as The Fact Pilot, live at thefactpilot.com.au.

Shipped in ~12 weeks from brief to production launch (Sep 2025).

Replaced a fragmented toolchain with one workspace covering the full investigator workflow, from intake to delivered report.

Firms onboard with their own isolated database, branding, and usage envelope.

Launched Sep 2025 · Production SaaS · Multi-tenant · Australia-resident · AI-assisted
Feature highlights
  • Per-firm workspaces with investigator, reviewer, and admin roles
  • Audio upload + AWS Transcribe with speaker diarisation
  • Structured AI extraction per participant type (Claimant / Insured / Witness)
  • AI-drafted witness statements and investigation reports with human-in-the-loop review
  • PDF + DOCX artefact generation with firm branding
  • TOTP MFA, audit trails, and tenant-level usage caps
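
Speaker diarisation, listed above, labels each stretch of audio with a speaker so the transcript can be read as a conversation. The TypeScript sketch below shows the usual follow-up step of merging consecutive words from the same speaker into turns. The `DiarisedWord` shape is a simplified, hypothetical stand-in, not the transcription service's actual output format.

```typescript
// Simplified, hypothetical word-level input: one entry per spoken word.
interface DiarisedWord {
  speaker: string;      // speaker label assigned by diarisation
  startSeconds: number; // when the word begins in the recording
  text: string;
}

interface SpeakerTurn {
  speaker: string;
  startSeconds: number; // start of the first word in the turn
  text: string;
}

// Merges consecutive words from the same speaker into a single turn.
function toSpeakerTurns(words: DiarisedWord[]): SpeakerTurn[] {
  const turns: SpeakerTurn[] = [];
  for (const word of words) {
    const last = turns[turns.length - 1];
    if (last !== undefined && last.speaker === word.speaker) {
      last.text += ` ${word.text}`;
    } else {
      turns.push({ speaker: word.speaker, startSeconds: word.startSeconds, text: word.text });
    }
  }
  return turns;
}
```

Turn-level text with a start time is a convenient unit for the extraction step, because each extracted fact can point back to who said it and when.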

Innovations

Database-per-tenant isolation

Each firm gets its own MongoDB database, so one firm's data never sits alongside another's. Isolation is enforced at the infrastructure layer rather than by a tenant filter on every query, so a forgotten filter in application code can't cross-contaminate firms.
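
As a rough picture of how a database-per-tenant lookup might work, the TypeScript sketch below resolves a firm to its own database name and caches one handle per firm. The naming scheme, the slug rules, and the `TenantDatabases` class are all assumptions for illustration. The `open` callback stands in for whatever the data layer provides; with Mongoose it could plausibly wrap `connection.useDb(name)`, but that wiring is not described in this case study.

```typescript
// Hypothetical naming scheme: one database per firm, derived from a validated slug.
function databaseNameForFirm(firmSlug: string): string {
  if (!/^[a-z0-9][a-z0-9-]{1,30}$/.test(firmSlug)) {
    throw new Error(`Invalid firm slug: ${JSON.stringify(firmSlug)}`);
  }
  return `firm_${firmSlug}`;
}

// Caches one database handle per firm so each is opened once and then reused.
class TenantDatabases<Db> {
  private readonly handles = new Map<string, Db>();
  private readonly open: (databaseName: string) => Db;

  constructor(open: (databaseName: string) => Db) {
    this.open = open;
  }

  // Returns the handle for a firm's own database, opening it on first use.
  forFirm(firmSlug: string): Db {
    const name = databaseNameForFirm(firmSlug);
    let handle = this.handles.get(name);
    if (handle === undefined) {
      handle = this.open(name);
      this.handles.set(name, handle);
    }
    return handle;
  }
}
```

Because every query goes through a handle that is already scoped to one firm, there is no per-query tenant filter to forget; validating the slug also keeps a malformed identifier from resolving to some other database.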

Schema-enforced AI extraction

Every AI call is constrained to a pre-agreed output shape per participant type (Claimant, Insured, Witness). The model can't improvise fields into a legal artefact. Outputs are structured, reviewable, and safe to export.

Australian data residency by design

Interview audio lives in S3 ap-southeast-2. Inference runs on Vertex AI australia-southeast1. Residency is wired into every hop of the pipeline, from upload to rendered artefact.
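
One simple way to keep residency from drifting is a startup check that refuses to boot when a configured region falls outside an allow-list. The TypeScript sketch below uses the two regions named above; the `RegionConfig` shape and the `residencyViolations` function are hypothetical, and a real allow-list would cover every service in the pipeline.

```typescript
// Allow-list built from the two regions named in the text; the config shape is hypothetical.
const ALLOWED_S3_REGIONS: readonly string[] = ["ap-southeast-2"];
const ALLOWED_VERTEX_LOCATIONS: readonly string[] = ["australia-southeast1"];

interface RegionConfig {
  s3Region: string;
  vertexAiLocation: string;
}

// Returns a human-readable message for each setting that falls outside the allow-list.
function residencyViolations(config: RegionConfig): string[] {
  const violations: string[] = [];
  if (!ALLOWED_S3_REGIONS.includes(config.s3Region)) {
    violations.push(`S3 region "${config.s3Region}" is not an approved Australian region`);
  }
  if (!ALLOWED_VERTEX_LOCATIONS.includes(config.vertexAiLocation)) {
    violations.push(`Vertex AI location "${config.vertexAiLocation}" is not an approved Australian region`);
  }
  return violations;
}
```

Running this once at process start, and failing fast on any violation, turns a misconfigured environment variable into a deploy-time error instead of a silent residency breach.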

Why it matters
  • Investigators spend less time wrangling tools and more time on the facts of the case.
  • Firm principals get a compliance posture they can defend: on-shore data, isolated databases, MFA-gated access.
  • The AI layer accelerates draft work without introducing fabricated detail into court-adjacent artefacts.
Who this is for
  • Insurance and legal investigation firms that need on-shore data, isolated tenancies, and AI that drafts structured artefacts without fabricating facts.
  • Factual investigators who want a single workspace for intake, transcription, extraction, and report delivery, not a toolchain stitched across six tabs.