A cloud replacement for Usagi & DQD

Map, validate and monitor clinical data, automatically.

Retire fragile R-scripts and Java desktop tools. Map local terminology with AI, monitor your OMOP and custom-CDM pipelines continuously, and run every computation inside your own VPC — patient rows never leave.


0
OHDSI checks / scan
0
Auto-map precision
0
Rows of PHI leaving VPC
omopstack-agent · vpc-run

# secure agent started inside local VPC

✓ control-plane endpoint connected

$ omopstack init-omop-scan --source prod_clinical

# compiling 3,450 OHDSI v6.0 conformance rules…

plausibility  99.8% ok

conformance  100.0% standard

$ omopstack ai-map --col local_drug_name --std RxNorm

"HCT 25mg / Lisinopril 20mg Tab" RxNorm 316866

✓ auto-assigned · confidence 0.98

2,451 variables mapped — no raw rows moved

Compatible with global clinical architectures

OMOP CDM v6.0 OHDSI ATLAS / ATHENA CDISC SDTM HL7 FHIR R4 SNOMED · RxNorm · LOINC
app.omopstack.io / observability Agent live · prod_clinical
Data quality & mapping
prod_clinical_redshift · last scan 2m ago
● all systems healthy
Concepts mapped
0
DQ pass rate
99.2%
▲ 0.4 vs last scan
Auto-map precision
0%
Open issues
3
2 conformance · 1 drift
Quality score · 30 days
DQD checks
Plausibility99.7%
Conformance82.1%
Completeness98.9%
The problem

Clinical data teams are fighting their own tools.

Mapping, quality and ETL still run on single-user desktop apps and hand-run R scripts — slow, siloed, and stale the moment a vocabulary updates. Here's the day-to-day, side by side.

Today · the desktop stack
  • Hours to rebuild the vocabulary index

    Usagi rebuilds a local Lucene index from Athena files — on every machine.

  • Single-user desktop, files emailed around

    No collaboration, no versioning, no audit trail.

  • Quality checks run by hand, then go stale

    DQD & Achilles in R → static HTML, out of date within days.

  • Your laptop, or a DIY Docker stack

    Ops burden on your team; PHI moving where it shouldn't.

With OMOPStack
  • Hosted, pre-indexed vocabulary

    Map in minutes with AI semantic search — nothing to build locally.

  • Multi-user, versioned, governed

    Git-style review & approval with a full audit trail.

  • Continuous quality, trended & alerting

    3,500+ checks always-on, with Slack / PagerDuty alerts.

  • Managed, inside your VPC

    Zero PHI egress. SOC 2 · HIPAA · BAA.

Why it matters

Terminology mapping is where OMOP projects stall. OMOPStack removes the bottleneck.

10M+
standard concepts indexed and searchable
~95%
auto-mapped with calibrated confidence
0
rows of PHI ever leave your VPC
Platform

Three modules, one observability plane.

Replace a sprawl of legacy desktop tools and R-scripts with a single secure, collaborative platform built for biostatistics and clinical-AI pipelines.

Usagi Cloud Mapping

Move past single-user desktop mapping. Build clinical dictionary maps collaboratively and let embedding models parse local shortcodes to SNOMED CT, RxNorm and LOINC.


  • Collaborative multi-user validation
  • Semantic pre-mapping & re-rank
  • Git-style change approval

DQD Live Observability

Automate database quality without hand-running R. Continuously execute 3,500+ OHDSI-compliant data-health checks inside your own database engine, with history and alerting.


  • Schema-drift auto-detection
  • Slack, PagerDuty & Teams alerts
  • Historic quality trend tracking

Multi-Model Synthesis

Transform, audit and clean data flowing from vendors' OMOP schemas into your proprietary custom models — without schema drift or lost clinical context.


  • Dual-ended transform checking
  • Custom CDM mapping interfaces
  • Native transformation logging
How it works

From raw source to monitored OMOP — one flow.

A single VPC-resident agent profiles, maps, validates and monitors. No desktop apps, no data leaving your boundary.

01

Connect

Point the VPC agent at your source database — outbound-only, no ports opened.

02

Profile

Continuous scan of schema, distributions, keys and drift — re-runs on every refresh.

03

Map with AI

Semantic automapping to OMOP standard concepts, scored — reviewed, never fabricated.

04

Validate

3,500+ OHDSI quality checks across conformance, completeness and plausibility.

05

Monitor

Always-on observability: trended over time, with alerts when thresholds break.

Capabilities

Every capability, in depth.

Browse the full feature set by area.

Dual-channel retrieval

Lexical (trigram) + semantic (BioLORD) candidate generation over the full vocabulary.

Constrained LLM re-rank

Ranks only real concepts — structurally cannot fabricate a concept_id.

Calibrated confidence

Per-domain auto-accept thresholds; everything else routed to review.

Review queue

Top-k candidates with evidence and rationale — approve, flag or reassign.

Git-style governance

Versioned change sets, approval workflow and a full audit trail.

Active learning

Corrections fine-tune the model — your maps get better the more you use it.

3,500+ OHDSI checks

The full Kahn framework: conformance, completeness and plausibility.

Continuous & scheduled

Runs on every data refresh — not by hand, not once a quarter.

Trending & drift

Time-series history so you catch regressions the moment they appear.

Alerting

Slack, PagerDuty and Teams notifications when thresholds break.

Drilldown

Per-table, per-field and per-concept violation detail.

OHDSI export

Standards-compliant JSON results for your own records.

Source profiling

Types, distributions, candidate / surrogate keys and dedup statistics.

Visual + AI mapping

Source→target arrows, AI-suggested and collaboratively reviewed.

Runnable ETL

Emits executable SQL / dbt — design and execution in one place.

OMOP 5.3 / 5.4 / 6.0

Targets the current CDM versions out of the box.

Custom CDM

Map to your own proprietary model — not just OMOP.

Vendor synthesis

Transform external OMOP feeds into your custom schema, cleanly.

Runs in your VPC

BYOC or on-prem; the computation engine never leaves your boundary.

Zero PHI egress

Only aggregate statistics ever reach the control plane.

SOC 2 · HIPAA · BAA

Audit-ready, with ready-to-sign business associate agreements.

SSO & RBAC

SAML / OpenID, role-based access and true multi-tenancy.

Outbound-only

No inbound ports; no database exposed to the internet.

Air-gap capable

Deploys into fully isolated, disconnected environments.

Replaces & integrates

One platform for the stack you're stitching together.

OMOPStack consolidates the OHDSI mapping, quality and ETL tools — most of them desktop, single-user, and run by hand — into one collaborative platform that runs continuously inside your VPC. It integrates with the analytics tools your team already trusts.

Terminology

Usagi Cloud Mapping

Replaces

UsagiIMOSymedicalApelon

Java desktop · single-user · string-matching · hours to build a local index


AI semantic automapping with confidence scores and human-in-the-loop review — collaborative, governed, and grounded in the standard vocabulary (no fabricated concept_ids).

Data quality

DQD Live Observability

Replaces

DataQualityDashboardAchillesARESCdmInspection

Manual R runs · static HTML · point-in-time · no trending or alerting


The 3,500+ OHDSI checks run continuously, trended over time with alerting and multi-user triage — the Kahn framework, hosted and always-on.

Structural ETL

Multi-Model Synthesis

Replaces

Rabbit-in-a-HatWhiteRabbitPerseus

Desktop arrow-drawing · emits docs, not runnable ETL · OMOP-rigid


AI-assisted source→target mapping that profiles continuously and emits executable ETL — for OMOP and your own custom CDM.

Integrates — not replaced

We connect to the analytics and infrastructure your team already trusts. No rip-and-replace of validated study methodology.

ATLASWebAPIHADESStrategusBroadseaAthena vocab

Full coverage — every tool, mapped

ToolCategoryOMOPStack
UsagiTerminologyReplaces
IMO · Symedical · ApelonTerminology (commercial)Competes
Hecate · LettuceSemantic vocab searchCompetes
DataQualityDashboardData qualityReplaces
Achilles · ARESCharacterizationReplaces
Rabbit-in-a-Hat · WhiteRabbitStructural ETLReplaces
PerseusWeb ETLCompetes
ATLAS · WebAPI · HADESAnalytics & evidenceIntegrates
Strategus · BroadseaOrchestration & infraIntegrates
Athena · OMOP vocabularyVocabulary sourceBuilt on

Replaces — we supersede it · Competes — modern tool in the same space · Integrates / Built on — we connect to it or anchor to it, never reinventing validated methodology or fabricating concept_ids.

Live demo

Try OMOPStack in real time.

Interact with live simulations of our two core products.

Source dictionary Agent live

Pick a messy clinical entry or write your own. The AI parses the text and matches it to the vocabulary using context.

Mapping result & governance Waiting for concept

No active term mapped

Select or enter a phrase on the left to run the engine.

Security

Patient data never leaves your VPC perimeter.

A zero-trust BYOC / on-premise framework: the control plane and analytics UI are hosted by us, while the computation engine runs entirely inside your private infrastructure.

Zero ingestion of PHI

The agent pushes only aggregate counts (e.g. 5 misses / 10M rows). Patient-level data stays put.

No inbound firewall openings

The local daemon makes outbound connections only — no exposed database ports.

Audit-ready compliance

Passes HIPAA, SOC 2 Type II and GDPR audits. Ready-to-sign BAAs.

OMOPStack SaaS

Control plane

  • · Multi-user web portal
  • · Analytics aggregates
  • · Alert orchestrator
https aggregate stats only

Client VPC subnet

Data plane

  • · OMOPStack computation engine
  • · Internal clinical databases
  • · Raw rows remain here
Pricing

What is manual monitoring costing you?

Estimate your platform subscription and the engineering value you recapture.

3 sources
115 databases
5 million
1M50M+
4 specialists
125+

Listed on AWS & Google Cloud Marketplaces — apply committed cloud budgets to bypass procurement.

System calculation

Recommended deployment

VPC Professional Subscription

AI-assisted terminology mapping for up to 8 tunnels.

platform base$60,000/yr
Dev hours saved1,250 hrsmapping & setup
Value recaptured$156,250reallocated eng.
Estimates assume typical clinical mapping vs. automated pipelines.
FAQ

Questions about clinical observability.

Usagi and DQD are excellent community tools but are desktop-based (large Java downloads, huge index files) and run manually via R scripts. OMOPStack turns those algorithms into a modern multi-user web app: collaborative terminology mapping with cloud models, and continuous pipeline-quality monitoring.

Yes — that's the core architecture. The agent is a lightweight docker container inside your private subnets. It talks only to your database, computes locally, and reports only metadata and health percentages. No patient-level records or PHI ever leave your VPC.

We provide full OHDSI OMOP CDM support out of the box, but the core is schema-agnostic. Load custom database maps via simple JSON/YAML to use the AI mapping interface and validation rules across any legacy healthcare model.

Rather than string-matching, it uses clinical semantic embedding models. Given a local description, it evaluates synonyms, abbreviations and target-vocabulary hierarchies to compute similarity scores against SNOMED, LOINC and RxNorm — with constrained re-ranking so no concept_id is fabricated.

Retire the desktop OHDSI stack.

Map terminology with AI, monitor data quality continuously, and deploy in your VPC within days — patient rows never leave your boundary.

SOC 2 Type IIHIPAA audit-readyVPC / air-gapped