A cloud replacement for Usagi & DQD

Map, validate and monitor clinical data, automatically.

Retire fragile R-scripts and Java desktop tools. Map local terminology with AI, monitor your OMOP and custom-CDM pipelines continuously, and run every computation inside your own VPC — patient rows never leave.

OHDSI checks / scan

Auto-map precision

Rows of PHI leaving VPC

omopstack-agent · vpc-run

# secure agent started inside local VPC

✓ control-plane endpoint connected

$ omopstack init-omop-scan --source prod_clinical

# compiling 3,450 OHDSI v6.0 conformance rules…

plausibility 99.8% ok

conformance 100.0% standard

$ omopstack ai-map --col local_drug_name --std RxNorm

"HCT 25mg / Lisinopril 20mg Tab" → RxNorm 316866

✓ auto-assigned · confidence 0.98

2,451 variables mapped — no raw rows moved

Compatible with global clinical architectures

OMOP CDM v6.0 OHDSI ATLAS / ATHENA CDISC SDTM HL7 FHIR R4 SNOMED · RxNorm · LOINC

app.omopstack.io / observability Agent live · prod_clinical

Data quality & mapping

prod_clinical_redshift · last scan 2m ago

● all systems healthy

Concepts mapped

DQ pass rate

99.2%

▲ 0.4 vs last scan

Auto-map precision

Open issues

2 conformance · 1 drift

Quality score · 30 days

conformance · completeness · plausibility

DQD checks

Plausibility99.7%

Conformance82.1%

Completeness98.9%

The problem

Clinical data teams are fighting their own tools.

Mapping, quality and ETL still run on single-user desktop apps and hand-run R scripts — slow, siloed, and stale the moment a vocabulary updates. Here's the day-to-day, side by side.

Today · the desktop stack

Hours to rebuild the vocabulary index
Usagi rebuilds a local Lucene index from Athena files — on every machine.
Single-user desktop, files emailed around
No collaboration, no versioning, no audit trail.
Quality checks run by hand, then go stale
DQD & Achilles in R → static HTML, out of date within days.
Your laptop, or a DIY Docker stack
Ops burden on your team; PHI moving where it shouldn't.

With OMOPStack

Hosted, pre-indexed vocabulary
Map in minutes with AI semantic search — nothing to build locally.
Multi-user, versioned, governed
Git-style review & approval with a full audit trail.
Continuous quality, trended & alerting
3,500+ checks always-on, with Slack / PagerDuty alerts.
Managed, inside your VPC
Zero PHI egress. SOC 2 · HIPAA · BAA.

Why it matters

Terminology mapping is where OMOP projects stall. OMOPStack removes the bottleneck.

10M+

standard concepts indexed and searchable

~95%

auto-mapped with calibrated confidence

rows of PHI ever leave your VPC

Platform

Three modules, one observability plane.

Replace a sprawl of legacy desktop tools and R-scripts with a single secure, collaborative platform built for biostatistics and clinical-AI pipelines.

Usagi Cloud Mapping

Move past single-user desktop mapping. Build clinical dictionary maps collaboratively and let embedding models parse local shortcodes to SNOMED CT, RxNorm and LOINC.

Collaborative multi-user validation
Semantic pre-mapping & re-rank
Git-style change approval

DQD Live Observability

Automate database quality without hand-running R. Continuously execute 3,500+ OHDSI-compliant data-health checks inside your own database engine, with history and alerting.

Schema-drift auto-detection
Slack, PagerDuty & Teams alerts
Historic quality trend tracking

Multi-Model Synthesis

Transform, audit and clean data flowing from vendors' OMOP schemas into your proprietary custom models — without schema drift or lost clinical context.

Dual-ended transform checking
Custom CDM mapping interfaces
Native transformation logging

How it works

From raw source to monitored OMOP — one flow.

A single VPC-resident agent profiles, maps, validates and monitors. No desktop apps, no data leaving your boundary.

Connect

Point the VPC agent at your source database — outbound-only, no ports opened.

Profile

Continuous scan of schema, distributions, keys and drift — re-runs on every refresh.

Map with AI

Semantic automapping to OMOP standard concepts, scored — reviewed, never fabricated.

Validate

3,500+ OHDSI quality checks across conformance, completeness and plausibility.

Monitor

Always-on observability: trended over time, with alerts when thresholds break.

Capabilities

Every capability, in depth.

Browse the full feature set by area.

Dual-channel retrieval

Lexical (trigram) + semantic (BioLORD) candidate generation over the full vocabulary.

Constrained LLM re-rank

Ranks only real concepts — structurally cannot fabricate a concept_id.

Calibrated confidence

Per-domain auto-accept thresholds; everything else routed to review.

Review queue

Top-k candidates with evidence and rationale — approve, flag or reassign.

Git-style governance

Versioned change sets, approval workflow and a full audit trail.

Active learning

Corrections fine-tune the model — your maps get better the more you use it.

3,500+ OHDSI checks

The full Kahn framework: conformance, completeness and plausibility.

Continuous & scheduled

Runs on every data refresh — not by hand, not once a quarter.

Trending & drift

Time-series history so you catch regressions the moment they appear.

Alerting

Slack, PagerDuty and Teams notifications when thresholds break.

Drilldown

Per-table, per-field and per-concept violation detail.

OHDSI export

Standards-compliant JSON results for your own records.

Source profiling

Types, distributions, candidate / surrogate keys and dedup statistics.

Visual + AI mapping

Source→target arrows, AI-suggested and collaboratively reviewed.

Runnable ETL

Emits executable SQL / dbt — design and execution in one place.

OMOP 5.3 / 5.4 / 6.0

Targets the current CDM versions out of the box.

Custom CDM

Map to your own proprietary model — not just OMOP.

Vendor synthesis

Transform external OMOP feeds into your custom schema, cleanly.

Runs in your VPC

BYOC or on-prem; the computation engine never leaves your boundary.

Zero PHI egress

Only aggregate statistics ever reach the control plane.

SOC 2 · HIPAA · BAA

Audit-ready, with ready-to-sign business associate agreements.

SSO & RBAC

SAML / OpenID, role-based access and true multi-tenancy.

Outbound-only

No inbound ports; no database exposed to the internet.

Air-gap capable

Deploys into fully isolated, disconnected environments.

Replaces & integrates

One platform for the stack you're stitching together.

OMOPStack consolidates the OHDSI mapping, quality and ETL tools — most of them desktop, single-user, and run by hand — into one collaborative platform that runs continuously inside your VPC. It integrates with the analytics tools your team already trusts.

Terminology

Usagi Cloud Mapping

Replaces

UsagiIMOSymedicalApelon

Java desktop · single-user · string-matching · hours to build a local index

→ AI semantic automapping with confidence scores and human-in-the-loop review — collaborative, governed, and grounded in the standard vocabulary (no fabricated concept_ids).

Data quality

DQD Live Observability

Replaces

DataQualityDashboardAchillesARESCdmInspection

Manual R runs · static HTML · point-in-time · no trending or alerting

→ The 3,500+ OHDSI checks run continuously, trended over time with alerting and multi-user triage — the Kahn framework, hosted and always-on.

Structural ETL

Multi-Model Synthesis

Replaces

Rabbit-in-a-HatWhiteRabbitPerseus

Desktop arrow-drawing · emits docs, not runnable ETL · OMOP-rigid

→ AI-assisted source→target mapping that profiles continuously and emits executable ETL — for OMOP and your own custom CDM.

Integrates — not replaced

We connect to the analytics and infrastructure your team already trusts. No rip-and-replace of validated study methodology.

ATLASWebAPIHADESStrategusBroadseaAthena vocab

Full coverage — every tool, mapped

Tool	Category	Stack	OMOPStack
Usagi	Terminology	Java desktop	Replaces
IMO · Symedical · Apelon	Terminology (commercial)	Closed SaaS	Competes
Hecate · Lettuce	Semantic vocab search	Rust · LLM	Competes
DataQualityDashboard	Data quality	R package	Replaces
Achilles · ARES	Characterization	R · static site	Replaces
Rabbit-in-a-Hat · WhiteRabbit	Structural ETL	Java desktop	Replaces
Perseus	Web ETL	Angular · Py · Java	Competes
ATLAS · WebAPI · HADES	Analytics & evidence	JS · Java · R	Integrates
Strategus · Broadsea	Orchestration & infra	R · Docker	Integrates
Athena · OMOP vocabulary	Vocabulary source	CSV · ~10M concepts	Built on

Replaces — we supersede it · Competes — modern tool in the same space · Integrates / Built on — we connect to it or anchor to it, never reinventing validated methodology or fabricating concept_ids.

Live demo

Try OMOPStack in real time.

Interact with live simulations of our two core products.

Source dictionary Agent live

Pick a messy clinical entry or write your own. The AI parses the text and matches it to the vocabulary using context.

Predefined messy terms

Or write your own

Domain

Vocab

Mapping result & governance Waiting for concept

No active term mapped

Select or enter a phrase on the left to run the engine.

FiltersLive

Mock metrics from an OMOP database of 10,000,000 patient records. Filter by OHDSI category.

Rule scan logprod_omop_redshift · steady

Quality score · 30 days 99.2% ▲ 0.4

OHDSI rule	Table	Violation	Status
IsPlausibleBeforeDeath	DRUG_EXPOSURE	0.00% · 0 / 10M	PASS
CheckFK_Person_Id	CONDITION_OCC	4.12% · 412k / 10M	WARN
CheckNotNull_Gender	PERSON	0.01% · 1k / 10M	PASS
IsPlausibleAgeUnder120	PERSON	0.00% · 4 / 10M	PASS
InvalidConceptIdsExist	MEASUREMENT	11.89% · 1.19M / 10M	FAIL

2 failed rules in latest build.

Security

Patient data never leaves your VPC perimeter.

A zero-trust BYOC / on-premise framework: the control plane and analytics UI are hosted by us, while the computation engine runs entirely inside your private infrastructure.

Zero ingestion of PHI

The agent pushes only aggregate counts (e.g. 5 misses / 10M rows). Patient-level data stays put.

No inbound firewall openings

The local daemon makes outbound connections only — no exposed database ports.

Audit-ready compliance

Passes HIPAA, SOC 2 Type II and GDPR audits. Ready-to-sign BAAs.

OMOPStack SaaS

Control plane

· Multi-user web portal
· Analytics aggregates
· Alert orchestrator

https aggregate stats only

Client VPC subnet

Data plane

· OMOPStack computation engine
· Internal clinical databases
· Raw rows remain here

Pricing

What is manual monitoring costing you?

Estimate your platform subscription and the engineering value you recapture.

Database sources3 sources

115 databases

Patient volume (M)5 million

1M50M+

Informatics specialists4 specialists

125+

Listed on AWS & Google Cloud Marketplaces — apply committed cloud budgets to bypass procurement.

System calculation

Recommended deployment

VPC Professional Subscription

AI-assisted terminology mapping for up to 8 tunnels.

platform base$60,000/yr

Dev hours saved1,250 hrsmapping & setup

Value recaptured$156,250reallocated eng.

Estimates assume typical clinical mapping vs. automated pipelines.

FAQ

Questions about clinical observability.

Usagi and DQD are excellent community tools but are desktop-based (large Java downloads, huge index files) and run manually via R scripts. OMOPStack turns those algorithms into a modern multi-user web app: collaborative terminology mapping with cloud models, and continuous pipeline-quality monitoring.

Yes — that's the core architecture. The agent is a lightweight docker container inside your private subnets. It talks only to your database, computes locally, and reports only metadata and health percentages. No patient-level records or PHI ever leave your VPC.

We provide full OHDSI OMOP CDM support out of the box, but the core is schema-agnostic. Load custom database maps via simple JSON/YAML to use the AI mapping interface and validation rules across any legacy healthcare model.

Rather than string-matching, it uses clinical semantic embedding models. Given a local description, it evaluates synonyms, abbreviations and target-vocabulary hierarchies to compute similarity scores against SNOMED, LOINC and RxNorm — with constrained re-ranking so no concept_id is fabricated.

Retire the desktop OHDSI stack.

Map terminology with AI, monitor data quality continuously, and deploy in your VPC within days — patient rows never leave your boundary.

SOC 2 Type IIHIPAA audit-readyVPC / air-gapped

Map, validate and monitor clinical data, automatically.

Clinical data teams are fighting their own tools.

Terminology mapping is where OMOP projects stall. OMOPStack removes the bottleneck.

Three modules, one observability plane.

Usagi Cloud Mapping

DQD Live Observability

Multi-Model Synthesis

From raw source to monitored OMOP — one flow.

Connect

Profile

Map with AI

Validate

Monitor

Every capability, in depth.

Dual-channel retrieval

Constrained LLM re-rank

Calibrated confidence

Review queue

Git-style governance

Active learning

3,500+ OHDSI checks

Continuous & scheduled

Trending & drift

Alerting

Drilldown

OHDSI export

Source profiling

Visual + AI mapping

Runnable ETL

OMOP 5.3 / 5.4 / 6.0

Custom CDM

Vendor synthesis

Runs in your VPC

Zero PHI egress

SOC 2 · HIPAA · BAA

SSO & RBAC

Outbound-only

Air-gap capable

One platform for the stack you're stitching together.

Usagi Cloud Mapping

DQD Live Observability

Multi-Model Synthesis

Try OMOPStack in real time.

Deciphered input

Matches in standard vocabulary

Patient data never leaves your VPC perimeter.

Zero ingestion of PHI

No inbound firewall openings

Audit-ready compliance

OMOPStack SaaS

Client VPC subnet

What is manual monitoring costing you?

Recommended deployment

Questions about clinical observability.

Retire the desktop OHDSI stack.

Book an architecture demo

Request a custom VPC quote

Download VPC agent CLI

Request received