<Aphelion />

Synthetic Data,
Constraint-Safe & Intelligent.

Stop wrestling with broken foreign keys and manual seed scripts. Aphelion intelligently introspects your PostgreSQL schema to generate realistic, reproducible test data.

Built for Developers, Startups, and Enterprise.

Start Building Free
$ npm install -g @aphelion/data-generator

Works perfectly with your modern stack

Native CLI integration for Docker, CI/CD, and Seed Scripts

Docker
Node.js
PostgreSQL
React
GitHub Actions

Why Developers, Startups & Enterprises Love Us

Built by engineers tired of SQL seeds. Perfect for MVP velocity and Enterprise scale.

Constraint Safe

Never worry about foreign key violations. Aphelion builds a dependency graph of your tables and generates rows in topological order, so every referenced row exists before the rows that point to it.
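To make the idea concrete, here is a minimal sketch of dependency-ordered seeding using Python's standard `graphlib`. It is not Aphelion's implementation. The `foreign_keys` schema and the `insertion_order` helper are hypothetical and only illustrate the general technique.

```python
from graphlib import TopologicalSorter

# Hypothetical schema: each table maps to the tables its foreign keys reference.
foreign_keys = {
    "users": set(),
    "products": set(),
    "orders": {"users"},
    "items": {"orders", "products"},
}

def insertion_order(deps):
    """Return tables ordered so every referenced table precedes its dependents."""
    # list() drives the sorter; graphlib.CycleError is raised if deps has a cycle.
    return list(TopologicalSorter(deps).static_order())

order = insertion_order(foreign_keys)
print(order)  # parents such as "users" appear before "orders" and "items"
```

Seeding tables in this order means an `orders` row is never written before the `users` row it references.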

Healthcare Generators

MRN (5 formats), ICD-10, RxNorm, SNOMED, LOINC, NDC codes. Comprehensive OMOP CDM & OpenMRS support.

Hierarchies & Trees

Deep trees (5-11 levels), ltree support, cycle detection, HierarchyID paths, JSONB trees.
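As a rough illustration of ltree-style paths with cycle detection, the sketch below walks parent pointers up to the root and joins the labels with dots. The `parent_of` mapping and the `ltree_path` helper are assumptions made for this example, not part of Aphelion.

```python
def ltree_path(node, parent_of):
    """Build a dot-separated, ltree-style path from the root down to node."""
    labels = []
    seen = set()
    current = node
    while current is not None:
        if current in seen:
            # Revisiting a node means the parent pointers form a loop.
            raise ValueError(f"cycle detected at {current!r}")
        seen.add(current)
        labels.append(current)
        current = parent_of.get(current)
    return ".".join(reversed(labels))

# Hypothetical category tree: child -> parent (None marks the root).
parent_of = {"root": None, "electronics": "root", "phones": "electronics"}
path = ltree_path("phones", parent_of)
print(path)  # root.electronics.phones
```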

Data Masking

HIPAA/PCI-DSS compliant. Hash, redact, or partially mask sensitive values. Auto-detects SSNs, emails, phone numbers, and credit card numbers.
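The three masking styles can be sketched in a few lines of Python. This is an illustration under stated assumptions, not Aphelion's detector. The regular expressions are deliberately simple, and the `hash_value`, `redact`, and `partial_mask` names and the `demo-salt` default are hypothetical.

```python
import hashlib
import re

# Simplified detectors for illustration; real-world patterns need more care.
SSN_RE = re.compile(r"\b\d{3}-\d{2}-\d{4}\b")
EMAIL_RE = re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b")

def hash_value(value: str, salt: str = "demo-salt") -> str:
    """Replace a value with a salted SHA-256 digest (same input, same output)."""
    return hashlib.sha256((salt + value).encode("utf-8")).hexdigest()

def redact(text: str) -> str:
    """Replace detected SSNs and email addresses with fixed placeholders."""
    text = SSN_RE.sub("[REDACTED-SSN]", text)
    return EMAIL_RE.sub("[REDACTED-EMAIL]", text)

def partial_mask(value: str, visible: int = 4, fill: str = "*") -> str:
    """Hide everything except the last `visible` characters."""
    if len(value) <= visible:
        return fill * len(value)
    return fill * (len(value) - visible) + value[-visible:]

print(redact("SSN 123-45-6789, mail a.b@example.com"))
print(partial_mask("4111111111111111"))
```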

Realistic Distributions

Zipfian, power-law, and weighted distributions. Data skew, correlations, and temporal patterns that match reality.
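For readers unfamiliar with Zipfian skew, the sketch below weights the item at rank r by 1 / r^s and samples with a seeded generator, so a few items dominate and the run is repeatable. The `zipf_weights` and `sample_zipf` helpers are hypothetical and are not Aphelion's API.

```python
import random
from collections import Counter

def zipf_weights(n: int, s: float = 1.0) -> list[float]:
    """Unnormalized Zipf weights: the item at rank r gets weight 1 / r**s."""
    return [1.0 / (rank ** s) for rank in range(1, n + 1)]

def sample_zipf(items, k, s=1.0, seed=42):
    """Draw k items with Zipfian skew; a fixed seed makes the draw repeatable."""
    rng = random.Random(seed)
    return rng.choices(items, weights=zipf_weights(len(items), s), k=k)

products = [f"product_{i}" for i in range(1, 11)]
counts = Counter(sample_zipf(products, k=10_000))
print(counts.most_common(3))  # the top-ranked products dominate the sample
```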

Financial Features

Fraud detection (6 types), geolocation, device fingerprints, velocity metrics, PCI-DSS tokenization.

Rich Content

Markdown, code snippets, regex patterns, formatted text for social platforms and forums.

Temporal Constraints

Dates make sense across tables: bookings come before flights, and appointments come after registration.
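One simple way to keep related timestamps ordered is to generate the earlier one first and derive the later one by adding a positive offset. The sketch below assumes a hypothetical `booking_then_flight` helper and made-up ranges. It shows the idea rather than Aphelion's internals.

```python
import random
from datetime import datetime, timedelta

def booking_then_flight(rng, window_start, max_lead_days=90):
    """Return (booked_at, departs_at) with the booking strictly earlier."""
    booked_at = window_start + timedelta(
        days=rng.randint(0, 364), minutes=rng.randint(0, 1439)
    )
    # A lead time of at least one day guarantees booked_at < departs_at.
    lead = timedelta(days=rng.randint(1, max_lead_days))
    return booked_at, booked_at + lead

rng = random.Random(42)
pairs = [booking_then_flight(rng, datetime(2024, 1, 1)) for _ in range(5)]
for booked_at, departs_at in pairs:
    print(booked_at.isoformat(), "->", departs_at.isoformat())
```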

Multi-Tenant & Sharding

Shard keys, tenant isolation, and realistic data skew (for example, 60% primary, 25% satellite, 15% remote).

Advanced Constraints

CHECK constraints, domains, composite keys, multi-column uniqueness, enum-like types.

Complex Data Types

XML columns, JSONB trees, ltree paths, HierarchyID, custom domains, PostgreSQL extensions.

Built for Compliance

Designed from the ground up to meet regulatory requirements without compromising data realism.

GDPR Compliant

The system does not process or retain personal data. It observes schema and aggregate statistics to generate synthetic datasets that do not relate to identifiable individuals.

HIPAA Compliant

The system generates de-identified synthetic data by design and blocks any free-text or narrative fields that could contain PHI.

Industry-Specific Solutions

Pre-built generators for healthcare, finance, e-commerce, and more.

Works With Your Existing Schema

No configuration needed to start. We introspect your database, detect types, and map them to realistic Faker generators automatically.

  • Smart Type Detection: Maps `user_email` to `internet.email` automatically
  • Zero Config Start: Just point it at your DB URL and go
  • JSON Export: Export layout to JSON for fine-tuning
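Name-based type detection can be pictured as an ordered list of patterns checked against each column name. The rules below are illustrative assumptions. Only the `user_email` to `internet.email` mapping comes from this page, and `guess_generator` is a hypothetical helper.

```python
import re

# Hypothetical rules: (column-name pattern, Faker-style generator name).
RULES = [
    (re.compile(r"email"), "internet.email"),
    (re.compile(r"phone"), "phone.number"),
    (re.compile(r"first_?name"), "person.firstName"),
    (re.compile(r"(created|updated)_at$"), "date.past"),
]

def guess_generator(column_name, default="lorem.word"):
    """Return the first generator whose pattern matches the column name."""
    name = column_name.lower()
    for pattern, generator in RULES:
        if pattern.search(name):
            return generator
    return default

for column in ["user_email", "created_at", "sku"]:
    print(column, "->", guess_generator(column))
```

Rules are checked in order, so more specific patterns belong near the top of the list.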
~ aphelion introspect postgres://localhost/myapp
> Connected to database 'myapp'
> Found 14 tables
> Detected 3 circular dependencies
> Generating schema map... Done
~ aphelion generate --rows 1000 --seed 42
> Generating data plan
> Phase 1: Base tables (users, products)...
> Phase 2: Dependent tables (orders, items)...
> Phase 3: Resolving circular refs...
> Successfully generated 14,000 rows in 1.2s
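A common way to handle circular references like those in Phase 3 is to defer a nullable foreign key: insert the rows with that column set to NULL, then fill it in with a later UPDATE. The sketch below shows that general technique and is not necessarily how Aphelion resolves cycles. The `plan_with_deferred_edges` helper and the example tables are hypothetical.

```python
from graphlib import CycleError, TopologicalSorter

def plan_with_deferred_edges(deps, nullable_edges):
    """Order tables, deferring nullable FKs (child, parent) that sit on a cycle.

    Returns (insert_order, deferred); deferred edges would be filled in by a
    later UPDATE pass once both tables have rows.
    """
    graph = {table: set(parents) for table, parents in deps.items()}
    deferred = []
    while True:
        try:
            return list(TopologicalSorter(graph).static_order()), deferred
        except CycleError as err:
            cycle = set(err.args[1])  # nodes on the detected cycle
            edge = next(
                ((c, p) for c, p in nullable_edges
                 if c in cycle and p in cycle and p in graph[c]),
                None,
            )
            if edge is None:
                raise  # no nullable FK available to break this cycle
            graph[edge[0]].discard(edge[1])
            deferred.append(edge)

# employees.department_id -> departments; departments.manager_id -> employees
deps = {"employees": {"departments"}, "departments": {"employees"}}
order, deferred = plan_with_deferred_edges(deps, [("departments", "employees")])
print(order, deferred)
```

In this example `departments` would be inserted first with `manager_id` left NULL, then `employees`, and finally `manager_id` would be updated.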

Simple, Transparent Pricing

Start free on your local machine. Scale when your team grows.

Hacker / Local

$0/mo

Everything you need to build your MVP or side project locally.

  • Unlimited Tables
  • 1,000 Rows per Table
  • Full CLI Functionality
  • Local Development Only
Download & Start Building
POPULAR

Pro Team

$49/mo

For teams automating CI/CD pipelines.

  • Millions of Rows (tested up to 1.5M)
  • CI/CD Auto-Approve Mode
  • Priority Support
  • Advanced Custom Generators

Secure payment via Stripe

🔒 You get realistic data without inheriting production risk.

We never copy, store, hash, or transform real data. We observe structure and generate new data from scratch. All PII is automatically detected and replaced with safe synthetic values.

Scale Transparency: Tested and proven with up to 1.5M rows (100K patients in healthcare demos). Production-ready for datasets up to 250K patients (~3.75M rows) with the current configuration. For larger datasets, we offer a streaming implementation and direct database loading options.

Contact Sales & Support

Latest from the Blog

Updates, tutorials, and announcements.