Simple, transparent pricing

Generate fully customizable, high-quality synthetic data for LLM training.
Pay only for what you use, or save with a subscription.

Subscription Plans

Best value for regular usage. Fast, Balanced, and Diverse tiers included.

Starter

$59 USD/mo

3,000 records/month

21% off vs pay-as-you-go

$59/mo in credits. Use any model.
2 concurrent jobs
All schemas (instruction, preference, conflict, eval)
Quality report with every job
Advanced configuration (categories, policies, splits)
API access
Email support

Pro

$179 USD/mo

10,000 records/month

28% off vs pay-as-you-go

$179/mo in credits. Use any model.
5 concurrent jobs
All schemas (instruction, preference, conflict, eval)
Quality report with every job
Advanced configuration (categories, policies, splits)
API access
Priority support

Scale

$499 USD/mo

27,000 records/month

26% off vs pay-as-you-go

$499/mo in credits. Use any model.
10 concurrent jobs
All schemas (instruction, preference, conflict, eval)
Quality report with every job
Advanced configuration (categories, policies, splits)
API access
Dedicated support

Pay-as-you-go

No subscription required. Generate records on-demand at any quality tier.

Add a payment method

Choose a quality tier

Select any tier when creating a job

Pay per job

Card charged upfront. Auto-refund if 0 records produced.

Tier	Description	Model	Diversity	instruction_v1 (USD)	preference_v1 (USD)	conflict_v1 (USD)	eval_v1 (USD)
Fast	Quick prototyping and small datasets	GPT-4o Mini	80%	$0.50/1K	n/a	n/a	$0.50/1K
Balanced	Production workhorse. Clinical depth with great diversity	GPT-4.1 Mini	94%	$3.00/1K	$4.00/1K	$4.00/1K	$3.00/1K
Diverse	Highest diversity at scale. Rich, scenario-based outputs	GPT-5.4 Mini	98%	$8.00/1K	$10.00/1K	$10.00/1K	$8.00/1K
Deep (PAYG only)	Maximum per-record depth. Textbook-level analysis	Claude Sonnet 4.6	82%	$25.00/1K	$30.00/1K	$30.00/1K	$25.00/1K

preference_v1 and conflict_v1 available on Balanced, Diverse, and Deep tiers. Deep tier is pay-as-you-go only.

All prices in USD. Local currency available at checkout.
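As a rough illustration, the price table above can be turned into a small cost estimator. This is a sketch, not the billing implementation: the names PRICE_PER_1K and job_cost_usd are hypothetical, and it assumes the per-1K rate is prorated linearly for jobs that are not a multiple of 1,000 records.

```python
# Per-1K-record prices in USD, copied from the pricing table.
# None marks a schema that the tier does not offer.
PRICE_PER_1K = {
    "fast":     {"instruction_v1": 0.50,  "preference_v1": None,  "conflict_v1": None,  "eval_v1": 0.50},
    "balanced": {"instruction_v1": 3.00,  "preference_v1": 4.00,  "conflict_v1": 4.00,  "eval_v1": 3.00},
    "diverse":  {"instruction_v1": 8.00,  "preference_v1": 10.00, "conflict_v1": 10.00, "eval_v1": 8.00},
    "deep":     {"instruction_v1": 25.00, "preference_v1": 30.00, "conflict_v1": 30.00, "eval_v1": 25.00},
}


def job_cost_usd(tier: str, schema: str, records: int) -> float:
    """Estimate the pay-as-you-go cost of one job (assumes linear proration)."""
    rate = PRICE_PER_1K[tier][schema]
    if rate is None:
        raise ValueError(f"{schema} is not available on the {tier} tier")
    return round(rate * records / 1000, 2)


if __name__ == "__main__":
    print(job_cost_usd("balanced", "preference_v1", 2500))
```

Under these assumptions, a 2,500-record preference_v1 job on the Balanced tier would come to $10.00.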

Verified Quality

Optional second-pass validation with per-record scoring. Add to any generation job.

Standard Quality

Included free

A/B/C/D quality grade
Pass rate & diversity score
Format compliance checks
Near-duplicate detection
Rejection breakdown

Add-on

Verified Quality

+$5 /1K records

Everything in Standard
Per-record quality scores (1-10)
LLM-as-judge second pass
Score & difficulty distributions
Low-scoring records auto-rejected

Subscription plans: verified records consume quota at 1.5x rate
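The effect of the add-on can be sketched in a few lines. The function names here are hypothetical, and two points are assumptions rather than documented behavior: that the $5/1K add-on is simply added to the tier's per-1K rate, and that the 1.5x rate applies per record on subscription plans.

```python
VERIFIED_ADDON_PER_1K = 5.00      # USD per 1K records, from the add-on price above
VERIFIED_QUOTA_MULTIPLIER = 1.5   # quota rate for verified records on subscriptions


def payg_cost_with_verification(base_rate_per_1k: float, records: int) -> float:
    """Pay-as-you-go cost when Verified Quality is added (assumed additive)."""
    return round((base_rate_per_1k + VERIFIED_ADDON_PER_1K) * records / 1000, 2)


def quota_consumed(records: int, verified: bool) -> float:
    """Quota units used by a subscription job (assumed per-record multiplier)."""
    return records * VERIFIED_QUOTA_MULTIPLIER if verified else float(records)


if __name__ == "__main__":
    print(payg_cost_with_verification(3.00, 2000))  # Balanced instruction_v1 rate
    print(quota_consumed(1000, verified=True))
```

With these assumptions, 2,000 verified Balanced instruction_v1 records cost $16.00 on pay-as-you-go, and 1,000 verified records use 1,500 units of subscription quota.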

Schema Availability

Each schema is optimized for specific ML training paradigms.

instruction_v1

Instruction-response pairs for supervised fine-tuning (SFT)

Available on: Fast, Balanced, Diverse, Deep

preference_v1

Paired responses with preference labels for RLHF/DPO

Requires Balanced tier or above for quality guarantee

Available on: Balanced, Diverse, Deep (not available on Fast)

conflict_v1

Multi-drive tension scenarios for alignment decision layers

Requires Balanced tier or above for quality guarantee

Available on: Balanced, Diverse, Deep (not available on Fast)

eval_v1

Evaluation datasets for benchmarking models

Available on: Fast, Balanced, Diverse, Deep

Ready to get started?

Create your free account and start generating synthetic data in minutes.


Frequently Asked Questions

Every plan gives you monthly credits in dollars. You spend credits on any model at the listed per-1K rate. For example, at instruction_v1 rates, $59 in Starter credits buys 118,000 Fast records, 19,666 Balanced records, 7,375 Diverse records, or 2,360 Deep records. Mix and match however you like. Credits reset each billing cycle and do not roll over.
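The record counts in this example follow directly from the per-1K rates. A minimal sketch of the arithmetic, using the instruction_v1 rates from the pricing table (the function name is hypothetical, and rounding down to whole records is an assumption):

```python
# instruction_v1 rates in USD per 1K records, from the pricing table.
INSTRUCTION_RATE_PER_1K = {"fast": 0.50, "balanced": 3.00, "diverse": 8.00, "deep": 25.00}


def records_for_credits(credits_usd: float, rate_per_1k: float) -> int:
    """Whole records that a credit balance buys at a given per-1K rate."""
    # Work in integer cents to avoid floating-point rounding surprises.
    credits_cents = round(credits_usd * 100)
    rate_cents = round(rate_per_1k * 100)
    return credits_cents * 1000 // rate_cents


if __name__ == "__main__":
    for tier, rate in INSTRUCTION_RATE_PER_1K.items():
        print(tier, records_for_credits(59, rate))
```

Running this for $59 gives 118,000, 19,666, 7,375, and 2,360 records for Fast, Balanced, Diverse, and Deep respectively.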

conflict_v1 generates structured tension scenarios where two valid objectives compete (e.g., safety vs. autonomy, honesty vs. helpfulness). Each record includes responses from both drives, a resolution with confidence score, and an explicit override condition. It is designed for alignment researchers, safety teams, and anyone training models to reason about tradeoffs rather than apply rigid rules. Priced at preference_v1 rates.

Both preference_v1 and conflict_v1 require high-quality paired outputs. Balanced (GPT-4.1 Mini), Diverse (GPT-5.4 Mini), and Deep (Claude Sonnet 4.6) all support these schemas with reliable quality validation. The Fast tier does not support preference_v1 or conflict_v1 due to insufficient pair quality.

One record is one complete data item in your chosen schema. For instruction_v1, that's one instruction-response pair. For preference_v1, it's one prompt with chosen and rejected responses. For conflict_v1, it's one multi-drive tension scenario with resolution metadata. For eval_v1, it's one evaluation example.
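To make the record definitions concrete, here is one way the preference_v1 and conflict_v1 shapes could be modeled. Every class and field name below is hypothetical and chosen only to mirror the descriptions on this page. The actual output schema may use different names and extra fields.

```python
from dataclasses import dataclass


@dataclass
class PreferenceRecord:
    """One prompt with a chosen and a rejected response (hypothetical field names)."""
    prompt: str
    chosen: str
    rejected: str


@dataclass
class ConflictRecord:
    """One multi-drive tension scenario with resolution metadata (hypothetical field names)."""
    scenario: str
    drive_a_response: str      # response favoring the first objective
    drive_b_response: str      # response favoring the competing objective
    resolution: str
    confidence: float          # confidence score attached to the resolution
    override_condition: str    # explicit condition under which the resolution flips


if __name__ == "__main__":
    example = PreferenceRecord(
        prompt="Summarize this paragraph.",
        chosen="A concise, accurate summary.",
        rejected="An off-topic reply.",
    )
    print(example)
```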

All four models (Fast, Balanced, Diverse, Deep) are available on every plan, including Free. Your subscription gives you monthly credits, and you spend them on whichever model fits your needs.

Deep uses Claude Sonnet 4.6 for the highest possible per-record depth (1,955 chars avg output). At $25/1K records, it is the most expensive model but produces the most detailed, nuanced responses. Like every model, it is available on all plans. Just spend your credits on it.

If a job costs more than your remaining credits and you have auto-recharge enabled with a payment method on file, your card is charged for the difference automatically. Otherwise, the job is blocked until you have enough credits. Upgrade your plan for more monthly credits, or enable auto-recharge in account settings.
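The rule described here can be written out as a small decision function. This is a sketch of the stated behavior only. The Account class, its fields, and resolve_job are hypothetical names and do not correspond to any documented API.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class Account:
    credits_usd: float
    auto_recharge: bool
    has_payment_method: bool


def resolve_job(account: Account, job_cost_usd: float) -> Tuple[str, float]:
    """Return (decision, card_charge_usd) for a job with the given cost."""
    shortfall = round(job_cost_usd - account.credits_usd, 2)
    if shortfall <= 0:
        return ("run", 0.0)            # credits cover the whole job
    if account.auto_recharge and account.has_payment_method:
        return ("run", shortfall)      # card is charged for the difference
    return ("blocked", 0.0)            # wait for more credits


if __name__ == "__main__":
    print(resolve_job(Account(10.0, True, True), 25.0))
    print(resolve_job(Account(10.0, False, True), 25.0))
```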

Diversity % measures how unique the generated records are relative to each other, using trigram analysis. Higher diversity means less repetition and more varied training data. Diverse tier achieves 98%, Balanced 94%, Deep 82%, and Fast 80%. Choose the tier that matches your quality needs.
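The exact formula behind the diversity percentage is not given here. One common trigram-based measure is the share of distinct word trigrams across all records, sketched below as an illustration. The function names, the whitespace tokenization, and the choice of this particular measure are all assumptions.

```python
from typing import List, Tuple


def trigrams(text: str) -> List[Tuple[str, ...]]:
    """Word-level trigrams of a text, lowercased and split on whitespace."""
    tokens = text.lower().split()
    return [tuple(tokens[i:i + 3]) for i in range(len(tokens) - 2)]


def diversity_pct(records: List[str]) -> float:
    """Distinct trigrams as a percentage of all trigrams across the records."""
    all_trigrams = [t for record in records for t in trigrams(record)]
    if not all_trigrams:
        return 0.0
    return 100.0 * len(set(all_trigrams)) / len(all_trigrams)


if __name__ == "__main__":
    print(diversity_pct(["the cat sat on the mat", "the cat sat on the rug"]))
```

Under this measure, two records that share three of their four trigrams score 62.5%, while records with no repeated trigrams score 100%.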

Every generation job supports custom system prompts, category-based distribution, response policies, hard negatives for adversarial training, train/validation output splitting, and custom metadata fields. All advanced options are included at no extra charge with any plan or pay-as-you-go tier.

You do not need to specify an exact model name. Use the simple model shorthand: 'fast', 'balanced', 'diverse', or 'deep'. We map these to the best model for each tier automatically. Advanced users can still specify exact provider/model combinations.
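Conceptually, the shorthand works like a lookup table from tier name to model. The sketch below uses the model display names from the pricing table. The TIER_MODELS mapping and resolve_model are hypothetical, and the real identifiers accepted by the API may differ.

```python
# Tier shorthand mapped to the model listed for that tier in the pricing table.
# These are display names, not confirmed API identifiers.
TIER_MODELS = {
    "fast": "GPT-4o Mini",
    "balanced": "GPT-4.1 Mini",
    "diverse": "GPT-5.4 Mini",
    "deep": "Claude Sonnet 4.6",
}


def resolve_model(name: str) -> str:
    """Map a tier shorthand to its model; pass any other value through unchanged."""
    key = name.strip().lower()
    return TIER_MODELS.get(key, name)


if __name__ == "__main__":
    print(resolve_model("balanced"))
```

Anything that is not one of the four shorthands is returned as given, which mirrors the option to specify an exact provider/model combination.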