Simple, transparent pricing

Generate fully customizable, high-quality synthetic data for LLM training.
Pay only for what you use, or save with a subscription.

Subscription Plans

Best value for regular usage. Fast, Balanced, and Diverse tiers included.

Starter

$59 USD/mo

3,000 records/month

21% off vs pay-as-you-go

  • $59/mo in credits. Use any model.
  • 2 concurrent jobs
  • All schemas (instruction, preference, conflict, eval)
  • Quality report with every job
  • Advanced configuration (categories, policies, splits)
  • API access
  • Email support
Most Popular

Pro

$179 USD/mo

10,000 records/month

28% off vs pay-as-you-go

  • $179/mo in credits. Use any model.
  • 5 concurrent jobs
  • All schemas (instruction, preference, conflict, eval)
  • Quality report with every job
  • Advanced configuration (categories, policies, splits)
  • API access
  • Priority support

Scale

$499 USD/mo

27,000 records/month

26% off vs pay-as-you-go

  • $499/mo in credits. Use any model.
  • 10 concurrent jobs
  • All schemas (instruction, preference, conflict, eval)
  • Quality report with every job
  • Advanced configuration (categories, policies, splits)
  • API access
  • Dedicated support

Pay-as-you-go

No subscription required. Generate records on-demand at any quality tier.

1

Add a payment method

Sign up, then add a card in Account Settings

2

Choose a quality tier

Select any tier when creating a job

3

Pay per job

Card charged upfront. Auto-refund if 0 records produced.

TierModelDiversityinstruction_v1 USDpreference_v1 USDconflict_v1 USDeval_v1 USD
Fast
Quick prototyping and small datasets
GPT-4o Mini80%$0.50/1K$0.50/1K
Balanced
Production workhorse. Clinical depth with great diversity
GPT-4.1 Mini94%$3.00/1K$4.00/1K$4.00/1K$3.00/1K
Diverse
Highest diversity at scale. Rich, scenario-based outputs
GPT-5.4 Mini98%$8.00/1K$10.00/1K$10.00/1K$8.00/1K
DeepPAYG only
Maximum per-record depth. Textbook-level analysis
Claude Sonnet 4.682%$25.00/1K$30.00/1K$30.00/1K$25.00/1K

preference_v1 and conflict_v1 available on Balanced, Diverse, and Deep tiers. Deep tier is pay-as-you-go only.

All prices in USD. Local currency available at checkout.

Verified Quality

Optional second-pass validation with per-record scoring. Add to any generation job.

Standard QualityIncluded free

  • A/B/C/D quality grade
  • Pass rate & diversity score
  • Format compliance checks
  • Near-duplicate detection
  • Rejection breakdown
Add-on

Verified Quality

+$5 /1K records

  • Everything in Standard
  • Per-record quality scores (1-10)
  • LLM-as-judge second pass
  • Score & difficulty distributions
  • Low-scoring records auto-rejected

Subscription plans: verified records consume quota at 1.5x rate

Schema Availability

Each schema is optimized for specific ML training paradigms.

instruction_v1

Instruction-response pairs for supervised fine-tuning (SFT)

FastBalancedDiverseDeep

preference_v1

Paired responses with preference labels for RLHF/DPO

Requires Balanced tier or above for quality guarantee

FastBalancedDiverseDeep

conflict_v1

Multi-drive tension scenarios for alignment decision layers

Requires Balanced tier or above for quality guarantee

FastBalancedDiverseDeep

eval_v1

Evaluation datasets for benchmarking models

FastBalancedDiverseDeep

Ready to get started?

Create your free account and start generating synthetic data in minutes.

Frequently Asked Questions

Every plan gives you monthly credits in dollars. You spend credits on any model at the listed per-1K rate. For example, $59 in Starter credits buys 118,000 Fast records, 19,666 Balanced records, 7,375 Diverse records, or 2,360 Deep records. Mix and match however you like. Credits reset each billing cycle (they do not roll over).
conflict_v1 generates structured tension scenarios where two valid objectives compete (e.g., safety vs. autonomy, honesty vs. helpfulness). Each record includes responses from both drives, a resolution with confidence score, and an explicit override condition. It is designed for alignment researchers, safety teams, and anyone training models to reason about tradeoffs rather than apply rigid rules. Priced at preference_v1 rates.
Both preference_v1 and conflict_v1 require high-quality paired outputs. Balanced (GPT-4.1 Mini), Diverse (GPT-5.4 Mini), and Deep (Claude Sonnet 4.6) all support these schemas with reliable quality validation. The Fast tier does not support preference_v1 or conflict_v1 due to insufficient pair quality.
One record is one complete data item in your chosen schema. For instruction_v1, that's one instruction-response pair. For preference_v1, it's one prompt with chosen and rejected responses. For conflict_v1, it's one multi-drive tension scenario with resolution metadata. For eval_v1, it's one evaluation example.
Yes. All four models (Fast, Balanced, Diverse, Deep) are available on every plan, including Free. Your subscription gives you monthly credits, and you spend them on whichever model fits your needs.
Deep uses Claude Sonnet 4.6 for the highest possible per-record depth (1,955 chars avg output). At $25/1K records, it is the most expensive model but produces the most detailed, nuanced responses. Like every model, it is available on all plans. Just spend your credits on it.
If you have auto-recharge enabled and a payment method on file, your card is charged for the difference automatically. Otherwise, the job is blocked until you have enough credits. Upgrade your plan for more monthly credits, or enable auto-recharge in account settings.
Diversity % measures how unique the generated records are relative to each other, using trigram analysis. Higher diversity means less repetition and more varied training data. Diverse tier achieves 98%, Balanced 94%, Deep 82%, and Fast 80%. Choose the tier that matches your quality needs.
Every generation job supports custom system prompts, category-based distribution, response policies, hard negatives for adversarial training, train/validation output splitting, and custom metadata fields. All advanced options are included at no extra charge with any plan or pay-as-you-go tier.
No. You can use the simple model shorthand: 'fast', 'balanced', 'diverse', or 'deep'. We map these to the best model for each tier automatically. Advanced users can still specify exact provider/model combinations.