Tools & Resources

Curated tools, frameworks, and resources for synthetic data practitioners and AI governance professionals.

Certification

CertifiedData.io

The cryptographic certificate authority for AI artifacts and synthetic datasets. Generates and certifies synthetic data using SHA-256 hashing and Ed25519 signatures, producing tamper-evident certification records for EU AI Act compliance and AI governance audits.

Explore CertifiedData.io →
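
As a rough illustration of the hashing step named in the entry above, the sketch below computes a SHA-256 fingerprint for a small dataset and stores it in a record that can later be checked against the data. This is not CertifiedData.io's implementation or API: the record layout, the function names, and the sample data are all hypothetical. The Ed25519 signing step is left out because it needs a third-party library. Only the bytes that such a signature would cover are prepared.

```python
import hashlib
import json


def fingerprint(data: bytes) -> str:
    """Return the SHA-256 digest of the artifact bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


def make_record(name: str, data: bytes) -> dict:
    # Hypothetical record layout; the field names are illustrative only.
    return {"artifact": name, "algorithm": "sha256", "digest": fingerprint(data)}


def matches(record: dict, data: bytes) -> bool:
    """True if the bytes still hash to the digest stored in the record."""
    return record["digest"] == fingerprint(data)


dataset = b"id,age\n1,34\n2,51\n"  # made-up sample data
record = make_record("demo.csv", dataset)

# Canonical bytes that a signer (for example Ed25519) would sign.
# The signing itself is omitted in this sketch.
payload = json.dumps(record, sort_keys=True, separators=(",", ":")).encode()
```

Any change to the dataset, even a single byte, produces a different digest, so `matches(record, modified_data)` returns False. That mismatch is what makes the record tamper-evident.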

Compliance

EU AI Act Compliance Checklist

A practical compliance checklist for providers and deployers of high-risk AI systems, covering Article 9 risk management, Article 10 training data, Article 12 logging, Article 14 human oversight, and Article 43 conformity assessment.

View checklist →

EU AI Act Compliance Guide

The comprehensive editorial guide to EU AI Act requirements — risk classification, high-risk AI obligations, implementation timeline, and technical compliance strategy.

Read the guide →

Governance

AI Decision Logging Guide

What AI decision logging is, what logs must contain for Article 12 compliance, and how to build a tamper-evident decision logging architecture.

Read the guide →
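
One common way to make a decision log tamper-evident is a hash chain, where each entry stores the hash of the entry before it. The sketch below is a minimal illustration of that idea and is not taken from the guide. The event fields (`model`, `input_id`, `decision`) and the function names are hypothetical, and it does not attempt to cover what Article 12 requires a log to contain.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


def entry_hash(prev_hash: str, event: dict) -> str:
    """Hash the previous entry's hash together with the serialized event."""
    body = json.dumps(event, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + body).encode()).hexdigest()


def append(log: list, event: dict) -> None:
    """Append an event, linking it to the hash of the last entry."""
    prev = log[-1]["hash"] if log else GENESIS
    log.append({"event": event, "prev": prev, "hash": entry_hash(prev, event)})


def verify(log: list) -> bool:
    """Recompute every hash in order; any edited entry breaks the chain."""
    prev = GENESIS
    for entry in log:
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["event"]):
            return False
        prev = entry["hash"]
    return True


log = []
# Hypothetical decision events for illustration.
append(log, {"model": "credit-v1", "input_id": "a1", "decision": "approve"})
append(log, {"model": "credit-v1", "input_id": "a2", "decision": "refer"})
```

Editing any stored event after the fact makes `verify(log)` return False, because the recomputed hash no longer matches the stored one. A chain like this only detects tampering. Preventing a full rewrite of the log would additionally need the latest hash to be signed or anchored somewhere the log's operator cannot change.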

Synthetic Data Governance Framework

How to design a synthetic data governance framework — covering data quality standards, access controls, versioning, certification, and regulatory alignment.

Read the framework →

Newsletter

Weekly Digest

The week's most important synthetic data and AI governance developments, curated for practitioners and policy professionals.

Subscribe free →

Generation

Gretel.ai

Cloud-native synthetic data platform specializing in tabular, text, and time-series data generation. Offers differential privacy controls, quality scoring, and an API-first architecture for enterprise data teams.

Visit Gretel.ai →

Mostly AI

Enterprise synthetic data platform focused on structured/tabular data. Offers privacy protection with re-identification risk scoring, and is available as SaaS or as an on-premises deployment.

Visit Mostly AI →

Synthetic Data Vault (SDV)

Open-source Python library for synthetic data generation, widely used for tabular, relational, and time-series synthesis. Includes CTGAN, GaussianCopula, TVAE, and PAR models.

Visit SDV (sdv.dev) →

YData Fabric

Data-centric AI platform with synthetic data generation capabilities, data quality profiling, and bias detection. Its open-source ydata-profiling library is widely used for exploratory data analysis (EDA) and data quality assessment.

Visit YData →