Documentation shaped around real usage
This site is organized for operators, lab builders, and SDK authors. It starts with scenarios and outputs, then moves into reference and extension surfaces.
Model complete enterprise worlds
Generate identity, infrastructure, repositories, applications, CMDB views, policies, access evidence, and external ecosystem data from a single scenario definition.
Drive labs and validation with the same dataset
Use the generated world to populate labs, seed exports, validate discovery tooling, and create richer demos without maintaining fragile hand-authored fixtures.
Stay scenario-first and plugin-safe
Author scenarios with templates, overlays, and the terminal wizard, then extend the dataset through plugins that add synthetic data rather than system-specific adapters.
Dial realism without losing structure
Keep enterprise richness intact while choosing a deviation profile that ranges from clean to aggressively messy for labs, demos, and regression suites.
A clean path from scenario to usable data
DataGen is intentionally focused on synthetic data generation. The docs emphasize a consistent flow so teams can build labs, exports, SDK extensions, and downstream adapters without guessing where each concern belongs.
Built for real working scenarios
The docs lead with repeatable workflows: install the module, author a scenario, generate a world, export it, and use the data to populate labs or validate consumer tooling.
AD Lab
Build a directory-heavy enterprise with hybrid identity, stale accounts, tiered admin surfaces, and realistic OU and policy structure.
Entra Tenant
Generate an Entra-first tenant with guests, admin units, cross-tenant trust, Microsoft 365 collaboration, and cloud governance.
CMDB-Rich World
Produce canonical configuration items plus realistic CMDB, discovery, and service catalog drift for downstream validation.