Dataflows
Saved transformation pipelines that define how data moves and transforms across steps — reusable, schedulable, and composable.
A Dataflow is a named, reusable pipeline that captures one or more transformation steps. Dataflows eliminate the need to recreate transformation logic for each Dataset and can be scheduled to run automatically as new data arrives.
Dataworkz supports two types of Dataflows:
Single Dataflow — Performs one operation: a union, a join, or a Dataset transformation.
Composite Dataflow — Chains multiple Single Dataflows into an end-to-end workflow, passing output from one step as input to the next.
Dataflows can run as a one-time execution or as a recurring scheduled job at a specified frequency. Saved Dataflows appear in the Dataflows tab and can be applied to other Datasets using Import from Dataflow in the Steps panel.
See Benefits of Dataflows for a summary of why using Dataflows improves consistency and reduces preparation time.
Last updated

