# Datasets

A dataset is a container for data that has been organized and structured to support specific use cases or analyses.

In Dataworkz, datasets can be created from external sources — such as APIs, databases, and files — as well as from internal sources such as user input and other Dataworkz objects. Once created, you can transform and manipulate dataset data using AI-based techniques for cleaning, filtering, and aggregation.

Datasets are a fundamental concept in Dataworkz. They provide a structured way to organize and work with data across applications and workflows.

* **Data source integration:** Create datasets from a variety of external sources to integrate data from different systems.
* **Data transformation:** Apply transformations to dataset data, including cleaning, filtering, and aggregation.
* **Data visualization:** Explore and understand dataset contents using built-in visualization tools.
* **Collaboration:** Share datasets across multiple users so teams can build and analyze data-driven workflows together.
