Google Cloud Storage

This document describes the Dataworkz connector configuration required to access Google Cloud Storage. Dataworkz requires authorizing a Service Account to access GCS storage. Follow the Setup Service Accountarrow-up-right instructions for the purpose of setting up a Service Account. Ensure that service account that has been created has access to the Google Storage bucket. The service account used by Spark/Dataworkz to write to the GCS bucket must have the roles/storage.objectCreator and roles/storage.objectViewer roles on the bucket. For example, if Dataworkz is running as service account [email protected], one has to explicitly add the permissions storage.objectCreator and storage.objectViewer for [email protected] account in customer's Google Cloud Storage console (this can also be done via cli tool).

Create Connector for Google Cloud Storage

  1. Login to Dataworkz Application

  2. Go to Configuration -> Cloud Data Platforms -> GCS Storage

  3. Click the + icon to add a new configuration

  1. Enter name for the configuration in the above screen

  2. Enter the Project ID

  3. Enter the Client ID

  4. Enter the Client Email ID

  5. Enter the Private Key ID

  6. Enter the Private Key

  7. Provide the storage base path

  8. Test the connection

  9. If successful click Save

Newly created connector would show up in the list of Google Cloud Storage configurations

Last updated