Google Cloud Storage
Last updated
Last updated
This document describes the Dataworkz connector configuration required to access Google Cloud Storage. Dataworkz requires authorizing a Service Account to access GCS storage. Follow the Setup Service Account instructions for the purpose of setting up a Service Account. Ensure that service account that has been created has access to the Google Storage bucket. The service account used by Spark/Dataworkz to write to the GCS bucket must have the roles/storage.objectCreator and roles/storage.objectViewer roles on the bucket. For example, if Dataworkz is running as service account datXXXXXXXXX@datXXXXXXXXXX.iam.gserviceaccount.com, one has to explicitly add the permissions storage.objectCreator and storage.objectViewer for datXXXXXXXXX@datXXXXXXXXXX.iam.gserviceaccount.com account in customer's Google Cloud Storage console (this can also be done via cli tool).
Login to Dataworkz Application
Go to Configuration -> Cloud Data Platforms -> GCS Storage
Click the + icon to add a new configuration
Enter name for the configuration in the above screen
Enter the Project ID
Enter the Client ID
Enter the Client Email ID
Enter the Private Key ID
Enter the Private Key
Provide the storage base path
Test the connection
If successful click Save
Newly created connector would show up in the list of Google Cloud Storage configurations