Github
This document describes the Dataworkz connector configuration required to access Github.
Step 1: Create GitHub Connected App
Follow the steps listed below for creating an OAuth App in Github for Dataworkz
Go to Github settings https://github.com/settings/profile .

In the settings panel, select Developer settings https://github.com/settings/apps

Select GitHub Apps and click on "New GitHub App".

Configure Basic Information
Configure User Authorization Callback URL. Callback URL can be found in Dataworkz UI (Refer Connecting to Github section for details)
https://<you.domain.com>/dataworkz-web/restApi/oauth/github/get_oauth_tokenAdditional Settings:
✅ Enable "Expire user authorization tokens"
✅ Enable "Request user authorization (OAuth) during installation"

Important: Disable webhooks entirely

Permissions/scopes need to be added for Github API. Read scope should be selected for the data that need to be accessed from Github via Dataworkz.
Repository Permissions:
Contents (read) - for repository content access
Metadata (read) - basic repository information
Installation Settings.
✅ Select "Any account" if setting up for an organization

Click "Create GitHub App". This will create Github connected App.
After successfully creating the app, click on “Generate a Private Key”. Once private key is generated, save it for future use.

Next scroll up in the app settings. Locate and copy the Client ID. Save it securely for OAuth configuration in Dataworkz.
Click on “Generate a new client secret". Copy and save the Client Secret immediately
⚠️ Critical: The client secret will only be shown once. Make sure to copy and store it immediately.

Go to the Advanced option in the left pane. Click "Make public" so it can be installed in other accounts.

Admin Installation.
Admin Role: Installs app on organization (one-time setup, not on a single account)

Step 2: Configure Github connector in Dataworkz
Prerequisite
Before configuring Dataworkz for Github, a "Connected App" needs to be configured in Github.
If Connected App setup is not done, then capture Callback URL from Dataworkz UI (Refer Required Steps section 3.b for details)
Make note of the Client ID & Secret from connected app. These details would be required at the time of creating the connector in Dataworkz.
Required Steps
Login to Dataworkz and go to Github section under Saas Applications and then create new configuration.
Provide the following information to create Github configuration.
Client Id - Client id of Github connected app
Client Secret - Client secret of Github connected app

Custom App already created
Select "Yes" if connected app has already been created.
Select "No". Screen with the redirect URL would pop-up. Copy the redirect URL and goto setup Github connected app to complete connected app setup.

Enter the Workspace name
Enter the Collection name
Click on save button to save the configuration. System will prompt for Github credentials. Click on login after providing credentials.

System will prompt to allow connecting to private instance. Allow this setup to complete the Dataworkz configuration for Github .

Newly created connection would show up in the list of Github configurations.

Add Task Configuration
Click the newly created connector and then click + icon to add new task configuration for Github

Enter name for the dataset
Select the Github repository path that you need to access.
Select the branch of the selected repository
Select the appropriate option for reading all the historical data or for a date range
Enable/disable recurring job
Click Add to save the configuration
This would complete the Dataworkz configuration for Github
Last updated


