
How do I link an Azure Data Factory (ADF) Instance to GitHub?

Azure Data Factory (ADF) is a serverless, cloud-native service for managing batch data integration workloads. You access the service through a data factory instance (often called “a data factory”).

Create a Free Azure Account

Create a Resource Group

A Resource Group is a logical container that holds Azure services. For example, one could create a Resource Group called Resource Group A and place Azure analytical services in it.

Visit this link for official Azure documentation for Resource Groups.

The relationship between Tenant, Subscription and Resource Groups.
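
One place this hierarchy shows up is in the resource ID that Azure assigns to every resource: the subscription comes first, then the Resource Group, then the service itself. The sketch below builds such an ID for a data factory. The function name and the placeholder values are hypothetical and only for illustration; the path layout follows the standard Azure resource ID format.

```python
def data_factory_resource_id(subscription_id: str, resource_group: str, factory_name: str) -> str:
    """Build the Azure resource ID for a data factory.

    The ID nests the factory inside its Resource Group, which in turn
    sits inside a subscription.
    """
    return (
        f"/subscriptions/{subscription_id}"
        f"/resourceGroups/{resource_group}"
        f"/providers/Microsoft.DataFactory/factories/{factory_name}"
    )


# Hypothetical placeholder values, not real Azure resources.
example_id = data_factory_resource_id(
    "00000000-0000-0000-0000-000000000000",
    "resource-group-a",
    "my-data-factory",
)
print(example_id)
```

The tenant does not appear in the ID itself; it sits above the subscription.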

Create an Azure Data Factory

  • Read the official documentation for ADF here.
🦍
When on the Git configuration tab, tick the Configure Git later checkbox.

After a little spinning, the ADF instance gets created.

Conceptual view of the ADF instance.

Code, scripts, and other reusable bits and pieces, which are the core ingredients of any software-driven solution, should be version-controlled.

💡
Linking to a Git repo is easier from the ADF UX than from the Azure portal, which is why we chose the Configure Git later option when creating the data factory.

ADF can connect to GitHub or Azure Repos. For this How Do I?, we will use GitHub.

Create a GitHub Repo (or use an existing one)

💡
To create a new GitHub Repo, refer to GitHub's Official Documentation.

Go back to the ADF Console to complete the linking

Click Configure
Set Repository type to GitHub. In the GitHub repository owner text box, enter the GitHub username or organization that owns the repo (the <owner> part of www.github.com/<owner>/<project>).

You may be asked to authorize GitHub integration with Azure Data Factory.

This is a typical GitHub integration step; sometimes integrations are authorized with tokens, and sometimes through managed integrations like this one.
Configure the connection.
After a successful connection, a GitHub icon appears in the top left-hand corner of the ADF UX.
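
To make the owner and project fields concrete, here is a minimal sketch that splits a GitHub repo URL into its owner and project parts, then arranges them the way a data factory's `repoConfiguration` property is laid out in an ARM template (type `FactoryGitHubConfiguration`). The helper names, the example URL, and the default collaboration branch (`main`) and root folder (`/`) are assumptions for illustration. The ADF UX fills all of this in for you, so nothing here is required to complete the steps above.

```python
import json
from typing import Tuple
from urllib.parse import urlparse


def split_github_url(url: str) -> Tuple[str, str]:
    """Split a GitHub repo URL into (owner, project)."""
    # urlparse only recognises the host when a scheme is present.
    if "://" not in url:
        url = "https://" + url
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 2:
        raise ValueError(f"expected <owner>/<project> in the URL path, got: {url!r}")
    owner, project = parts[0], parts[1]
    # Clone URLs often end in ".git"; the repo name itself does not.
    if project.endswith(".git"):
        project = project[: -len(".git")]
    return owner, project


def github_repo_configuration(url: str, collaboration_branch: str = "main", root_folder: str = "/") -> dict:
    """Arrange the repo details like an ARM repoConfiguration block.

    The branch and root folder defaults are assumptions, not values from the article.
    """
    owner, project = split_github_url(url)
    return {
        "type": "FactoryGitHubConfiguration",
        "accountName": owner,          # goes in the "GitHub repository owner" box
        "repositoryName": project,
        "collaborationBranch": collaboration_branch,
        "rootFolder": root_folder,
    }


# Hypothetical repo URL for illustration only.
config = github_repo_configuration("www.github.com/example-owner/example-project")
print(json.dumps(config, indent=2))
```

Only the owner part (`example-owner` here) belongs in the GitHub repository owner text box; the project is picked separately once the connection is authorized.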

I write to remember, and if, in the process, I can help someone learn about Containers, Orchestration (Docker Compose, Kubernetes), GitOps, DevSecOps, VR/AR, Architecture, and Data Management, that is just icing on the cake.