Note
Access to this page requires authorization. You can try signing in or changing directories.
Access to this page requires authorization. You can try changing directories.
Important
CI/CD in Apache Airflow in Data Factory for Microsoft Fabric is currently in preview. This information relates to a prerelease product that may be substantially modified before it's released. Microsoft makes no warranties, expressed or implied, with respect to the information provided here.
Note
Apache Airflow job is powered by Apache Airflow.
Currently, private networks and virtual networks aren't supported with Fabric Apache Airflow jobs.
You can use source control, CI/CD, and a shared workspace with Apache Airflow to manage your workflows.
Microsoft Fabric includes two built-in tools to support CI/CD: Git integration and deployment pipelines. These tools help you organize your Airflow pipelines and workspace resources.
Git integration lets you connect to your own repositories in Azure DevOps or GitHub. Deployment pipelines help you move updates between environments, so you only update what’s needed. Together, they make it easier to build, test, and deploy your Airflow workflows.
Git integration for Airflow
Follow these steps to connect your Airflow in Data Factory to Git. Git helps you track changes, work with your team, and keep your work safe.
Prerequisites for Git integration
- You need a Power BI Premium license or Fabric capacity.
- Make sure these admin settings are turned on:
- You need either an Azure DevOps organization or a GitHub account.
- For Azure DevOps:
- Sign up for a free Azure account if you don't have one.
- Make sure you have access to a repository.
- For GitHub:
- Sign up for a free GitHub account if you don't have one.
- You need a fine-grained token with read and write permissions for Contents, or a GitHub classic token with repo scopes enabled.
- For Azure DevOps:
Step 1: Connect to a Git repository
To use Git integration with Airflow in Fabric, you first need to connect to a Git repository:
Sign in to Fabric and go to the workspace you want to connect to Git.
Select Workspace settings.
Select Git integration.
Choose your Git provider—either Azure DevOps or GitHub. If you pick GitHub, select Add account to connect your GitHub account. After you sign in, select Connect so Fabric can access your GitHub account.
Step 2: Connect to a workspace
Once you’ve connected to a Git repository, you need to connect to your workspace.
From the dropdown menu, fill in the details about the workspace and branch you want to use:
For Azure DevOps:
- Organization name
- Project name
- Repository name
- Branch name
- Folder name
For GitHub:
- Repository URL
- Branch name
- Folder name
Select Connect and sync.
After connecting, select Source control for information about the linked branch, the status of each item, and when it last synced.
Step 3: Commit changes to Git
You can commit your changes to Git by following these steps:
- Go to your workspace.
- Select the Source control icon. You see a number showing how many changes aren't committed yet.
- In the Source control panel, select the Changes tab. You see a list of everything you've changed, along with status icons.
- Choose the items you want to commit. To select everything, check the box at the top.
- (Optional) Add a commit comment about your changes.
- Select Commit.
Once you commit, those items disappear from the list, and your workspace points to the latest commit.
Deployment pipelines
Deployment pipelines for Airflow jobs use the same workflow as other Fabric Data Factory items. For the full setup steps, including prerequisites, creating a pipeline, assigning workspaces, and deploying content between stages, see Get started with deployment pipelines for Git.
Known limitations
Here are some of the current limitations when using CI/CD for Airflow in Data Factory in Microsoft Fabric.
- Git integration does not support Airflow Git-Sync (exporting with Git Sync will not export Git Sync properties. If you import onto an item with Git Sync enabled, Git Sync will be removed)
- Secrets are not supported. (Exporting with secrets will not export Git Sync properties. If you import onto an item with existing secrets they will NOT be removed).
- Importing/creating a customPool with files in the same operation is not currently supported.
- Soft limitation: We support up to 50 DAGS for now. ALM operations might potentially fail if you have more than 50 DAGs.