Databricks Integration

Transcend maintains an integration for Databricks and Databricks Lakehouse databases that supports Structured Discovery and DSR Automation functionality, allowing you to:

  • Scan your database to identify datapoints that contain personal information
  • Programmatically classify the data category and storage purpose of datapoints
  • Define and execute DSRs directly against your database

The first step to connecting Databricks to Transcend is to add the Databricks integration through the Connect Integrations page.

Create a service principal by following these instructions and generate a Client ID and Client Secret. Be sure to assign the Account Admin role to the service principal.

Enter the Account ID, Account URL, Client ID, and Client Secret into the Integration Connection Form in Transcend and select Connect.
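
Before filling out the form, it can help to confirm that the Client ID and Secret actually work. The sketch below is one way to check, using the OAuth client-credentials flow against the account console; the host shown is for AWS-hosted accounts (Azure and GCP hosts differ), and the ACCOUNT_ID, CLIENT_ID, and CLIENT_SECRET values are placeholders.

```python
import requests

ACCOUNT_ID = "<databricks-account-id>"        # placeholder
CLIENT_ID = "<service-principal-client-id>"   # placeholder
CLIENT_SECRET = "<service-principal-secret>"  # placeholder

# OAuth machine-to-machine (client credentials) flow against the account console.
resp = requests.post(
    f"https://accounts.cloud.databricks.com/oidc/accounts/{ACCOUNT_ID}/v1/token",
    auth=(CLIENT_ID, CLIENT_SECRET),
    data={"grant_type": "client_credentials", "scope": "all-apis"},
    timeout=30,
)
resp.raise_for_status()
print("Token issued; expires in", resp.json().get("expires_in"), "seconds")
```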

The first step to connecting Databricks Lakehouse to Transcend is to add the Databricks integration through the Connect Integrations page.

Create a service principal by following these instructions and generate a Client ID and Client Secret. Be sure to grant the USE CATALOG, USE SCHEMA, SELECT, and MODIFY privileges to the service principal on each catalog that you want Transcend to have access to.
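
These privileges can be granted in the Catalog Explorer UI or with SQL GRANT statements. The sketch below applies them at the catalog level using the Databricks SQL connector for Python, run as an admin who can manage grants on the catalog; the catalog name, warehouse details, token, and service principal application ID are all placeholders.

```python
from databricks import sql  # pip install databricks-sql-connector

CATALOG = "my_catalog"                             # placeholder
PRINCIPAL = "<service-principal-application-id>"   # placeholder

grants = [
    f"GRANT USE CATALOG ON CATALOG {CATALOG} TO `{PRINCIPAL}`",
    f"GRANT USE SCHEMA ON CATALOG {CATALOG} TO `{PRINCIPAL}`",
    f"GRANT SELECT ON CATALOG {CATALOG} TO `{PRINCIPAL}`",
    f"GRANT MODIFY ON CATALOG {CATALOG} TO `{PRINCIPAL}`",
]

with sql.connect(
    server_hostname="<workspace-host>.cloud.databricks.com",  # placeholder
    http_path="/sql/1.0/warehouses/<sql-warehouse-id>",       # placeholder
    access_token="<admin-token>",                             # placeholder
) as conn, conn.cursor() as cursor:
    for statement in grants:
        cursor.execute(statement)
```

Granting at the catalog level lets the schema- and table-level privileges be inherited, so new schemas and tables added later are covered automatically.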

Enter the SQL Warehouse ID, Warehouse URL, Client ID, and Client Secret into the Integration Connection Form in Transcend and select Connect.
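
If you want to double-check the warehouse details before submitting the form, a quick connectivity test like the sketch below can help. The hostname, warehouse ID, and token are placeholders; the HTTP path is derived from the SQL Warehouse ID shown on the warehouse's Connection details tab.

```python
from databricks import sql  # pip install databricks-sql-connector

SERVER_HOSTNAME = "<workspace-host>.cloud.databricks.com"  # hostname portion of the Warehouse URL
WAREHOUSE_ID = "<sql-warehouse-id>"                        # from the warehouse's Connection details tab

with sql.connect(
    server_hostname=SERVER_HOSTNAME,
    http_path=f"/sql/1.0/warehouses/{WAREHOUSE_ID}",
    access_token="<token-for-the-service-principal>",  # placeholder
) as conn, conn.cursor() as cursor:
    cursor.execute("SELECT current_user(), current_catalog()")
    print(cursor.fetchone())
```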

The Datapoint Schema Discovery plugin in the Databricks Lakehouse integration allows you to programmatically scan your database to identify the pieces of data it contains and pull them into Transcend as objects and properties. Once the data is in Transcend, it can be classified, labeled, and configured for DSRs.

The plugin operates by sampling the database and generating an object within Transcend for each identified collection. It also discovers embedded arrays within these collections, each of which is returned as an object prefixed with the name of the parent collection for clarity. The plugin then inspects the documents within each collection and tracks every property it encounters, including nested properties. This comprehensive scanning process ensures a thorough mapping of your database structure within Transcend.
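
For a rough sense of what sampling a Lakehouse catalog involves (this is an illustration only, not Transcend's implementation), a discovery pass amounts to enumerating columns from information_schema and reading a handful of values from each one; the catalog, warehouse, and token below are placeholders.

```python
from databricks import sql  # pip install databricks-sql-connector

CATALOG = "my_catalog"  # placeholder

with sql.connect(
    server_hostname="<workspace-host>.cloud.databricks.com",  # placeholder
    http_path="/sql/1.0/warehouses/<sql-warehouse-id>",       # placeholder
    access_token="<token>",                                   # placeholder
) as conn, conn.cursor() as cursor:
    # Enumerate every column the service principal can see in the catalog.
    cursor.execute(
        f"""SELECT table_schema, table_name, column_name, data_type
            FROM {CATALOG}.information_schema.columns
            WHERE table_schema <> 'information_schema'"""
    )
    for schema, table, column, data_type in cursor.fetchall():
        # Pull a few values from each column as a classification sample.
        cursor.execute(f"SELECT `{column}` FROM {CATALOG}.`{schema}`.`{table}` LIMIT 5")
        sample = [row[0] for row in cursor.fetchall()]
        print(f"{schema}.{table}.{column} ({data_type}): {sample}")
```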

To enable the Datapoint Schema Discovery plugin, navigate to the Structured Discovery tab within the Databricks Lakehouse data silo and toggle the plugin on. From there, you'll be able to set the frequency at which the plugin runs to discover new objects and properties as they are added to the database. Note: We recommend scheduling the plugin to run at times when the load on the database is lightest.

Transcend's Structured Discovery tool automatically classifies the data discovered in your database. By leveraging machine learning techniques, we can categorize and recommend the processing purpose for each piece of data discovered. With Structured Discovery, Transcend helps you keep your Data Inventory up to date through inevitable database schema changes. Check out our full Structured Discovery guide for more information about how it works.

With the Transcend Databricks and Databricks Lakehouse integration, you can fulfill DSRs directly against a Databricks or Databricks Lakehouse database by running Databricks operations with our custom JSON payload for the desired data actions on each datapoint.

The first step to setting up DSRs against the Databricks Lakehouse database is creating the datapoints in the data silo that should be queried. We typically recommend creating a datapoint for each collection in the database that stores personal data (or any collections you want to action DSRs against). For example, let's say there is a collection called Chat History that contains all the messages sent back and forth from a customer. You could create a datapoint for Chat History in the data silo and enable the specific data actions needed. If you're using Structured Discovery, you can enable the Datapoint Schema Discovery plugin to create the datapoints for you automatically.
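For illustration only, the sketch below shows the kind of statements an access and an erasure request might translate to against a hypothetical chat_history table keyed by a customer email; Transcend issues the actual operations for you once the datapoint and its data actions are configured. The table name, identifier column, connection details, and token are placeholders, and the example assumes a connector version that supports native named query parameters.

```python
from databricks import sql  # pip install databricks-sql-connector

TABLE = "my_catalog.support.chat_history"  # hypothetical datapoint
EMAIL = "data.subject@example.com"         # identifier supplied by the DSR

with sql.connect(
    server_hostname="<workspace-host>.cloud.databricks.com",  # placeholder
    http_path="/sql/1.0/warehouses/<sql-warehouse-id>",       # placeholder
    access_token="<token>",                                   # placeholder
) as conn, conn.cursor() as cursor:
    # ACCESS: export the data subject's chat messages.
    cursor.execute(f"SELECT * FROM {TABLE} WHERE customer_email = :email", {"email": EMAIL})
    records = cursor.fetchall()

    # ERASURE: delete those rows once the access copy has been delivered.
    cursor.execute(f"DELETE FROM {TABLE} WHERE customer_email = :email", {"email": EMAIL})
```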

Pro tip: Check out the Transcend Terraform Provider for options on managing data silos and data points in code.