Sync data from Databricks to PlanetScale
Connect your data from Databricks to PlanetScale with Hightouch. No APIs, no months-long implementations, and no CSV files. Just your data synced forever.
The world's most innovative companies choose Hightouch as their Composable CDP
Activate your data in 3 easy steps
Use cases
Sync data from Databricks to PlanetScale
- Run complex queries on your data source and copy the results into a PlanetScale table.
- Exercise granular control over batch sizes and more.
What methods can I use to model my Databricks data?
dbt model selector
Sync directly with your dbt models saved in a git repository.
SQL editor
Create and Edit SQL from your browser. Hightouch supports SQL native to Databricks.
Sigma model
Hightouch converts your Sigma workbook element into a SQL query that runs directly on Databricks.
Table selector
Select available tables and sheets from Databricks and sync using existing views without having to write SQL.
Customer Studio
For less technical users, pass traits and audiences from Databricks using our visual segmentation builder.
FAQs
There are several options to sync data between sources. You can manually build and maintain a data pipeline, use a point-to-point solution such as Zapier, or you can manually upload CSVs.
With Hightouch, you get:
- Automation: You do not need to build and maintain custom data pipelines and you do not have to have your team do manual work
- Simplicity: You avoid a complex web of integrations caused by point-to-point solutions by syncing data from your source
- Speed: You can get set up in quickly - the average Hightouch customer starts syncing data in 23 minutes
- Control: companies of all sizes have access to enterprise-level controls including observability, dbt integrations, and version control
- Security: Hightouch never stores your data and is HIPAA, GDPR, CCPA, and SOC-2 compliant
90% of all Hightouch syncs complete in 30 seconds or less, and the platform enables non-technical users to self-serve.
With Hightouch, you can sync data as frequently as it changes within your Databricks. You can trigger data syncs manually or schedule them to run at an interval or custom recurrence as often as once per minute.
Hightouch offers a basic mapper or advanced mapper that allows you to visually match columns from your Databricks to fields in PlanetScale.
Databricks is a data science and analytics platform built on top of Apache Spark. Databricks implement the Data Lakehouse concept in a single unified, cloud based platform.
PlanetScale is a MySQL-compatible serverless database that brings you scale, performance, and reliability — without sacrificing developer experience. With PlanetScale, you get the power of horizontal sharding, non-blocking schema changes, and many more powerful database features without the pain of implementing them.
Yes, if you integerate Databricks and PlanetScale using Hightouch, in-warehouse planning is supported.
Great, but what is in-warehouse planning?
Between every sync, Hightouch notices any and all changes in your data model. This allows you to only send updated results to your destination (in this case PlanetScale). With the baseline setup, Hightouch picks out only the rows that need to be synced by querying every row in your data model before diffing using Hightouch’s infrastructure.
The issue here is this can be slow for large models.
Warehouse Planning allows Hightouch to do this diff directly in your warehouse. Read more on how this works here.