Blog
Data-as-a-Service with Databricks Enterprise Cloud Service

Data-as-a-Service with Databricks Enterprise Cloud Service

STEVE TOUW

Published June 24, 2020

Last edited: April 3, 2025

The new Databricks Enterprise Cloud Service architecture provides powerful network security capabilities, however, a lesser known benefit is that it enables Data-as-a-Service.

What is Data-as-a-Service?

Data-as-a-Service gives you the ability to share data and provide compute and analytical tools along with it, providing data consumers with a full “data experience” in addition to a one-stop-shop for data. More specifically, it allows data providers to expose Databricks compute to data consumers, including consistent, analysis-ready data. This gives data consumers the ability to add their own data to the Databricks environment so they can create a robust set of inputs to drive new analytical models that provide a competitive edge.

How does Enterprise Cloud Service make this possible? At its foundation, Enterprise Cloud Service allows the creation of workspaces in a single VPC, across multiple VPCs in a single AWS account, or across multiple AWS accounts – all mapping to the same Databricks account. You can think of this as a dedicated Databricks URL for each data consumer.

A Guide to Automated Data Access

In Databricks Using Immuta

DOWNLOAD EBOOK

This provides many benefits, including:

Distinct Databricks workspaces for each data consumer
Multi-Workspace API that allows you to automate the provisioning of a workspace, and other APIs that allow you to bootstrap it according to your needs
Partitioning DBU cost per workspace, enabling you to charge individual data consumers for their compute
Whitelabeling those workspaces, allowing the data provider’s logo to dominate the data consumer experience
Separation of DBFS and the root bucket so things like cluster metadata are all separated completely between workspaces
Different authentication/identity management mechanisms for each workspace
Ability to share a single GLUE catalog across workspaces

Initializing the Enterprise Cloud Service data sharing environment is easy. The data provider can store their data once in S3, create a global GLUE Catalog defining their tables once, and then spin up white labeled workspaces for each data consumer. Below is a simplistic diagram of that architecture:

However, this is not the complete picture. As a data provider opening your data internally and externally, you must filter out certain tables or rows of data based on what the data consumer has paid for or what different business units should have access to. This may also include masking sensitive columns for untrusted data consumers while maintaining the ability to open those columns for trusted consumers who have signed a Business Associate Agreement (BAA), for example.

This severely complicates the above diagram because it means you would have to maintain individual copies of data as well as different catalog table definitions for every data consumer workspace. This quickly becomes too complex to manage. This is where the Databrick’s partner, Immuta, completes the Data-as-a-Service architecture.

With Immuta, you are able to reference the single Glue Catalog to build table-, row-, and column-level controls that will enforce policy dynamically, no matter the Databricks workspace. For example, I could build a rule in Immuta to hide any rows that contain data outside of the U.S. for data consumers that are U.S.-based. This rule will dynamically be applied natively in Databricks/Spark based on the user executing the query and their attributes (if in U.S. or not), no matter the Databricks workspace or identity management system.

This is the DaaS architecture secured with Immuta:

Leveraging this architecture allows for massive scalability because you can continue to maintain a single copy of your data and definition of your catalog tables, as well as a single consistent definition of your policies across all data consumers with Immuta. You will easily be able to provide a powerful and secure Data-as-a-Service platform that delights your data consumers.

Please reach out for more information on this architecture at [email protected] or request a free demo of Immuta.

3 Best Practices for Maximizing Data Management Efficiency

In 2020, global spending on cloud data services reached $312 billion. In 2022, Gartner estimates that this number will rise to a staggering $482 billion. This immense increase proves that the migration to and adoption of cloud platforms is the bona fide standard for contemporary information services and analysis. With...

8 Signs You Need Data Access Control for Databricks

Imagine this — You’re at the starting line of a road race. The gun goes off and the clock starts ticking. Runners take off — as you bend down to tie your shoes. You don’t have to be a runner to know that this tactic ends in your competitors leaving...

Resilient, Agile, and Future-Ready: A Roundtable on the Modern Data Stack

The modern data stack bears the immense responsibility of storing, protecting, analyzing, and operationalizing a resource that is constantly in flux. As data continues to increase and evolve, these tools need to make sure it is both being used effectively and kept safe from leaks. This issue and potential solutions...

your data

Put all your data to work. Safely.

Innovate faster in every area of your business with workflow-driven solutions for data access governance and data marketplaces.

Book a demo

Platform

Govern

Provision

Comply

Agentic Data Access

Integrations

All Integrations

Snowflake

Databricks

Teradata

Starburst

Resources

Blog

Resource Center

Documentation

Support

Live Learning

Webinars

In-Person Events

Book a Session with Us

Company

Company

Careers

Newsroom

Connect

Events

Contact Us

Data-as-a-Service with Databricks Enterprise Cloud Service

On this page

Share this article

What is Data-as-a-Service?

A Guide to Automated Data Access

This is the DaaS architecture secured with Immuta:

3 Best Practices for Maximizing Data Management Efficiency

8 Signs You Need Data Access Control for Databricks

Resilient, Agile, and Future-Ready: A Roundtable on the Modern Data Stack

Put all your data to work. Safely.

Platform

Govern

Provision

Comply

Agentic Data Access

Integrations

All Integrations

Snowflake

Databricks

Teradata

Starburst

Resources

Blog

Resource Center

Documentation

Support

Live Learning

Webinars

In-Person Events

Book a Session with Us

Company

Company

Careers

Newsroom

Connect

Events

Contact Us

Get Our Newsletter

Data-as-a-Service with Databricks Enterprise Cloud Service

On this page

Share this article

What is Data-as-a-Service?

A Guide to Automated Data Access

This is the DaaS architecture secured with Immuta:

3 Best Practices for Maximizing Data Management Efficiency

8 Signs You Need Data Access Control for Databricks

Resilient, Agile, and Future-Ready: A Roundtable on the Modern Data Stack

Put all your data to work. Safely.