Dataplex pricing

Dataplex pricing is based on pay-as-you-go usage. Dataplex currently charges based on the following SKUs:

  • Dataplex processing (standard and premium)
  • Dataplex shuffle storage
  • Metadata storage
  • Data Catalog API calls

The following is a high-level overview of how each key Dataplex capability is billed:

Capability | Dataplex processing | Dataplex shuffle storage | Metadata storage
Cloud Storage metadata harvesting | Standard | N/A | N/A
Data exploration workbench | Premium | Yes | N/A
Data lineage | Premium | N/A | Yes
Data quality | Premium | N/A | Yes, if published to Data Catalog
Data profiling | Premium | N/A | Yes, if published to Data Catalog
Enrich metadata in Data Catalog | N/A | N/A | Yes

In addition to this billing, Data Catalog API calls are billed based on the Data Catalog API charges.

Other usage

Data organization features in Dataplex (lake, zone, or asset setup), as well as security policy application and propagation, are provided free of charge.

In addition, some Dataplex functionality (including scheduled data quality and data ingestion tasks, and Dataplex managed connectors for ingesting metadata from Cloud SQL and Looker) triggers job execution using Dataproc Serverless, BigQuery, Dataflow, and Cloud Scheduler. That usage is charged according to the Dataproc, BigQuery, Dataflow, and Cloud Scheduler pricing models, respectively, and the charges appear under Dataproc, BigQuery, and Dataflow instead of Dataplex.

Dataplex processing pricing

Dataplex standard and premium processing are metered by the Data Compute Unit (DCU). DCU-hour is an abstract billing unit for Dataplex and the actual metering depends on the individual features you use.

Dataplex standard processing pricing

The Dataplex standard tier covers the data discovery functionality that discovers metadata across Dataplex managed data. Prices for standard processing depend on the region you choose.

Dataplex free tier

As part of the Google Cloud Free Tier, Dataplex offers some resources free of charge up to a specific limit. These free usage limits are available during and after the free trial period. If you go over these usage limits and are no longer in the free trial period, you will be charged according to the pricing as described in the sections above.

Resource | Monthly free usage limit
Dataplex processing | 100 DCU-hours

Dataplex premium processing pricing

The Dataplex premium processing tier covers the data exploration workbench, data lineage, data quality, and data profiling.

DCU charges for each feature are calculated as follows:

  • For data exploration workbench, the DCU-hour is calculated based on the compute consumption of the session.

  • For data lineage, the DCU-hour is proportional to the processing involved in automatically parsing lineage.

    For detailed examples on calculating the data lineage cost, see Estimate data lineage pricing.

  • For data profiling and data quality, the DCU-hour consumption is proportional to the processing involved in profiling the data and computing the data quality metrics. This is billed per second, with a minimum of one minute.
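The per-second metering with a one-minute minimum for data profiling and data quality can be sketched as follows. This is an illustration only: the function name and the idea of a job holding a constant number of DCUs for its whole run are assumptions made for the example, not details stated on this page.

```python
import math


def billable_dcu_hours(run_seconds: float, dcus: float) -> float:
    """Estimate DCU-hours for one scan.

    Assumes (hypothetically) that the job uses a constant `dcus` for its
    whole duration. Usage is metered per second, with a 60-second minimum.
    """
    # Round partial seconds up, then apply the one-minute minimum.
    billed_seconds = max(math.ceil(run_seconds), 60)
    return dcus * billed_seconds / 3600.0


# A 20-second scan is billed as 60 seconds.
short_scan = billable_dcu_hours(20, dcus=4)
# A 15-minute scan is billed for its actual duration.
long_scan = billable_dcu_hours(900, dcus=4)
```

Multiplying the result by the per-DCU-hour price for your region gives an estimated charge for the scan.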

Dataplex shuffle storage pricing

Shuffle storage pricing covers any disk storage specified in the environments configured for the data exploration workbench.

Catalog pricing

This section describes the pricing for Dataplex Catalog and Data Catalog. For more information about the differences between Dataplex Catalog and Data Catalog, see Dataplex Catalog versus Data Catalog.

Dataplex Catalog charges apply to metadata storage for Dataplex Catalog including metadata stored for data lineage. These charges are effective Aug 1, 2024.

Data Catalog charges apply to metadata storage for Data Catalog and API calls made to the Data Catalog API.

Metadata storage and API call charges accrue daily. You can view unbilled usage on the Google Cloud console.

Metadata storage pricing

Metadata storage is measured in gibibytes (GiB), where 1 GiB is 1,073,741,824 bytes. Dataplex Catalog and Data Catalog measure the average amount of the stored metadata during a short time interval. For billing, these measurements are combined into a one-month average, which is multiplied by the monthly rate.
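The averaging described above can be approximated with a short calculation. This is a minimal sketch, assuming the measurements are taken at equally spaced intervals so that a plain arithmetic mean stands in for the one-month average; the function name and the sample values are hypothetical.

```python
GIB = 1_073_741_824  # bytes in one gibibyte


def monthly_storage_charge(samples_bytes, rate_per_gib_month):
    """Estimate a monthly metadata storage charge.

    `samples_bytes` holds stored-metadata sizes in bytes, assumed to be
    measured at equally spaced intervals across the month.
    """
    average_gib = sum(samples_bytes) / len(samples_bytes) / GIB
    return average_gib * rate_per_gib_month


# Four hypothetical measurements: 2, 4, 4, and 6 GiB average out to 4 GiB.
samples = [2 * GIB, 4 * GIB, 4 * GIB, 6 * GIB]
charge = monthly_storage_charge(samples, rate_per_gib_month=2.0)
```

Because the charge follows the average rather than the peak, metadata that exists for only part of the month contributes proportionally less.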

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

Dataplex Catalog storage pricing

Metadata storage charges (including those for entries and aspects) are billed to the project where the respective resource was created.

Monthly average storage | Price (USD)
Any | $2 per GiB per month

When a resource in Data Catalog is made simultaneously available in Dataplex Catalog, you are charged for only one active instance of such resource.

Data Catalog storage pricing

Monthly average storage | Price (USD)
Up to 1 MiB | No charge
Over 1 MiB | $2 per GiB per month

API pricing

This section describes the pricing for Dataplex Catalog and Data Catalog APIs.

Dataplex Catalog API charges

As users interact with Dataplex Catalog, API calls for the following are free of charge:

  • Creating and managing Dataplex Catalog resources
  • Creating and managing lineage resources, except for lineage that is automatically harvested
  • Catalog search

Data Catalog API charges

Data Catalog API calls are billed as described in the following table:

API calls | Price (USD)
1 million in a month | No charge
Over 1 million in a month | $10 per 100,000 API calls
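As a rough illustration of the tiers above, the following sketch estimates a monthly Data Catalog API charge. It assumes that only calls beyond the first 1 million are billed and that partial blocks of 100,000 calls are prorated; the page does not spell out either detail, and the function name is hypothetical.

```python
FREE_CALLS_PER_MONTH = 1_000_000
PRICE_PER_100K_CALLS = 10.0  # USD, from the table above


def data_catalog_api_charge(calls_in_month: int) -> float:
    """Estimate the monthly API charge in USD.

    Assumption: the first 1 million calls are free and the remainder is
    billed linearly at $10 per 100,000 calls.
    """
    billable_calls = max(calls_in_month - FREE_CALLS_PER_MONTH, 0)
    return billable_calls / 100_000 * PRICE_PER_100K_CALLS


within_free_tier = data_catalog_api_charge(800_000)
above_free_tier = data_catalog_api_charge(1_500_000)
```

Under these assumptions, 800,000 calls cost nothing and 1.5 million calls cost $50, because 500,000 calls fall above the free allowance.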

If you pay in a currency other than USD, the prices listed in your currency on Cloud Platform SKUs apply.

Dataplex Catalog pricing examples

This section provides examples of how to calculate the Dataplex Catalog cost.

Small aspects

  • User A creates and applies small aspects (1024 bytes each). For $10 per month, the user can store 5 GiB of metadata, which corresponds approximately to 5M aspects. Assuming one aspect per table, this amounts to a total of 5M tables with aspects.

  • User B creates 5M aspects of 1 KB each on the 10th of the month, and deletes the aspects on the 20th. The cost is $3.33, calculated as the monthly charge for 5 GiB of data multiplied by the one-third of the month during which the aspects were stored:

5 GiB * $2 per GiB per month * 1/3 month = $3.33

Large aspects

  • User C creates and applies large aspects (10 KB each). For $10 per month, the user can store 5 GiB of metadata, which corresponds to approximately 500k aspects. Assuming one aspect per table, it amounts to a total of 500k tables with aspects.

  • User D creates 10 aspect types (for example, ETL, data governance, data quality), and applies large aspects (10 KB each) using each of the 10 aspect types. For $10 per month, the user can store 5 GiB of metadata, which corresponds approximately to 500k aspects. Assuming 10 aspects per table, it amounts to a total of 50k tables with aspects.
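The arithmetic behind these examples can be reproduced with a few lines of code. This is a sketch under stated assumptions: it uses the $2 per GiB per month Dataplex Catalog rate, treats 1 KB as 1,024 bytes (as in the small-aspect example), and ignores any storage overhead beyond the aspect payload. All function names are hypothetical.

```python
GIB = 1024 ** 3
RATE_PER_GIB_MONTH = 2.0  # USD, Dataplex Catalog storage rate


def aspects_for_budget(monthly_budget_usd: float, aspect_bytes: int) -> int:
    """Number of aspects of a given size that a monthly budget can store."""
    storable_bytes = monthly_budget_usd / RATE_PER_GIB_MONTH * GIB
    return int(storable_bytes // aspect_bytes)


def tables_covered(aspect_count: int, aspects_per_table: int) -> int:
    """Number of tables that can each carry the given number of aspects."""
    return aspect_count // aspects_per_table


def prorated_cost(stored_gib: float, fraction_of_month: float) -> float:
    """Cost of storing metadata for only part of a month."""
    return stored_gib * RATE_PER_GIB_MONTH * fraction_of_month


small = aspects_for_budget(10, 1024)        # User A: about 5M aspects
large = aspects_for_budget(10, 10 * 1024)   # User C: about 500k aspects
tables = tables_covered(large, 10)          # User D: about 50k tables
user_b = round(prorated_cost(5, 1 / 3), 2)  # User B: 3.33
```

The exact counts come out slightly above the rounded figures in the examples (for instance, 5,242,880 small aspects rather than 5M) because a GiB is 2^30 bytes.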
