Overview of Pendo Data Sync

Pendo Data Sync lets you push data out of Pendo into your data lake or warehouse and business intelligence (BI) tools so that you can run complex queries and analyses. With Pendo data blended with other key data sources, you can:

  • Measure the impact of product improvements on sales and renewals.
  • Identify friction in your end-to-end user journey, from top-of-funnel acquisition to in-product activation.
  • Calculate a comprehensive customer health score across data sources to shape renewal strategy.
  • Create a churn-risk model from a foundation of product usage and sentiment signals.
  • Identify up-sell and cross-sell opportunities to drive data-informed account growth.

Prerequisites

Data Sync is a paid feature. Contact your Pendo representative for access. You can also try Data Sync on a subset of data first. For information, see Test export in this article.

You must be a subscription admin in Pendo to set up Data Sync.

How it works

To start, choose one of the following cloud storage services as the destination for your Pendo data: Google Cloud Storage, Amazon S3, or Microsoft Azure Storage. Data Sync then sends every event captured in Pendo to that destination in Avro file format.

You can set up recurring daily exports and backfill up to three calendar years of historical data. For example, if you create an export on December 31, 2024, the export can extend back to January 1, 2021. Pendo also automatically sends updated files whenever a Page or Feature rule is added or updated in Pendo.
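
The backfill window in the example above can be sketched in a few lines of Python. This is only an illustration: the function name earliest_backfill_date is hypothetical, and the rule it encodes (go back to January 1 of the year three years before the export's creation year) is an assumption inferred from the single example in this article, not a documented Pendo formula.

```python
from datetime import date


def earliest_backfill_date(export_created: date, calendar_years: int = 3) -> date:
    """Return January 1 of the year `calendar_years` before the export's year.

    Assumption: the window is counted in whole calendar years, which matches
    the article's example (December 31, 2024 -> January 1, 2021).
    """
    return date(export_created.year - calendar_years, 1, 1)


# The article's example: an export created on December 31, 2024.
print(earliest_backfill_date(date(2024, 12, 31)))  # 2021-01-01
```

Check the backfill range offered in the Data Sync settings when you create the export rather than relying on this calculation.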

You then create an ETL pipeline to move data from your cloud storage service into a data lake or warehouse, such as Snowflake, Databricks, Google BigQuery, or Amazon Redshift. The Data Sync documentation provides guidance on the file schema and best practices for setting up your Data Sync ETL pipeline.

After the data is loaded into your data lake or warehouse, your analytics team can blend it with other data sources and push the resulting insights into your BI source of truth, such as Looker, Tableau, Power BI, or Mode.
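
As a rough illustration of the pick-up step in the ETL pipeline described above, the following Python sketch selects Avro files that have not been loaded yet from a listing of storage object keys. Everything specific here is an assumption for illustration only: the function name select_new_avro_files, the example object keys, and the idea of tracking loaded files by key are hypothetical and do not reflect the actual Data Sync file layout.

```python
from typing import Iterable


def select_new_avro_files(object_keys: Iterable[str], already_loaded: set[str]) -> list[str]:
    """Return the .avro object keys that have not been loaded yet, in sorted order."""
    return sorted(
        key
        for key in object_keys
        if key.endswith(".avro") and key not in already_loaded
    )


# Hypothetical object keys; the real Data Sync layout may differ.
keys = [
    "exports/2024-12-30/events-0001.avro",
    "exports/2024-12-31/events-0001.avro",
    "exports/2024-12-31/notes.txt",
]
loaded = {"exports/2024-12-30/events-0001.avro"}

new_files = select_new_avro_files(keys, loaded)
print(new_files)
```

Because Pendo also sends updated files when a Page or Feature rule changes, a real pipeline would need a way to detect and reload re-sent files, which this sketch does not attempt.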

Setup

Data Sync sends Avro files to your cloud storage container, where they can be picked up and loaded into your data lake or warehouse. You can set up only one cloud storage destination per Pendo subscription. Data Sync supports Google Cloud Storage (GCS), Amazon Simple Storage Service (S3), and Microsoft Azure Storage. For instructions, see the setup article for your chosen storage service.

Test export

You can create a single test export containing one day of Pendo data so that your data engineering team can see how your Pendo data appears in Data Sync Avro files and plan the ETL pipeline required to pull Pendo data from your cloud storage.

Go to Settings > Data Sync and start the setup process for one of the three supported cloud storage services. After setting up a destination, create an export and choose the Test export option.
