Data Stream FAQs

  • Updated

Click the links below to learn more about Data Stream.

Can you describe what Data Stream is?

Data Stream provides you with easy access to all the Demandbase Platform data. Instead of having to export from multiple places within the Demandbase Platform, all the data will be delivered to your preferred data warehouse or storage daily.  Most of the data is audience-based; therefore, you do need to create audiences on the Demandbase Platform in order for the Data Stream data to make sense.

What data is included in the Data Stream?

The dataset that is delivered to you contains a consistent type of data in 7 data categories. Below is an overview of what will be included in the package that is delivered to you: 

  1. Audience: All the audiences currently in customer’s Demandbase instance.
  2. Profiles: All the profiles created and also the audiences that are linked to the profile.
  3. Custom Attributes: Based on your company’s set up, this includes all the custom attributes for all the accounts within your account. For example, salesforce ID, salesforce opportunity, etc.
  4. Keywords (Intent): All the keywords for each of the accounts in every audience.  We also show which keywords are trending and how many times users have researched the keywords.
  5. Historical Keywords: This looks at all the keyword performance in the past 12 months in every 30-day period.
  6. Ad Campaign Performance: All the campaign performance data (including impressions clicks, etc.)  that are linked to the audience. Also included are two separate tables for the campaign performance on the account level and in the historical rollup view.
  7. Site Analytics: Details about all the incoming traffic to your website including the pages visited and by what companies.

If most of the data is on the Demandbase Platform, how can Data Stream be useful to me?

There are two key reasons why Data Stream is useful to your company:

  1. You will find more data in Data Stream. Data Stream contains all the data on the Demandbase Platform and more such as account scores. 
  2. No more manual process. You don’t need to complete the daily export and import tasks which can save you a lot of time. All of the data will be automatically delivered to you. With Data Stream, all the Demandbase data can be stored in the same place as all other data in your company which gives you a holistic view and allows you to gain insights from a large dataset.

What are the advantages of Data Stream delivery over a public API for data acquisition?

Demandbase wants to start offering Data Stream with data delivery before offering a public API to make it convenient for you. Within Data Stream, there can be tens of millions of rows for each customer. Even with batch API calls, that could translate to hundreds of thousands of calls per day. This is not a scalable way for you to access the data. Here are a few advantages over pulling the data yourself:

  • Tens of millions of rows of data delivered daily, at-once. This saves you time because you can easily access all the data.
  • Avoids “missing” data due to operational issues. Data Stream supports full redelivery upon request.
  • Support for integrated delivery to cloud data warehouses such as BigQuery and RedShift which also provide programmatic SQL APIs  (REST, JDBC, ODBC) and human/manual SQL UIs.
  • Provides the option to deliver a large-data raw format to cloud storage for processing on-demand. Ideal for engineers and scientists to do further ETL and/or analysis with technologies like Spark and Hadoop.

Who are the ideal consumers in my company for Data Stream?

The key purpose of Data Stream is to enable customers to build their own dashboards and run ML models; therefore, the main users are data scientists, analysts and technical marketing operation teams. A good example of the persona would be a data analyst who has a solid understanding of a company’s data structure and is proficient in SQL and is also an expert in Excel and Tableau. One of the key job functions is to work with multiple stakeholders in the marketing and product teams and to collect BRD for dashboards to drive insights they want to gather.  

The Analytics and BI team will be the direct users of Data Stream data to build dashboards for their internal teams such as marketing and sales. From the Demandbase product team, we will be looking to collect two sets of feedback from the analytics team for data related feedback and the marketing team for business related feedback.

How often will Demandbase deliver the data to me?

Demandbase will deliver the data package to your designated data warehouse everyday including weekends at 7 am EST which includes data from the previous day (unless you specify other times of the day.)  In the initial release, there will not be an option to change the delivery time as it takes time to collect all the data from different parts of the Demandbase Platform and to ensure accuracy of the rollup.

How can I expect the data to be structured in the daily delivery package?

If the data is being delivered to the data warehouse such as BigQuery, you will receive the ongoing partition table.  However, if you have requested the data to be delivered in parquet format, all the data will be in raw format which gives you 100% freedom to handle the data your way. 

Ongoing Partition Table: You will receive the same table everyday with additional rows containing new daily data. An extra date stamp column will be added to the table so you know when the data is entered into the table. The benefit of this is that you can use the latest table to query from instead of having to add new tables everyday into your data warehouse.

What is the format of the data?

A data package can be delivered in two forms: 

  1. It is delivered into your data warehouse such as BigQuery and Redshift.
  2. It is delivered into your data storage service such as GCS, Amazon S3, Azure Blob and your own data storage. When delivering into the data storage service, the package will be in parquet format.

How much storage do I need in order to use Data Stream?

The daily data package from Demandbase is between 5GB and 15GB in size (uncompressed). If the data is being delivered into the data warehouse, you can specify the length of the historical data you want to keep in the system.  If you have a tight quota on storage utilization, you can have your system purge the data that’s older than 6 months.

Is Data Stream compatible with the data warehouse I’m using?

Data Stream will support direct delivery into Google BigQuery, Amazon Redshift and Microsoft Cosmo DB. If you have a cloud data storage service or do not want the data to be delivered into your data warehouse, Data Stream will support delivering into folders in Google GCS, Amazon S3 and Microsoft Azure Blob.

Are there any limitations or restrictions on the data delivery?

You can only choose one delivery mechanism which means Demandbase will not support delivering the data into S3 and also into BigQuery.

How do you deliver the data if I do not use any of the data warehouses?

If you do not use any of the supported data warehouses, you will need to use any of the supported cloud storage services for the Data Stream data package to be delivered.

Can I use Data Stream if I host my own data center?

Unfortunately Demandbase will only support data delivery to either cloud data warehouses or cloud data storage services at this point. If you do not use any of these and are interested in Data Stream, please share your use case with your account manager or CSM.

Can I request additional data if the data I need is not part of Data Stream?

Yes. Since this is the first time Demandbase is allowing our customers to have access to the data directly, we are happy to work with any interested customers to make sure they can make the best out of Data Stream. If you feel the data you get from Data Stream is inadequate, please reach out to your CSM or account manager who will work directly with the Demandbase product team for an ideal solution for you.

In the case of losing data or if the data gets corrupted, how can Demandbase help?

When this happens, you should send an email request for the data packages to be re-delivered at audiencemart@demandbase.com.  You need to specify the dates of the packages you want to be re-delivered. You may request any data redelivery from within the past 60 days. Once we receive the request, we will confirm the content and the date when the information can be redelivered. Please allow up to two business days for the redelivery. Once the data is re-delivered, we will notify you through email.

Is there a process for me as a customer to validate or to preview the data before making the purchase?

You can request for a trial data package to understand what is included in Data Stream and how you can use the data on a daily basis. For the trial request, we can provide a 1-day data delivery. Demandbase will need to collect information needed for us to set up the trial which includes the data warehouse or storage service to deliver the data to and the date to start the delivery.

What is the process to get started with Data Stream?

If you are interested in Data Stream, please reach out to your account manager and CSM who will put you in touch with the product team to make sure you have the right setup to make the best of Data Stream. The product team at Demandbase will work directly with you to answer any questions you may have.  There are some configuration options you will need to select before we can get the data delivery set up for you. Once you are confirmed to move forward with Data Stream, you will work with your Demandbase account manager to have the order form processed and signed. You will continue to receive support from the product team if you need additional help.

How fresh is the data?

The data you receive everyday will be up to 12 am EST (5 am UTC time) of the same day.  Since the data is generated daily, you might see some information discrepancies between the data you just received and the data you see in Demandbase Platform at 12 pm EST because Data Stream does not  include the data in that 12-hour gap.

What are the configuration options for using Data Stream?

In order to properly and promptly deliver data to you, there is some information we need to collect from you. The format of how we collect the information can just be an Excel form sent to you from the Demandbase product team. We will also need a technical contact to work with at your company because of the complexity of the product. The following is some of the data we would need from you:

  • Trial or not
  • Account name & ID
  • Contact person’s name & title
  • Contact person’s email & phone number
  • Technical contact name if different from contact
  • Technical contact email & phone number if different from contact
  • Date to start the delivery
  • Delivery destination: BigQuery / Redshift / Azure Synapse  / GCS / S3 / Azure Blob / Other
  • Other delivery information

How can I be sure of the accuracy of the data and be sure the data from Data Stream is the same as what I can see on the Demandbase Platform?

The data from Data Stream is from the same database which is feeding the information to the Demandbase Platform; therefore, there should not be any data discrepancy between those two.  However, since Data Stream is delivered daily and the data is cut off at the end of the day, you will not have the data of the same day within Data Stream which might cause a difference in data.

How can Data Stream help my operation in targeting and engagement on Demandbase?

By using Data Stream, you can build your own dashboard or data warehouse with the data you get from multiple sources.  You can drive a new set of insights about your customers and campaigns. You can then use that information to come back to the Demandbase Platform to apply to an existing/new audience and campaign for better performance.  For example, you may find out there are different sets of keywords you can apply to the audience which can be more effectively targeted with your ad campaigns.

How do I measure how much using Data Stream helps me?

There are two key improvements with Data Stream that can help your company: Time and Insights. With Data Stream, you can save time from the manual process of daily exporting and importing data through automated delivery.  The other improvement is the amount of insights you can get from Data Stream. Since different teams could be using the data in their own way, you can measure the impact of using Data Stream by the number of new active dashboards and reports created using Data Stream data.  If you want to measure in a more granular level, you can have a good idea of the amount of improvements by evaluating the metrics tied to those new dashboards. If data is being used to generate recommendations for the sales team, you can measure how effective those recommendations are and how much revenue the recommendations lead to.

Was this article helpful?

6 out of 6 found this helpful