LogoLogo
  • 🦩Overview
  • 💾Datasets
    • Overview
    • Core Concepts
      • Columns & Annotations
      • Type & Property Mappings
      • Relationships
    • Basic Datasets
      • dbt Integration
      • Sigma Integration
      • Looker Integration
    • SaaS Datasets
    • CSV Datasets
    • Streaming Datasets
    • Entity Resolution
    • AI Columns
      • AI Prompts Recipe Book
    • Enrichment Columns
      • Quick Start
      • HTTP Request Enrichments
    • Computed Columns
    • Version Control
  • 📫Syncs
    • Overview
    • Triggering & Scheduling
    • Retry Handling
    • Live Syncs
    • Audience Syncs
    • Observability
      • Current Sync Run Overview
      • Sync History
      • Sync Tracking
      • API Inspector
      • Sync Alerts
      • Observability Lake
      • Datadog Integration
      • Warehouse Writeback
      • Sync Lifecycle Webhooks
      • Sync Dry Runs
    • Structuring Data
      • Liquid Templates
      • Event Syncs
      • Arrays and Nested Objects
  • 👥Audience Hub
    • Overview
    • Creating Segments
      • Segment Priorities
      • Warehouse-Managed Audiences
    • Experiments and Analysis
      • Audience Match Rates
    • Activating Segments
    • Calculated Columns
    • Data Preparation
      • Profile Explorer
      • Exclusion Lists
  • 🧮Data Sources
    • Overview
    • Available Sources
      • Amazon Athena
      • Amazon Redshift
      • Amazon S3
      • Azure Synapse
      • ClickHouse
      • Confluent Cloud
      • Databricks
      • Elasticsearch
      • Kafka
      • Google AlloyDB
      • Google BigQuery
      • Google Cloud SQL for PostgreSQL
      • Google Pub/Sub
      • Google Sheets
      • Greenplum
      • HTTP Request
      • HubSpot
      • Materialize
      • Microsoft Fabric
      • MotherDuck
      • MySQL
      • PostgreSQL
      • Rockset
      • Salesforce
      • SingleStore
      • Snowflake
      • SQL Server
      • Trino
  • 🛫Destinations
    • Overview
    • Available Destinations
      • Accredible
      • ActiveCampaign
      • Adobe Target
      • Aha
      • Airship
      • Airtable
      • Algolia
      • Amazon Ads DSP (AMC)
      • Amazon DynamoDB
      • Amazon EventBridge
      • Amazon Pinpoint
      • Amazon Redshift
      • Amazon S3
      • Amplitude
      • Anaplan
      • Antavo
      • Appcues
      • Apollo
      • Asana
      • AskNicely
      • Attentive
      • Attio
      • Autopilot Journeys
      • Azure Blob Storage
      • Box
      • Bloomreach
      • Blackhawk
      • Braze
      • Brevo (formerly Sendinblue)
      • Campaign Monitor
      • Canny
      • Channable
      • Chargebee
      • Chargify
      • ChartMogul
      • ChatGPT Retrieval Plugin
      • Chattermill
      • ChurnZero
      • CJ Affiliate
      • CleverTap
      • ClickUp
      • Constant Contact
      • Courier
      • Criteo
      • Crowd.dev
      • Customer.io
      • Databricks
      • Delighted
      • Discord
      • Drift
      • Drip
      • Eagle Eye
      • Emarsys
      • Enterpret
      • Elasticsearch
      • Facebook Ads
      • Facebook Product Catalog
      • Freshdesk
      • Freshsales
      • Front
      • FullStory
      • Gainsight
      • GitHub
      • GitLab
      • Gladly
      • Google Ads
        • Customer Match Lists (Audiences)
        • Offline Conversions
      • Google AlloyDB
      • Google Analytics 4
      • Google BigQuery
      • Google Campaign Manager 360
      • Google Cloud Storage
      • Google Datastore
      • Google Display & Video 360
      • Google Drive
      • Google Search Ads 360
      • Google Sheets
      • Heap.io
      • Help Scout
      • HTTP Request
      • HubSpot
      • Impact
      • Insider
      • Insightly
      • Intercom
      • Iterable
      • Jira
      • Kafka
      • Kevel
      • Klaviyo
      • Kustomer
      • Labelbox
      • LaunchDarkly
      • LinkedIn
      • LiveIntent
      • Loops
      • Mailchimp
      • Mailchimp Transactional (Mandrill)
      • Mailgun
      • Marketo
      • Meilisearch
      • Microsoft Advertising
      • Microsoft Dynamics
      • Microsoft SQL Server
      • Microsoft Teams
      • Mixpanel
      • MoEngage
      • Mongo DB
      • mParticle
      • MySQL
      • NetSuite
      • Notion
      • OneSignal
      • Optimizely
      • Oracle Database
      • Oracle Eloqua
      • Oracle Fusion
      • Oracle Responsys
      • Orbit
      • Ortto
      • Outreach
      • Pardot
      • Partnerstack
      • Pendo
      • Pinterest
      • Pipedrive
      • Planhat
      • PostgreSQL
      • PostHog
      • Postscript
      • Productboard
      • Qualtrics
      • Radar
      • Reddit Ads
      • Rokt
      • RollWorks
      • Sailthru
      • Salesforce
      • Salesforce Commerce Cloud
      • Salesforce Marketing Cloud
      • Salesloft
      • Segment
      • SendGrid
      • Sense
      • SFTP
      • Shopify
      • Singular
      • Slack
      • Snapchat
      • Snowflake
      • Split
      • Sprig
      • Stripe
      • The Trade Desk
      • TikTok
      • Totango
      • Userflow
      • Userpilot
      • Vero Cloud
      • Vitally
      • Webhooks
      • Webflow
      • X Ads (formerly Twitter Ads)
      • Yahoo Ads (DSP)
      • Zendesk
      • Zoho CRM
      • Zuora
    • Custom & Partner Destinations
  • 📎Misc
    • Credits
    • Census Embedded
    • Data Storage
      • Census Store
        • Query Census Store from Snowflake
        • Query Census Store locally using DuckDB
      • General Object Storage
      • Bring Your Own Bucket
        • Bring your own S3 Bucket
        • Bring your own GCS Bucket
        • Bring your own Azure Bucket
    • Developers
      • GitLink
      • Dataset API
      • Custom Destination API
      • Management API
    • Security & Privacy
      • Login & SSO Settings
      • Workspaces
      • Role-based Access Controls
      • Network Access Controls
      • SIEM Log Forwarding
      • Secure Storage of Customer Credentials
      • Digital Markets Act (DMA) Consent for Ad Platforms
    • Health and Usage Reporting
      • Workspace Homepage
      • Product Usage Dashboard
      • Observability Toolkit
      • Alerts
    • FAQs
Powered by GitBook
On this page
  • Overview
  • Getting Started
  • How It Works
  • Data Storage and Access
  • Querying Your Data
  • Data Lifecycle
  • Features and Capabilities
  • Data Access Methods
  • Best Practices
  • Security and Compliance

Was this helpful?

  1. Datasets

SaaS Datasets

Census allows you to create datasets directly from your CRM systems like HubSpot and Salesforce, making it easy to work with your business data alongside your warehouse data. This guide explains how SaaS Datasets work and how to get started.

Overview

SaaS Datasets allow you to:

  • Import any object from supported CRMs (including custom objects)

  • Control which fields are imported for each object

  • Automatically refresh data on your schedule (hourly, daily, or never)

  • Use all Census features with your CRM data (AI columns, enrichments, deduplication)

  • Create segments and syncs using your CRM data

  • Query the data from your existing data warehouse using Census's Iceberg catalog

Getting Started

To create your first SaaS Dataset:

  1. Navigate to the Datasets tab in Census

  2. Click + New Dataset in the top-right corner

  3. Select Import Dataset from SaaS App

  4. Choose your CRM system (HubSpot or Salesforce)

  5. Select the objects and fields you want to import

  6. Configure your refresh schedule

  7. Save your dataset

How It Works

Data Storage and Access

  1. Your data is imported using your CRM's API credentials

  2. The data is stored in Apache Iceberg format in Census Store

  3. Census provides an Iceberg catalog that makes this data queryable from your data warehouse

  4. Data is automatically refreshed based on your specified schedule

Querying Your Data

Because SaaS Datasets are stored in Apache Iceberg format, you can query them directly from your data warehouse using federated queries - meaning you don't need to copy the data into your warehouse first. This is sometimes called "Zero-Copy" or "Zero-ETL" access.

For example, if you're using Snowflake, you can query your SaaS Dataset tables just like any other table:

SELECT * FROM CENSUS_CATALOG.YOUR_WORKSPACE.HUBSPOT_CONTACTS;

Data Lifecycle

  • Data is automatically refreshed based on your configured schedule

  • When you delete a dataset, all associated data is permanently removed from storage

  • You maintain full control over which objects and fields are imported

Features and Capabilities

SaaS Datasets can be used with all Census features:

Data Access Methods

Your SaaS Dataset data is accessible through:

  1. Census UI - Browse and manage your datasets directly

  2. Your Data Warehouse - Query the data using federated queries via the Census Iceberg catalog

Best Practices

  • Start with a subset of fields to optimize initial load times

  • Set refresh schedules based on your data update frequency

  • Monitor your API usage to stay within CRM limits

  • Regularly review and clean up unused datasets

Security and Compliance

  • All data is encrypted at rest and in transit

  • Access controls are managed through Census permissions

  • Data storage follows Census's security and compliance standards

  • Option to use your own storage infrastructure for additional control

PreviousLooker IntegrationNextCSV Datasets

Last updated 15 days ago

Was this helpful?

When you create a SaaS Dataset, Census imports your selected CRM objects into , our managed storage solution. Here's what happens:

By default, data is stored in Census's secure infrastructure, but you can also for additional control.

See the for detailed instructions on setting up and using federated queries for your specific warehouse.

Add for AI-powered data enrichment

Apply

Perform

Create for targeted marketing

Build to other destinations

For more details about data storage, security, and querying options, see our .

💾
Census Store
AI Columns
third-party enrichments
deduplication
segments
syncs
Census Store documentation
use your own storage infrastructure
Census Store documentation