Census Store

Every Census workspace includes a Census Store catalog, which is used to store and retrieve data you create within Census.

Last updated 28 days ago

The data stored in Census Store includes:

  • SaaS datasets

  • CSV datasets

  • Entity Resolution datasets

  • ...plus AI Columns, Enrichment Columns, and Warehouse Writeback logs for all of these datasets

Your workspace’s Census Store catalog is created for you the first time you create one of these resources.

You can find Census Store settings by clicking Settings in the Census left navigation to open the Workspace Settings page, then selecting the Census Store tab.

Where data is stored

By default, data in Census Store is stored in Census-provided object storage in your workspace’s region. The same applies to sync metadata, such as Sync Tracking snapshots and API Inspector logs, for datasets stored in Census Store.

You may also choose to have Census Store use your own object storage provider. This allows Census to manage data on your behalf, while also maintaining strong guarantees that your data at rest is stored within your cloud.

Census Store is only compatible with AWS S3-based storage at this time.

Using an alternative object storage provider

To use a new or existing object storage location to store the data in your Census Store catalog:

  1. Click Settings in the left navigation to open Workspace Settings, then select the Census Store tab.

  2. Under the Storage Provider heading, find the current storage location for your workspace catalog data.

  3. Click the name of the current storage location to open a dropdown of available options, then choose one of the following:

    1. Select New storage location to configure a new S3-based storage location to store your workspace catalog data.

    2. Select an existing Custom Object Storage Provider in your account, including any currently configured workspace or organization default provider.

Changing your Census Store catalog’s storage location after you begin using Census Store can break existing datasets and syncs. If you need to migrate existing Census Store data to a new storage location, contact Census support.

Iceberg Catalog

Zero-Copy (or Zero-ETL) access lets you query external data, such as Census Store, directly from your existing data warehouse through a standardized catalog and table format like Apache Iceberg, without first replicating the data into your warehouse with ELT. Support for this functionality varies by warehouse and is growing.

Endpoints

The Census Store Iceberg REST Catalog endpoints are regional. You must use the endpoint for the region associated with your workspace; your Census Store catalog cannot be accessed through another region's endpoint.

Census data plane region    Iceberg REST Catalog endpoint
US                          https://catalog.us.getcensus.com/api/catalog
EU                          https://catalog.eu.getcensus.com/api/catalog
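
Because a catalog is only reachable through its own region's endpoint, it can help to resolve the endpoint from the region in one place instead of hardcoding URLs across scripts. The following is a minimal sketch: the two URLs are the ones listed above, while the `CATALOG_ENDPOINTS` mapping and the `catalog_endpoint` helper are hypothetical names used only for illustration.

```python
# Regional Iceberg REST Catalog endpoints, as listed in this section.
CATALOG_ENDPOINTS = {
    "US": "https://catalog.us.getcensus.com/api/catalog",
    "EU": "https://catalog.eu.getcensus.com/api/catalog",
}


def catalog_endpoint(region: str) -> str:
    """Return the endpoint for a Census data plane region (hypothetical helper)."""
    key = region.strip().upper()
    try:
        return CATALOG_ENDPOINTS[key]
    except KeyError:
        # Fail loudly rather than silently falling back to another region.
        raise ValueError(f"Unknown Census data plane region: {region!r}") from None


print(catalog_endpoint("eu"))
```

Raising on an unknown region, rather than defaulting to one, surfaces a misconfigured region early instead of as a failed catalog lookup later.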

Authentication

The Census Store Iceberg REST Catalog uses OAuth2 client credentials authentication. To configure your tool or library to access the REST catalog, you will need to provide the REST catalog endpoint, catalog name, a client ID, and client secret.

Your credential provides read-only access to the data in Census Store. It is not possible to write catalog data using external tools.
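
As an illustration of how these four values fit together, here is a sketch using PyIceberg, one Python library that can talk to an Iceberg REST Catalog. PyIceberg's REST catalog accepts a `credential` property in `client_id:client_secret` form for the OAuth2 client credentials flow. Passing the Census catalog name as the `warehouse` property is an assumption, not something this page confirms, and the helper names are hypothetical.

```python
def rest_catalog_properties(endpoint: str, catalog_name: str,
                            client_id: str, client_secret: str) -> dict:
    """Build PyIceberg REST catalog properties (hypothetical helper)."""
    return {
        "type": "rest",
        "uri": endpoint,
        # Assumption: the Census catalog name is supplied as `warehouse`.
        "warehouse": catalog_name,
        # PyIceberg exchanges "id:secret" for a token via client credentials.
        "credential": f"{client_id}:{client_secret}",
    }


def connect(properties: dict):
    """Open the catalog; requires the third-party `pyiceberg` package."""
    from pyiceberg.catalog import load_catalog
    return load_catalog("census_store", **properties)


props = rest_catalog_properties(
    "https://catalog.us.getcensus.com/api/catalog",
    "my_catalog_name",      # placeholder: use the Catalog Name from Settings
    "my_client_id",         # placeholder
    "my_client_secret",     # placeholder
)
# catalog = connect(props)            # needs network access and real credentials
# print(catalog.list_namespaces())    # read-only operations only
```

Since the credential is read-only, only read operations such as listing namespaces or scanning tables would be expected to succeed.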

Create Credential

To create new OAuth2 client credentials:

  1. In Census, navigate to Workspace Settings.

  2. Choose the Census Store tab.

  3. Under Iceberg Catalog, note your workspace’s Endpoint and Catalog Name, then click Create Client Credential.

  4. The newly-created Client ID and Client Secret are displayed on the screen.

The client secret is only visible when the credential is first created; store it securely.
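
One common way to keep the secret out of source code is to read it from environment variables populated by your secrets manager. The sketch below assumes two variable names, `CENSUS_STORE_CLIENT_ID` and `CENSUS_STORE_CLIENT_SECRET`; both names, and the `load_client_credentials` helper, are hypothetical and not defined by Census.

```python
import os

# Hypothetical environment variable names; choose whatever suits your setup.
ID_VAR = "CENSUS_STORE_CLIENT_ID"
SECRET_VAR = "CENSUS_STORE_CLIENT_SECRET"


def load_client_credentials(env=os.environ) -> tuple:
    """Return (client_id, client_secret) from the environment, or raise."""
    missing = [name for name in (ID_VAR, SECRET_VAR) if not env.get(name)]
    if missing:
        # Report which variables are absent without echoing any secret values.
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return env[ID_VAR], env[SECRET_VAR]
```

If the secret is lost, it cannot be displayed again; revoke the credential and create a new one instead.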

Revoke Credential

To revoke an existing client credential:

  1. In Census, navigate to Workspace Settings.

  2. Choose the Census Store tab.

  3. Under Iceberg Catalog, locate the credential you want to revoke and click the trash can icon (Revoke Credential). The credential is destroyed and can no longer be used to authenticate to the Iceberg REST Catalog.

Integrate with Census Store

You can use Census Store's Iceberg catalog to integrate the data in Census Store with third-party tools and systems.

Census Store data is stored in the Apache Iceberg format, and Census provides an Iceberg REST Catalog you can use to access this data from external tools and services that support Iceberg REST Catalog, like Apache Spark, DuckDB, and Snowflake.

To integrate Census Store with your Snowflake warehouse using Apache Iceberg for Zero-ETL access to SaaS and CSV datasets, see Query Census Store from Snowflake.
