LogoLogo
  • 🦩Overview
  • 💾Datasets
    • Overview
    • Core Concepts
      • Columns & Annotations
      • Type & Property Mappings
      • Relationships
    • Basic Datasets
      • dbt Integration
      • Sigma Integration
      • Looker Integration
    • SaaS Datasets
    • CSV Datasets
    • Streaming Datasets
    • Entity Resolution
    • AI Columns
      • AI Prompts Recipe Book
    • Enrichment Columns
      • Quick Start
      • HTTP Request Enrichments
    • Computed Columns
    • Version Control
  • 📫Syncs
    • Overview
    • Triggering & Scheduling
    • Retry Handling
    • Live Syncs
    • Audience Syncs
    • Observability
      • Current Sync Run Overview
      • Sync History
      • Sync Tracking
      • API Inspector
      • Sync Alerts
      • Observability Lake
      • Datadog Integration
      • Warehouse Writeback
      • Sync Lifecycle Webhooks
      • Sync Dry Runs
    • Structuring Data
      • Liquid Templates
      • Event Syncs
      • Arrays and Nested Objects
  • 👥Audience Hub
    • Overview
    • Creating Segments
      • Segment Priorities
      • Warehouse-Managed Audiences
    • Experiments and Analysis
      • Audience Match Rates
    • Activating Segments
    • Calculated Columns
    • Data Preparation
      • Profile Explorer
      • Exclusion Lists
  • 🧮Data Sources
    • Overview
    • Available Sources
      • Amazon Athena
      • Amazon Redshift
      • Amazon S3
      • Azure Synapse
      • ClickHouse
      • Confluent Cloud
      • Databricks
      • Elasticsearch
      • Kafka
      • Google AlloyDB
      • Google BigQuery
      • Google Cloud SQL for PostgreSQL
      • Google Pub/Sub
      • Google Sheets
      • Greenplum
      • HTTP Request
      • HubSpot
      • Materialize
      • Microsoft Fabric
      • MotherDuck
      • MySQL
      • PostgreSQL
      • Rockset
      • Salesforce
      • SingleStore
      • Snowflake
      • SQL Server
      • Trino
  • 🛫Destinations
    • Overview
    • Available Destinations
      • Accredible
      • ActiveCampaign
      • Adobe Target
      • Aha
      • Airship
      • Airtable
      • Algolia
      • Amazon Ads DSP (AMC)
      • Amazon DynamoDB
      • Amazon EventBridge
      • Amazon Pinpoint
      • Amazon Redshift
      • Amazon S3
      • Amplitude
      • Anaplan
      • Antavo
      • Appcues
      • Apollo
      • Asana
      • AskNicely
      • Attentive
      • Attio
      • Autopilot Journeys
      • Azure Blob Storage
      • Box
      • Bloomreach
      • Blackhawk
      • Braze
      • Brevo (formerly Sendinblue)
      • Campaign Monitor
      • Canny
      • Channable
      • Chargebee
      • Chargify
      • ChartMogul
      • ChatGPT Retrieval Plugin
      • Chattermill
      • ChurnZero
      • CJ Affiliate
      • CleverTap
      • ClickUp
      • Constant Contact
      • Courier
      • Criteo
      • Crowd.dev
      • Customer.io
      • Databricks
      • Delighted
      • Discord
      • Drift
      • Drip
      • Eagle Eye
      • Emarsys
      • Enterpret
      • Elasticsearch
      • Facebook Ads
      • Facebook Product Catalog
      • Freshdesk
      • Freshsales
      • Front
      • FullStory
      • Gainsight
      • GitHub
      • GitLab
      • Gladly
      • Google Ads
        • Customer Match Lists (Audiences)
        • Offline Conversions
      • Google AlloyDB
      • Google Analytics 4
      • Google BigQuery
      • Google Campaign Manager 360
      • Google Cloud Storage
      • Google Datastore
      • Google Display & Video 360
      • Google Drive
      • Google Search Ads 360
      • Google Sheets
      • Heap.io
      • Help Scout
      • HTTP Request
      • HubSpot
      • Impact
      • Insider
      • Insightly
      • Intercom
      • Iterable
      • Jira
      • Kafka
      • Kevel
      • Klaviyo
      • Kustomer
      • Labelbox
      • LaunchDarkly
      • LinkedIn
      • LiveIntent
      • Loops
      • Mailchimp
      • Mailchimp Transactional (Mandrill)
      • Mailgun
      • Marketo
      • Meilisearch
      • Microsoft Advertising
      • Microsoft Dynamics
      • Microsoft SQL Server
      • Microsoft Teams
      • Mixpanel
      • MoEngage
      • Mongo DB
      • mParticle
      • MySQL
      • NetSuite
      • Notion
      • OneSignal
      • Optimizely
      • Oracle Database
      • Oracle Eloqua
      • Oracle Fusion
      • Oracle Responsys
      • Orbit
      • Ortto
      • Outreach
      • Pardot
      • Partnerstack
      • Pendo
      • Pinterest
      • Pipedrive
      • Planhat
      • PostgreSQL
      • PostHog
      • Postscript
      • Productboard
      • Qualtrics
      • Radar
      • Reddit Ads
      • Rokt
      • RollWorks
      • Sailthru
      • Salesforce
      • Salesforce Commerce Cloud
      • Salesforce Marketing Cloud
      • Salesloft
      • Segment
      • SendGrid
      • Sense
      • SFTP
      • Shopify
      • Singular
      • Slack
      • Snapchat
      • Snowflake
      • Split
      • Sprig
      • Statsig
      • Stripe
      • The Trade Desk
      • TikTok
      • Totango
      • Userflow
      • Userpilot
      • Vero Cloud
      • Vitally
      • Webhooks
      • Webflow
      • X Ads (formerly Twitter Ads)
      • Yahoo Ads (DSP)
      • Zendesk
      • Zoho CRM
      • Zuora
    • Custom & Partner Destinations
  • 📎Misc
    • Credits
    • Census Embedded
    • Data Storage
      • Census Store
        • Query Census Store from Snowflake
        • Query Census Store locally using DuckDB
      • General Object Storage
      • Bring Your Own Bucket
        • Bring your own S3 Bucket
        • Bring your own GCS Bucket
        • Bring your own Azure Bucket
    • Developers
      • GitLink
      • Dataset API
      • Custom Destination API
      • Management API
    • Security & Privacy
      • Login & SSO Settings
      • Workspaces
      • Role-based Access Controls
      • Network Access Controls
      • SIEM Log Forwarding
      • Secure Storage of Customer Credentials
      • Digital Markets Act (DMA) Consent for Ad Platforms
    • Health and Usage Reporting
      • Workspace Homepage
      • Product Usage Dashboard
      • Observability Toolkit
      • Alerts
    • FAQs
Powered by GitBook
On this page
  • Getting Started
  • Using the Advanced Sync Engine
  • Notes
  • Need help connecting to Trino?

Was this helpful?

  1. Data Sources
  2. Available Sources

Trino

This page describes how to use Trino or Starburst as a source in Census.

PreviousSQL ServerNextOverview

Last updated 7 months ago

Was this helpful?

Getting Started

Census can use Trino (and any supported Trino catalog) as a source. Census has been tested with the open-source Trino as well as Starburst Galaxy and Starburst Enterprise.

  • Open Census and navigate to the Sources page.

  • Click New Source and select Trino from the list.

  • Configure your Trino connection:

    • Enter your Trino host name

    • Enter your Trino user name

    • Enter your Trino password

    • (Optional) If your Trino instance does not run on port 443, enter the port here. Census requires a TLS connection to Trino.

  • You’re all set! Head over to the Syncs page to activate your data.

How to find your Hostname in Starburst:

Census utilizes JDBC in order to connect to Trino/Starburst. In order to get the correct hostname to input into your Census connection you will want to follow the instructions to get your JDBC url for your desired cluster.

Once you have the JDBC url you'll only need to input the subdomain into the Census hostname field.

Example JDBC url: jdbc:trino://census-example-cluster.trino.galaxy.starburst.io:443?user=mytestuser@getcensus.com/accountadmin Hostname value to input into Census census-example-cluster.trino.galaxy.starburst.io

Using the Advanced Sync Engine

  • Your Trino cluster must have a catalog named CENSUS containing a schema named CENSUS

  • The connector you use for the CENSUS catalog must support:

    • CREATE TABLE and DROP TABLE

    • Table writes, including row-level INSERT, DELETE, and UPDATE operations. Tables that support these operations are sometimes called "transactional" tables in Trino connector documentation, although Census does not require true ACID transactions for the Advanced Sync Engine.

    • The CREATE OR REPLACE TABLE statement, which was added to Trino in October 2023 and is available in some Starburst releases (check the release notes for your Starburst Enterprise version or for Starburst Galaxy).

  • The account (service account or user) that Census uses to connect to your Trino cluster must have full permissions for the CENSUS.CENSUS schema.

We have successfully tested Census' Advanced Sync Engine with the following configurations - it's possible that other configurations are supported, and we encourage you to use the Census connection tester to obtain diagnostics if needed:

  • CENSUS catalog using the MySQL, Postgres, and Snowflake connectors in read-write mode

  • CENSUS catalog using the Iceberg connector with S3 object storage and the AWS Glue catalog

Census is unable to provide any additional Trino table options (such as location) to the WITH clause when creating or managing tables in the CENSUS catalog, so please ensure your catalog and schema are configured with any needed default table options.

Notes

As of December 2023, Warehouse Writeback is not yet supported but is coming soon - please reach out to your Census account executive for details.

Need help connecting to Trino?

Trino supports both of Census' : Basic and Advanced. In order to use the Advanced Sync Engine with Trino, all of the following must be true:

CENSUS catalog using the connector, on both the AWS Glue catalog and the Starburst Galaxy catalog

via support@getcensus.com or start a conversation with us via the chat.

🧮
⚙️
linked here in Starburst's documentation
Starburst Delta Lake
Contact us
in-app
Sync Engines