Google Cloud Storage
This page describes how to use Census with Google Cloud Storage.
Our Google Cloud connector behaves a little differently than other Census connectors. Instead of going through an OAuth connection flow, we provide you a Google Identity address to which you grant Storage Object permissions.
- Your new Google Cloud Storage connection will include a GCP service account for that specific Census connection. Click the copy button () to save it to your clipboard.
- Now head to the Google Cloud Storage bucket that you want to sync to, and click into that bucket's details.
- Click the Permissions tab on the Bucket, then click "Add Permissions".
- Paste the credentials in the New Principals portion and select "Storage Object Admin."
You will be able to send a successful sync if the file path variables are unique per sync run by only granting this service account "Storage Object Creator", but the Test Connection in the next step will fail.
You are good to go!
The sync will move data from your warehouse to your GCS bucket. In this step, you'll define how that will work.
- 2.Under What data do you want to sync?, choose your data warehouse as the Connection and your source: either a Model from the Census Models tab or a database table.
- 3.Under Where do you want to sync data to?, choose the name you assigned in Step 1 (we used GCS) as the Connection. Enter the File Path for the file where data will sync. The path can accept variables that will populate when the sync runs. See File Path Variables. Confirm the file path in the Template Preview field.__If the service account only has "Storage Object Creator" permissions, this file path needs to be unique per sync run💡
- 4.Under How should changes to the source be synced?, Mirror will be automatically selected. This is the only supported sync behavior for GCS.
- 5.Under Which properties should be updated?, choose whether to sync only Selected Properties or Sync All Properties. Syncing all properties will automatically add new properties to the sync if the model or database table changes.
- 6.To test your sync without actually syncing data, click Run Test and verify the results.
- 7.Click the Next button to see the final preview which will have a recap of what will happen when you start the sync. If you're happy, check the Sync Now checkbox and save the sync.
- 8.Confirm the data arrives in the GCS bucket!
When defining the File Path for an GCS sync, you can use variables that will be set when the sync runs. This allows you to create and sync to new files in the GCS bucket that reflect the date and time of the sync.
Update or Create syncs upload your whole dataset on the first run and only new changes on subsequent runs. Each sync run saves to a different file. The first run saves with "full" at the end of the file name. For example,
filename_12_12_23_full.csvif it runs on 12/12/2023. Later syncs save with a timestamp at the end, like
filename_12_12_23_1702426195.csv, so you can see how your data changes over time.
- Currently, the connector only supports syncing for files up to 5GB.
- Data arrives in one file to the designated bucket and file path.