Quick Start
Entity Resolution helps you de-duplicate and associate records across your datasets.
Step 1: Log in to Census and select the New Dataset button on the Datasets tab.
Step 2: Fill in the Dataset name and choose "De-Duplication" as the transformation type.
Step 3: Select the dataset(s) that you want to use as source datasets. These are the datasets that contain duplicate records.
Step 4: Define Match Rules, the criteria used to identify duplicate records.
Fuzzy Match can help match values that are similar but not identical.
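To make the idea of fuzzy matching concrete, here is a minimal Python sketch of similarity-based comparison. It is only an illustration of the general technique: the function names, the normalization, and the 0.85 threshold are assumptions for this example, and it uses Python's standard difflib rather than whatever algorithm Census applies internally.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1] after trimming and lowercasing."""
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def is_fuzzy_match(a: str, b: str, threshold: float = 0.85) -> bool:
    """Treat two values as a match when they are similar enough.

    The threshold is a hypothetical value chosen for this sketch.
    """
    return similarity(a, b) >= threshold


# Near-identical spellings match; unrelated values do not.
print(is_fuzzy_match("Jon Smith", "John Smith"))
print(is_fuzzy_match("Acme Corp", "Globex"))
```

An exact-match rule would treat "Jon Smith" and "John Smith" as different people; a similarity threshold lets such near-duplicates be grouped together, at the cost of possible false positives if the threshold is set too low.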
Step 5: Define Merge Rules to identify the winning record.
When no merge rules are defined, or when the merge rules fail to identify a winning record, Census uses the lower of the unique IDs to identify the winning record.
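The fallback behavior described above can be sketched in Python as follows. This is an illustrative model, not Census's implementation: the record layout, the `pick_winner` helper, and the `most_recently_updated` rule are all hypothetical names introduced for this example.

```python
from typing import Callable, Optional, Sequence

Record = dict
# Hypothetical merge rule: returns the winning record, or None if it cannot decide.
MergeRule = Callable[[Sequence[Record]], Optional[Record]]


def most_recently_updated(records: Sequence[Record]) -> Optional[Record]:
    """Hypothetical rule: the record with the single latest 'updated_at' wins."""
    dated = [r for r in records if r.get("updated_at") is not None]
    if not dated:
        return None
    latest = max(r["updated_at"] for r in dated)
    winners = [r for r in dated if r["updated_at"] == latest]
    # A tie means this rule cannot pick a winner.
    return winners[0] if len(winners) == 1 else None


def pick_winner(
    records: Sequence[Record],
    merge_rules: Sequence[MergeRule] = (),
    id_field: str = "id",
) -> Record:
    """Apply merge rules in order; fall back to the lowest unique ID."""
    for rule in merge_rules:
        winner = rule(records)
        if winner is not None:
            return winner
    # No rules defined, or no rule identified a winner.
    return min(records, key=lambda r: r[id_field])


duplicates = [
    {"id": 42, "email": "a@example.com", "updated_at": "2024-01-05"},
    {"id": 17, "email": "a@example.com", "updated_at": "2024-01-05"},
]

# The rule ties on 'updated_at', so the record with the lower ID is chosen.
print(pick_winner(duplicates, [most_recently_updated])["id"])
```

In this model the lowest-ID fallback guarantees that every group of duplicates resolves to exactly one winning record, even when no rule can decide.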
Step 6 (Optional): Override column values of the winning record.
Step 7: Review and confirm the configuration. Click Confirm and Census will start running Entity Resolution across your dataset.
Census creates an additional Lineage column on your resolved dataset to explore and debug the resolution.