Remove Duplicate Rows

Deletes duplicate rows from the table, optionally based on specified columns.

Overview

This procedure removes duplicate rows from a table. Two rows count as duplicates when they hold the same values in every uniqueness column; if no columns are specified, all columns are compared. It is useful for cleaning up data and removing redundancy.
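The matching rule can be illustrated outside the platform with a short Python sketch. This is not the procedure's actual implementation: the function name `remove_duplicate_rows` and the row-as-dictionary representation are hypothetical, and the sketch assumes the first occurrence of each duplicate group is the one kept, which this page does not state.

```python
from typing import Any, Dict, List, Optional, Sequence

Row = Dict[str, Any]


def remove_duplicate_rows(
    table: List[Row],
    uniqueness_columns: Optional[Sequence[str]] = None,
) -> List[Row]:
    """Return the rows of `table` with duplicates dropped.

    Assumption: the first row of each duplicate group is kept.
    Cell values must be hashable (strings, numbers, and so on).
    """
    seen = set()
    result = []
    for row in table:
        # With no uniqueness columns given, compare every column of the row.
        columns = uniqueness_columns if uniqueness_columns else sorted(row)
        key = tuple((col, row.get(col)) for col in columns)
        if key not in seen:
            seen.add(key)
            result.append(row)
    return result
```

Building the key from only the chosen columns is what makes the uniqueness columns optional: with none given, the key covers the whole row.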

Input Concepts

Concept | Type | Description | Default | Required
table | table | The table from which duplicate rows should be deleted. | No default | Yes
uniqueness columns | list of strings | Column names to consider for identifying duplicates. | All columns are used by default. | No

Output Concepts

Concept | Description
table | The updated table with duplicate rows removed.

Examples

1. Removing Duplicates - Default Behavior

By default, all columns are considered when removing duplicates.

create a sample table
remove the duplicates from the table

2. Removing Duplicates by Specifying Columns to Consider

create a sample table
remove the duplicates from the sample table where
    the uniqueness columns are "name", "age"
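To see how the two examples differ, the following Python sketch applies both behaviors to the same data. The sample rows, the `city` column, and the helper `dedupe` are invented for illustration, and keeping the first matching row is an assumption rather than documented behavior.

```python
# Hypothetical sample table: rows 1 and 2 are identical,
# and row 3 differs from them only in "city".
rows = [
    {"name": "Ann", "age": 30, "city": "Oslo"},
    {"name": "Ann", "age": 30, "city": "Oslo"},
    {"name": "Ann", "age": 30, "city": "Rome"},
    {"name": "Bob", "age": 25, "city": "Oslo"},
]


def dedupe(rows, columns=None):
    """Keep the first row seen for each distinct key (assumed behavior)."""
    kept = {}
    for row in rows:
        # No columns given: build the key from every column.
        key = tuple(row[c] for c in (columns or sorted(row)))
        kept.setdefault(key, row)
    return list(kept.values())


all_columns = dedupe(rows)                    # as in example 1
by_name_age = dedupe(rows, ["name", "age"])   # as in example 2

print(len(all_columns))   # 3: only the exact copy of row 1 is dropped
print(len(by_name_age))   # 2: all three "Ann", 30 rows collapse into one
```

Comparing fewer columns treats more rows as duplicates, so the second call removes the Rome row even though its city differs.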
