dropduplicates¶
Purpose¶
Drops duplicate observations from data.
Format¶
-
x_new =
dropduplicates(x[, varlist])¶ Parameters: - x (matrix or dataframe) – data
- varlist (string array) – Optional, list of variables to include in the check for duplicates. Default is across all variables.
Returns: dup_report (dataframe) – Returns a dataframe with duplicate observations from
xremoved.
Examples¶
new;
// Create file name with full path
fname = getGAUSSHome("examples/tips2.dta");
// Load the dataframe
tips2 = loadd(fname);
// Locate and remove the duplicate observations
tips_no_dups = dropduplicates(tips2);
See also
Functions getduplicates(), isunique()