Data transformation

DUPS: reports unique and duplicate observations in the dataset and, optionally, drops all duplicate observations. The optional varlist determines which observations count as duplicates: observations must match exactly on every variable in the list. If no varlist is given, all variables in the dataset are used. The drop option causes duplicate observations to be dropped from the dataset. The terse and verbose options modify the detail of the default output: terse gives the bare minimum of information, and verbose lists detailed information about the duplicate observations.
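A minimal usage sketch of the varlist and drop behaviour described above. It assumes dups is already installed (for example from SSC) and uses Stata's bundled auto dataset purely for illustration; the choice of variables is arbitrary.

```stata
* Assumption: dups is installed, e.g. via "ssc install dups"
sysuse auto, clear

* No varlist: duplicates are judged on all variables in the dataset
dups

* With a varlist: observations are duplicates if they match on both variables
dups foreign rep78

* Same check, with the minimal output described above
dups foreign rep78, terse

* Drop the duplicate observations identified on these two variables
dups foreign rep78, drop
```

Because drop changes the data in memory, it is safest to run it on a copy or after saving the original dataset.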

EGENMORE: a package that extends egen with additional functions. The egen command, in turn, extends Stata’s standard generate command. The egen suite lets you generate new variables while applying helpful operations and functions in a single command. Maintained by Nicholas J. Cox, Durham University.
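To illustrate how egenmore builds on egen, the sketch below first uses an official egen function and then nvals(), one of the functions added by egenmore. It assumes egenmore is installed; the dataset and the new variable names are chosen only for illustration.

```stata
* Assumption: egenmore is installed, e.g. via "ssc install egenmore"
sysuse auto, clear

* Official egen: mean price within each level of foreign
egen mean_price = mean(price), by(foreign)

* egenmore: number of distinct rep78 values within each level of foreign
egen n_rep78 = nvals(rep78), by(foreign)

* Inspect the result for each group
tabulate foreign n_rep78
```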

SDECODE: (“super decode”) creates an output string variable with values
from the input numeric variable varname, using labels if present and
formats otherwise.
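A short sketch of the two cases described above (labels if present, formats otherwise), assuming sdecode is installed. The auto dataset and the new variable names are illustrative choices.

```stata
* Assumption: sdecode is installed, e.g. via "ssc install sdecode"
sysuse auto, clear

* foreign has a value label, so the new string variable holds the label text
sdecode foreign, generate(foreign_str)

* price has no value label, so its display format is used instead
sdecode price, generate(price_str)

list foreign foreign_str price price_str in 1/5
```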

UNIQUE: reports the number of unique values for the listed variables.
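A minimal usage sketch, assuming unique is installed; the auto dataset is used only as an example.

```stata
* Assumption: unique is installed, e.g. via "ssc install unique"
sysuse auto, clear

* Number of distinct values of a single variable
unique make

* Number of distinct combinations of two variables
unique foreign rep78
```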

XTBALANCE: module that transforms an unbalanced panel dataset into a balanced one over a chosen sample range. Written by Lian Yujun, Department of Finance, Lingnan College, Sun Yat-Sen University.
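The sketch below shows one way the chosen sample range might be applied. It assumes xtbalance is installed and that the range is passed through a range() option giving the first and last period to keep; check the module's help file for the exact syntax. The nlswork example data (downloaded by webuse) and the years 70 to 73 are illustrative.

```stata
* Assumption: xtbalance is installed, e.g. via "ssc install xtbalance"
webuse nlswork, clear

* Declare the panel structure: idcode identifies units, year is the time variable
xtset idcode year

* Describe the (unbalanced) participation pattern before balancing
xtdescribe

* Keep only units observed in every period from 70 to 73 (assumed syntax)
xtbalance, range(70 73)

* Confirm that the remaining panel is balanced
xtdescribe
```

Since balancing discards observations, save the original dataset before running the command.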

XLS2DTA: converts Excel files to Stata datasets using import excel. Names for the Stata datasets are derived from the names of the Excel files (see Remarks). The program is a convenient tool for importing and converting more than one Excel file and/or worksheet at a time.