If you have a spreadsheet containing several fields of data (as in the example on the right), you can use Excel formulas to highlight duplicate rows in your spreadsheet.
To highlight duplicate rows in the example spreadsheet, first use the & operator to collate the data from columns A - C into column D. For example, the formula to be entered into cell D2 is:
In the above results spreadsheet, the dates of birth are shown as numeric values at the end of the combined text string. This is because Excel stores dates as numeric values and it is these underlying values that are displayed in the combined text strings.
As we are simply using the combined text strings to identify duplicate rows, we are not concerned with the appearance of the data in column D and so, for the purpose of this example, we will continue to work with the simple concatenation shown above.
However, if you did want to tidy up the data in column D, you could do this by adding spaces between the fields and using the Text function to display the dates as recognisable dates. The formula in cell D2 would then become:
Absolute & Relative References in the Countif Function
The Countif function used in this example uses an absolute reference for the first reference to cell D$2 (shown by the $ sign), and relative references for all other cell references.
Therefore, when the formula is copied down the rows, the initial reference to cell D$2 remains fixed while the remaining references are adjusted to refer to cells D3, D4, etc.
Once columns A - C are collated into column D, we need to highlight duplicates in the contents of column D. This can be done using the Countif function.
The function to be entered into cell E2 is:
Note that the Countif function used in this example uses a combination of Absolute and Relative Cell References.
The spreadsheet below shows column E populated with the Countif function:
The results of the formulas are shown in the spreadsheet below. It is seen that the duplicate entry in row 10 has the value "2" in cell E7, showing that this is a duplicate.
As an additional feature, in the spreadsheet below, Conditional Formatting has been used to highlight rows in which the value in column E is greater than 1.