Re: Excel question
If I have a spreadsheet (Office version 2003) with say 10,000 records, many of which have dupcliated data in every column, other than sorting and manually deleting them, is there a systematic way to delete duplicate rows?
[edit] And if not via Excel what if I converted it to Access?
Thanks.
There is simple formula's you can use to de-dup a spreadsheet as well.
First you would want to sort your data so that the duplicates end up next to each other.
So maybe you sort by the first 3 columns: First name, Last name, Account Number (example).
Then you can add a new column to the end of your data called DUPS that will compare the rows to see if that data matches.
=IF(CONCATENATE(A2,B2,C2)=CONCATENATE(A1,B1,C1),"dup","")
That formula only checks in a single direction, but would catch all of them in the example situation.
Once you have all the DUPS identified you can simply filter on the DUP row to only show ones that say "dup". Then delete all of those rows. Once you unfilter your data you should only have the unique rows you want.
Edit: Yep you figured out the same basic principle.