Find duplicates and originals in a spreadsheet using the Unix command line

    Sometimes you need to find and group the duplicated records in a spreadsheet. There are several ways to identify duplicates (see this tutorial for a good one), but I wanted something a bit fancier: not just the duplicates, but each original record as well. I also wanted each set of replicates (the original plus its duplicates) grouped together in the spreadsheet. Here is how I did it using the Unix command line.

    Category: Hacking
    License: Verbatim only
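
As a rough illustration of the task described above, here is a minimal sketch of one possible approach. It is not necessarily the method from the full story. It assumes the spreadsheet has been exported as a plain-text file with one record per line (for example tab-separated), and that two records count as replicates only when their whole lines are identical. The function name `group_duplicates` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: print every record that occurs more than once
# (the original plus each duplicate), grouped into sets separated by
# a blank line. Assumes one record per line and whole-line comparison.
group_duplicates() {
    # Stage 1: read the file twice. The first pass (NR == FNR) counts
    # each line; the second pass prints only lines seen at least twice.
    awk 'NR == FNR { count[$0]++; next } count[$0] > 1' "$1" "$1" |
        # Stage 2: sort so identical records end up next to each other.
        LC_ALL=C sort |
        # Stage 3: insert a blank line whenever the record changes.
        awk 'NR > 1 && $0 != prev { print "" } { print; prev = $0 }'
}
```

Calling `group_duplicates records.tsv > grouped.tsv` (both file names are placeholders) would write the grouped sets to a new file that can be opened in a spreadsheet program again. Records that occur only once are left out, and a header row would be dropped too unless it happens to be repeated.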


