I have been working on Excel datasets for a few weeks now. While those datasets did not necessarily need to be cleaned, there may come a time when a dataset crops up that is riddled with inconsistencies. For that reason, I decided to delve into the subject of cleaning datasets using Excel.
Here are ten ways that a dataset can be cleaned in Excel.
One. Remove duplicates
Use the Remove Duplicates feature (circled in red) in the Data tab of the ribbon to remove duplicates:-
Below is a screenshot of a dataset before the duplicates have been removed:-
When the Remove Duplicates feature is selected, a window will pop up, and the column or columns to check for duplicates need to be selected:-
Below is a screenshot of the dataset after the duplicates have been removed:-
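For readers who want to see the logic behind the feature, here is a minimal Python sketch of the same idea: keep the first row for each combination of values in the selected columns and drop the later repeats. The function name `remove_duplicates`, the column names and the sample rows are all made up for illustration. The sketch compares values exactly, which may not match Excel's own matching rules in every case.

```python
def remove_duplicates(rows, key_columns):
    """Keep the first row for each distinct combination of key_columns."""
    seen = set()
    unique_rows = []
    for row in rows:
        # Build the comparison key from the selected columns only
        key = tuple(row[col] for col in key_columns)
        if key not in seen:
            seen.add(key)
            unique_rows.append(row)
    return unique_rows


# Hypothetical sample data, not taken from the screenshots
rows = [
    {"Name": "Ann", "City": "Leeds"},
    {"Name": "Bob", "City": "York"},
    {"Name": "Ann", "City": "Leeds"},
    {"Name": "Ann", "City": "Bath"},
]

# Checking only the Name column treats every "Ann" row as a duplicate
by_name = remove_duplicates(rows, ["Name"])

# Checking both columns removes only rows that match on Name and City
by_name_and_city = remove_duplicates(rows, ["Name", "City"])

print(len(by_name))
print(len(by_name_and_city))
```

The choice of columns matters: the more columns are selected, the fewer rows count as duplicates, because a row is only removed when it matches an earlier row on every selected column.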