
Hanze




Excel Data Cleaning

Exercises


Exercise 1

This first dataset did not load correctly. Use the Text to Columns feature (on the Data tab) to split the values into separate columns. Source: Predicting Heart Disease Using Clinical Variables
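Outside Excel, the idea behind Text to Columns can be sketched in a few lines of Python, which may help when checking your result. This is only an illustration: the semicolon delimiter, the function name `split_into_columns`, and the sample rows are assumptions, not taken from the exercise file.

```python
import csv
import io

def split_into_columns(raw_text, delimiter=";"):
    """Split each line of delimited text into a list of cell values."""
    reader = csv.reader(io.StringIO(raw_text), delimiter=delimiter)
    return [row for row in reader]

# Made-up sample rows; the real file may use another delimiter.
sample = "age;sex;chol\n63;1;233\n37;1;250\n"
for row in split_into_columns(sample):
    print(row)
```

If the result still shows one long string per row, the delimiter guess is probably wrong, which is the same symptom you see in Excel when the wrong separator is selected.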

Exercise 2

This dataset did not load correctly either. In addition, the cells contain units alongside the values, so Excel stores them as text strings and cannot perform calculations on them. Remove the units to make calculations possible.
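The effect of removing units can be sketched in Python as follows. This is a minimal sketch, assuming each cell starts with the number and the unit follows it; the helper name `strip_unit` and the sample values are invented for illustration. It also assumes a comma is a decimal separator, so a value with a thousands separator such as "1,000" would be misread.

```python
import re

def strip_unit(cell):
    """Return the leading number in a text cell as a float, or None if there is none."""
    match = re.match(r"\s*([-+]?\d+(?:[.,]\d+)?)", cell)
    if match is None:
        return None
    # Assumption: a comma is a decimal separator, not a thousands separator.
    return float(match.group(1).replace(",", "."))

# Made-up cells, not taken from the exercise file.
values = ["12.5 mg", "7 mL", "3,2 kg"]
print([strip_unit(v) for v in values])
```

Once the values are numeric, calculations such as sums and averages work again, which is the goal of the exercise.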

Exercise 3

This dataset contains rows with duplicate data. Load the data and remove the duplicates from the data table.
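For comparison, duplicate removal can be sketched in Python like this. It is a minimal sketch, assuming a row counts as a duplicate only when every cell matches an earlier row, and keeping the first occurrence; the function name `remove_duplicate_rows` and the sample rows are hypothetical.

```python
def remove_duplicate_rows(rows):
    """Keep the first occurrence of each row and drop later exact copies."""
    seen = set()
    unique = []
    for row in rows:
        key = tuple(row)  # lists are not hashable, tuples are
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique

# Made-up rows for illustration.
rows = [["A", "1"], ["B", "2"], ["A", "1"], ["C", "3"]]
print(remove_duplicate_rows(rows))
```

Comparing the row count before and after is a quick way to check how many duplicates were removed.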

Exercise 4

Load the Beta-Lactamase dataset in Excel using the CSV import tool. This dataset contains several CSV files. Use only the first one (CHEMBL1989.csv).

This dataset contains empty data cells.

Make the empty cells more explicit in Excel by converting them to #N/A.
Count how many empty (#N/A) cells the dataset contains.
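The two steps above (marking empty cells and counting them) can be sketched in Python as a cross-check. This is a minimal sketch, assuming a cell is "empty" when it holds nothing but whitespace; the function name `mark_empty_cells` and the sample rows are invented and do not come from CHEMBL1989.csv.

```python
def mark_empty_cells(rows, marker="#N/A"):
    """Replace empty cells with a marker and count how many were replaced."""
    marked = []
    count = 0
    for row in rows:
        new_row = []
        for cell in row:
            if cell.strip() == "":
                new_row.append(marker)
                count += 1
            else:
                new_row.append(cell)
        marked.append(new_row)
    return marked, count

# Made-up rows for illustration.
rows = [["CHEMBL1", "", "5.2"], ["CHEMBL2", "nM", ""]]
marked, n_empty = mark_empty_cells(rows)
print(marked)
print(n_empty)
```

The count returned here should match the number of #N/A cells you find in Excel for the same data.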




This web page is distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Creative Commons License: CC BY-SA 4.0.