
Hanze




Data

Exercises


Exercise 1

Download this dataset. Open it with a text editor (Notepad on Windows or TextEdit on a Mac).

For each of the following columns, describe what kind of data it holds and on which scale it is measured.

Column       Data type    Scale
Drug Name
NDC
Price

Exercise 2

Have a look at some raw data below:

Protein symbol: HBA1, HBA2, HBB, HBD, HBM
Length (aa): 142, 142, 147, 147, 141

Create a data table of the above data.
Make sure that the observations (or records) and variables (or features) are organized appropriately. Also sort the data based on the protein length (primary) and on protein symbol (secondary).
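The organizing and sorting step can be sketched in a few lines of Python. This is only one possible layout: the field names `protein_symbol` and `length_aa` are made up for this illustration, and ascending order is an assumption because the exercise does not state a direction.

```python
# Raw data from the exercise, one list per variable.
symbols = ["HBA1", "HBA2", "HBB", "HBD", "HBM"]
lengths_aa = [142, 142, 147, 147, 141]

# One record (row) per protein, one field (column) per variable.
# The field names are illustrative, not prescribed by the exercise.
records = [
    {"protein_symbol": symbol, "length_aa": length}
    for symbol, length in zip(symbols, lengths_aa)
]

# Sort on length first (primary key); ties are broken by symbol (secondary key).
# Ascending order is an assumption.
records_sorted = sorted(
    records, key=lambda r: (r["length_aa"], r["protein_symbol"])
)

print("protein_symbol\tlength_aa")
for r in records_sorted:
    print(f'{r["protein_symbol"]}\t{r["length_aa"]}')
```

Sorting on a tuple key compares the first element and only looks at the second when the first elements are equal, which is exactly the primary/secondary behaviour asked for.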

Exercise 3

Heat shock in biology refers to the cellular response to sudden exposure to elevated temperatures. This stress response triggers the production of heat shock proteins (HSPs), which help cells maintain proper protein folding, prevent protein aggregation, and promote cell survival under heat stress conditions. We can apply different temperatures to induce a heat shock for human cells cultured in a laboratory.

No heat shock is applied if a temperature of 98.6°F is used for 30 minutes.
A medium heat shock is applied if a temperature of 104.0°F is used for 30 minutes.
A strong heat shock is applied if a temperature of 107.6°F is used for 30 minutes.

What is the ratio for “medium” and “strong” compared to no heat shock?
Calculate the fold induction.
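Before settling on an answer, it may help to check whether the ratio depends on the unit. Fahrenheit and Celsius have arbitrary zero points (interval scale), while Kelvin has a true zero (ratio scale). The sketch below computes the same ratios in all three units; the helper names are hypothetical, and which interpretation the exercise intends is left open.

```python
def fahrenheit_to_celsius(temp_f):
    """Convert a temperature from degrees Fahrenheit to degrees Celsius."""
    return (temp_f - 32.0) * 5.0 / 9.0


def celsius_to_kelvin(temp_c):
    """Convert a temperature from degrees Celsius to kelvin."""
    return temp_c + 273.15


def ratios_to_baseline(values, baseline="none"):
    """Divide every value by the value of the baseline condition."""
    return {name: value / values[baseline] for name, value in values.items()}


# Temperatures from the exercise, in degrees Fahrenheit.
conditions_f = {"none": 98.6, "medium": 104.0, "strong": 107.6}
conditions_c = {k: fahrenheit_to_celsius(v) for k, v in conditions_f.items()}
conditions_k = {k: celsius_to_kelvin(v) for k, v in conditions_c.items()}

ratios_f = ratios_to_baseline(conditions_f)
ratios_c = ratios_to_baseline(conditions_c)
ratios_k = ratios_to_baseline(conditions_k)

for unit, ratios in (("F", ratios_f), ("C", ratios_c), ("K", ratios_k)):
    print(unit, {name: round(r, 3) for name, r in ratios.items()})
```

The three units give three different ratios for the same physical temperatures, which is worth keeping in mind when deciding whether a "fold induction" is meaningful here.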

Exercise 4

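The R tidyverse packages work best with tidy data.
Remember that there are three interrelated rules which make a dataset tidy:

Each variable must have its own column.
Each observation must have its own row.
Each value must have its own cell.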

Convert the data table below to a data table that is tidy.

Cell growth in a 6-well plate over time for various cell types.

Cell type    Day 1     Day 2
HEK293       1.2E04    2.2E04
A549         1.1E04    2.6E04
K562         0.9E04    1.9E04

Also state which of the three tidy-data rules have not been met.
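Reshaping a wide table into a long one can be sketched in plain Python as follows. The helper `to_long` and the column names `day` and `cell_count` are hypothetical choices for this illustration; in R, the tidyr function `pivot_longer` performs this kind of reshaping.

```python
# The wide table from the exercise: the day is spread over the column headers.
wide = [
    {"cell_type": "HEK293", "Day 1": 1.2e4, "Day 2": 2.2e4},
    {"cell_type": "A549", "Day 1": 1.1e4, "Day 2": 2.6e4},
    {"cell_type": "K562", "Day 1": 0.9e4, "Day 2": 1.9e4},
]


def to_long(rows, id_column, value_columns, names_to, values_to):
    """Turn each (row, value column) pair into one row of its own."""
    long_rows = []
    for row in rows:
        for column in value_columns:
            long_rows.append(
                {
                    id_column: row[id_column],
                    names_to: column,
                    values_to: row[column],
                }
            )
    return long_rows


tidy = to_long(wide, "cell_type", ["Day 1", "Day 2"], "day", "cell_count")

# Store the day as a number instead of a label such as "Day 1".
for row in tidy:
    row["day"] = int(row["day"].split()[-1])

for row in tidy:
    print(row["cell_type"], row["day"], row["cell_count"], sep="\t")
```

Each row of the result describes one cell type on one day, so the three wide rows become six long rows.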

Exercise 5

Here is another untidy data table. Describe which of the three tidy-data rules have not been met, and convert the table to a tidy one.

Cell type    Day 1, Day 2
HEK293       1.2E04, 2.2E04
A549         1.1E04, 2.6E04
K562         0.9E04, 1.9E04
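When a single cell holds several values, the first step is to split it. The sketch below does this in plain Python, assuming the second column is stored as comma-separated text exactly as shown above; the output column names `day` and `cell_count` are illustrative.

```python
# The untidy table: one column holds two values per cell.
header = "Day 1, Day 2"
untidy = [
    {"cell_type": "HEK293", header: "1.2E04, 2.2E04"},
    {"cell_type": "A549", header: "1.1E04, 2.6E04"},
    {"cell_type": "K562", header: "0.9E04, 1.9E04"},
]

# The header itself lists which day each position in the cell refers to.
day_labels = [label.strip() for label in header.split(",")]

tidy_rows = []
for row in untidy:
    # Split the cell into its separate values and parse them as numbers.
    values = [float(part) for part in row[header].split(",")]
    for label, value in zip(day_labels, values):
        tidy_rows.append(
            {
                "cell_type": row["cell_type"],
                "day": int(label.split()[-1]),  # "Day 2" -> 2
                "cell_count": value,
            }
        )

for r in tidy_rows:
    print(r["cell_type"], r["day"], r["cell_count"], sep="\t")
```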

Exercise 6

Here we have a tidy data table. Spread the data set to make it wider and untidy.

Cell type    Day    Medium    Number of cells
HEK293       1      DMEM      2.5E04
HEK293       1      RPMI      2.7E04
HEK293       2      DMEM      5.4E04
HEK293       2      RPMI      7.8E04
A549         1      DMEM      1.2E04
A549         1      RPMI      1.4E04
A549         2      DMEM      2.5E04
A549         2      RPMI      5.2E04
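Going the other way, from long to wide, can be sketched as below. Spreading the Medium column is an assumption made for this illustration; spreading Day instead would be equally valid. The key names are hypothetical, and in R the tidyr function `pivot_wider` covers this kind of reshaping.

```python
# The tidy table: (cell type, day, medium, number of cells).
tidy = [
    ("HEK293", 1, "DMEM", 2.5e4),
    ("HEK293", 1, "RPMI", 2.7e4),
    ("HEK293", 2, "DMEM", 5.4e4),
    ("HEK293", 2, "RPMI", 7.8e4),
    ("A549", 1, "DMEM", 1.2e4),
    ("A549", 1, "RPMI", 1.4e4),
    ("A549", 2, "DMEM", 2.5e4),
    ("A549", 2, "RPMI", 5.2e4),
]

# Collect one wide row per (cell type, day) and add one column per medium.
wide = {}
for cell_type, day, medium, n_cells in tidy:
    row = wide.setdefault(
        (cell_type, day), {"cell_type": cell_type, "day": day}
    )
    row[medium] = n_cells

wide_rows = list(wide.values())

print("cell_type\tday\tDMEM\tRPMI")
for row in wide_rows:
    print(row["cell_type"], row["day"], row["DMEM"], row["RPMI"], sep="\t")
```

The medium now lives in the column headers rather than in a column of its own, so the eight tidy rows collapse into four wider rows.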



This web page is distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Creative Commons License: CC BY-SA 4.0.