Skip to the content.

Hanze

Data Analysis BML

Course Data Analysis and Visualization

Pic Source: https://www.deviantart.com/gabimedia/art/3D-Illustration-of-human-brain-nerve-4-977052356, Created using AI tools. Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License


Contents

Demo DataSaurus Dozen Dataset

Installation of MS Excel, R en RStudio

Plotting in Excel on a Macbook

Schedule and Outline per Lesson


Introduction

The health sciences and biological sciences are undergoing a transformation, driven by the ever-growing flood of data generated from diverse sources. Genomic sequencing, proteomics, pharmacological data, clinical trials and wearable devices all contribute to a vast and complex landscape of information. To navigate this landscape effectively and gain valuable insights, researchers are increasingly turning to the powerful tools of data analysis and visualization.

Data analysis involves the systematic process of collecting, cleaning, processing, and interpreting health and biology related data. This allows researchers to uncover hidden patterns, identify trends, and understand complex relationships within the data. However, raw data can be overwhelming and difficult to interpret. This is where data visualization comes into play.

By transforming data into clear and concise visual representations such as graphs, charts, and maps, data visualization empowers users to:

In conclusion, data analysis and visualization are critical tools in the modern health sciences and biology. They enable researchers to unlock the hidden potential of health-related data, leading to improved understanding of biological systems and improved healthcare delivery.

In this module, you will learn to distinguish and analyze different types of data. You will also learn to visualize data in an attractive way for reports. The focus will be on using the spreadsheet program Microsoft Excel and the programming language R (using the Tidyverse framework). The assignments can also be completed with Python, but there will be no explanation of Python on this website.

R and Python are currently the two most used programming languages in the field. R is used to gain some familiarity with the possibilities of data analysis with R, but not to teach you all the ins and outs of programming. The assignments that are offered should be completed with Excel and R. This module will conclude with an assignment that you will complete independently.

CureQ

This module is part of the CureQ project. CureQ is a consortium of partners that focuses on polyQ diseases with the aim to enable polyQ targeting therapies to better predict onset and progression of disease of the different patient groups (early-onset, adult-onset and carriers of intermediate repeats). The Hanze University is involved in two work packages of the CureQ project. During this course, we will often use datasets that are linked to various polyQ disorders.

Learning Outcomes


⬆️ Back to Top


Some text on this web page is copied, adapted and modified from Wikipedia.org
For some textual parts, AI (GPT) was used. The output was verified in all cases and modified where needed.
This web page is distributed under the terms of the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Creative Commons License: CC BY-SA 4.0.