C3760 AI in structural bioinformatics

Přírodovědecká fakulta
podzim 2025

Předmět se v období podzim 2025 nevypisuje.

Rozsah
1/1/0. 3 kr. Ukončení: k.
Vyučováno kontaktně
Vyučující
doc. RNDr. Radka Svobodová, Ph.D. (přednášející)
RNDr. Tomáš Raček, Ph.D. (pomocník)
Garance
doc. RNDr. Radka Svobodová, Ph.D.
Národní centrum pro výzkum biomolekul – Přírodovědecká fakulta
Kontaktní osoba: RNDr. Tomáš Raček, Ph.D.
Dodavatelské pracoviště: Národní centrum pro výzkum biomolekul – Přírodovědecká fakulta
Omezení zápisu do předmětu
Předmět je nabízen i studentům mimo mateřské obory.
Mateřské obory/plány
Cíle předmětu
The course is centered in providing practical skills in data management and data visualization using R for students and researchers, leveraging on the availability and applicability of artificial intelligence (AI) tools focused principally on GitHub Copilot. The objectives of the course are: improve the data manipulation skills using tidyverse package ; develop data visualization knowledge using ggplot2 package; integrate visualization into reports and web applications; application of AI tools in data analysis.
Výstupy z učení
After finishing the course, the students will be able to manipulate data effectively using the tidyverse package, with the ability to get clean, analysis-ready datasets from raw data. The participant will also be able to create visualizations of the data using the ggplot2 package, and to use the obtained plots into dynamic reports and web applications using RMarkdown and Shiny packages respectively. Ultimately, participants will develop the skills to write efficient R code, augmented by AI tools, to increase code reproducibility, efficiency, and troubleshooting.
Osnova
During this course, we will go through the main features of tidyverse package for data manipulation, such as performing mathematical operations, filtering, table joining, reshaping, with the objective to prepare cleaned datasets from raw data. Following, we will discuss the usage of the ggplot2 package, its features and variables, the functions implied and the data structure required to perform the data visualization successfully. Finally, the data manipulation steps and data visualization will be coupled with the usage of RMarkdown and Shiny packages with the objective to check their advantages and disadvantages so they can be applied in real case scenarios depending on the requirements. The usage of AI tools will be focused on GitHub Copilot for reproducibility issues and its integration with RStudio, however other alternatives will be introduced to the participant, and all the sections will include an explanation on a predefined code to achieve the ultimate results (data management, visualization or reporting) and then it will be shown how to use AI tools to obtain comparable results by writing clear and efficient prompts.
Literatura
  • Moon, K. W. (2017). Learn ggplot2 using shiny app. Springer.
  • Quicke, D. L., Butcher, B. A., & Welton, R. A. K. (2020). Practical R for Biologists: An Introduction. CABI.
  • BECKERMAN, Andrew P. a Owen L. PETCHEY. Getting started with R : an introduction for biologists. 1st ed. Oxford: Oxford University Press, 2012, x, 113. ISBN 9780199601615. info
  • YAU, Nathan. Visualize this : the FlowingData guide to design, visualization, and statistics. Indianapolis, Indiana: Wiley, 2011, xxvi, 358. ISBN 9780470944882. info
Výukové metody
Student projects, their presentations and discussions.
Metody hodnocení
Compulsory homework and practical final assignment.
Vyučovací jazyk
Angličtina
Další komentáře
Předmět je vyučován každoročně.
Výuka probíhá každý týden.

  • Permalink: https://is.muni.cz/predmet/sci/podzim2025/C3760