MKF_AIIF AI in Finance

Faculty of Economics and Administration
Autumn 2022
Extent and intensity
0/0/0. 6 credits. Type of completion: exam.
Teacher(s)
prof. Ing. Štefan Lyócsa, PhD. (lecturer)
Guaranteed by
prof. Ing. Štefan Lyócsa, PhD.
Department of Finance – Faculty of Economics and Administration
Contact person: Iva Havlíčková
Supplier department: Department of Finance – Faculty of Economics and Administration
Prerequisites
SOUHLAS (consent required for enrolment)
Students are expected to be familiar with basic concepts of Statistics and Econometrics.
Course enrolment limitations
The course is also offered to students outside its home fields of study.
Home fields of study/plans
Course objectives
The ever-increasing amount of data already shapes our understanding of finance. The ability to process, study, interpret, and present such data is a considerable competitive advantage on the job market, and this course gives students the opportunity to work towards it. The course centers on key topics of machine learning, with emphasis on case studies in financial markets, credit and profit scoring, and hedonic price models for real estate and used cars. Key topics include data pre-processing, unsupervised learning methods, and predictive modelling via OLS, LASSO, Ridge, Elastic Net (EN), Complete Subset Regressions, logistic regression, and Random Forest. Basic principles of Gradient Boosting, Support Vector Machines, and other methods are also discussed. Selected principles covered in the course include the handling of data-snooping bias, hyper-parameter tuning, bagging and boosting, and ensemble learning. The course is taught primarily in R.
Learning outcomes
After completing the course, the student should be able to:
  • identify areas where data techniques might be useful,
  • prepare and design a data analysis,
  • understand key concepts of machine learning,
  • program and present an analysis using R or Python.
Syllabus
  • Introduction
    1. Introduction to Artificial Intelligence in Finance – applications in finance, bias-variance trade-off, supervised and unsupervised learning, semi-supervised learning, reinforcement learning, critique of machine learning with respect to over-fitting and model interpretability.
    2. Data pre-processing – handling missing data (part I), outliers, data transformation, feature engineering.
  • Supervised AI in Finance – continuous outcomes
    3. Standard machine learning framework – multivariate regression, interactions, dummies, model/variable selection, in-sample and out-of-sample approach, loss functions for continuous target variables. Data-snooping bias – model confidence set.
    4. Regularization techniques – LASSO, Ridge, Elastic Net. Hyper-parameter tuning – cross-validation, leave-one-out, grid search, complete subset regressions, decision trees, random forest, and support vector machines.
    5. Tree-based methods – decision trees, pre-pruning, post-pruning, bagging, random forest, boosting trees.
  • Supervised AI in Finance – discrete outcomes
    6. Logistic regression – marginal effects. Discrete choice model evaluation – confusion matrix, AUC.
    7. LASSO, Ridge, Elastic Net, Complete Subset Logistic Regression.
    8. Tree-based methods – decision trees, pre-pruning, post-pruning, bagging, random forest, boosting for discrete outcomes.
  • Unsupervised machine learning
    9. Unsupervised machine learning – distance measures, k-means, k-medoids, CLARA, agglomerative clustering, cluster validation and optimization.
    10. Principal component analysis, network-based feature selection techniques.
  • Further topics in machine learning
    11. Forecast combination – ensemble techniques, time-series models, model-based approach to missing data, sample selection.
    12. Further modelling options: Linear Discriminant Analysis, Naïve Bayes classification.
    13. Further modelling options: Support Vector Machines, Neural Networks.
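Topic 6 of the syllabus names the confusion matrix and AUC as evaluation tools for discrete-outcome models. The following is a minimal sketch of both measures, not course material: it is written in plain Python rather than R, the function names are illustrative, and the labels and scores are made-up toy data. AUC is computed here via its pairwise (Mann-Whitney) interpretation, i.e. the share of positive/negative pairs in which the positive case receives the higher score.

```python
def confusion_matrix(y_true, scores, threshold=0.5):
    """Count TP, FP, TN, FN for binary labels (1 = positive, 0 = negative).

    A case is predicted positive when its score is >= threshold.
    The default threshold of 0.5 is an assumption for illustration.
    """
    counts = {"TP": 0, "FP": 0, "TN": 0, "FN": 0}
    for y, s in zip(y_true, scores):
        predicted = 1 if s >= threshold else 0
        if predicted == 1 and y == 1:
            counts["TP"] += 1
        elif predicted == 1 and y == 0:
            counts["FP"] += 1
        elif predicted == 0 and y == 0:
            counts["TN"] += 1
        else:
            counts["FN"] += 1
    return counts


def auc(y_true, scores):
    """Area under the ROC curve via pairwise comparison.

    Each (positive, negative) pair scores 1 if the positive case has the
    higher score, 0.5 on a tie, and 0 otherwise; AUC is the average.
    """
    positives = [s for y, s in zip(y_true, scores) if y == 1]
    negatives = [s for y, s in zip(y_true, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Toy data (hypothetical): true default flags and predicted probabilities.
y_true = [1, 0, 1, 0, 1, 0]
scores = [0.9, 0.2, 0.6, 0.7, 0.4, 0.1]

cm = confusion_matrix(y_true, scores)
area = auc(y_true, scores)
```

The pairwise loop is quadratic in the sample size, which is fine for illustration; for real data sets a rank-based computation or a library routine would be the usual choice.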
Literature
    recommended literature
  • COQUERET, G and T GUIDA. Machine Learning for Factor Investing: R Version. CRC Press, 2020, 341 pp. ISBN 978-0-367-54586-4.
  • CLASTER, W B. Mathematics and Programming for Machine Learning with R: From the Ground Up. CRC Press, 2020, 430 pp. ISBN 978-0-367-56194-9.
  • NWANGANGA, F and M CHAPPLE. Practical Machine Learning in R. Wiley, 2020. ISBN 1-119-59151-1.
  • WILEY, Matt and Joshua F. WILEY. Advanced R statistical programming and data models: analysis, machine learning and visualisation. California: Apress, 2019, xx, 638 pp. ISBN 9781484228715.
  • LANTZ, B. Machine learning with R: expert techniques for predictive modeling. Packt Publishing, 2019. ISBN 978-1-78439-390-8.
  • BERNARDI, M and L CATANIA. The model confidence set package for R. International Journal of Computational Economics and Econometrics. 2018, vol. 8, no. 2, pp. 144-158. Available from: https://dx.doi.org/10.1504/IJCEE.2018.091037.
  • WRIGHT, M N and A ZIEGLER. ranger: A Fast Implementation of Random Forests for High Dimensional Data in C++ and R. Journal of Statistical Software. Los Angeles: Journal of Statistical Software, 2017, vol. 77, no. 1, pp. 1-17. ISSN 1548-7660.
  • JAMES, Gareth R. An introduction to statistical learning: with applications in R. New York: Springer, 2013, xiv, 426 pp. ISBN 9781461471370.
  • ELLIOTT, G, A GARGANO and A TIMMERMANN. Complete subset regressions. Journal of Econometrics. 2013, vol. 177, no. 2, pp. 357-373. Available from: https://dx.doi.org/10.1016/j.jeconom.2013.04.017.
Teaching methods
Lecture notes, problem sets, and case studies are required to pass the course. Because they are required, they are available free of charge in the eLearning module.
Assessment methods
Grading follows the internal guidelines of the Faculty of Economics and Administration of Masaryk University and is based on midterm I (20%), midterm II (20%), and the final exam (60%). A minimum total of 51% is required to pass.
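The weighting above can be sketched as a short calculation. This is an illustration only: the weights and the 51% pass mark come from the grading description, while the function names, the assumption that each component is scored on a 0-100 scale, and the absence of any rounding rule are assumptions not stated in the course description.

```python
# Component weights in percentage points, as given in the grading description.
WEIGHTS = {"midterm_1": 20, "midterm_2": 20, "final_exam": 60}
PASS_MARK = 51.0  # minimum weighted total, in percent


def total_score(scores):
    """Return the weighted total in percent.

    `scores` maps each component name to a percentage in [0, 100];
    that scale is an assumption made for this sketch.
    """
    for name in WEIGHTS:
        if not 0 <= scores[name] <= 100:
            raise ValueError(f"{name} must be between 0 and 100")
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS) / 100


def passes(scores):
    """True when the weighted total reaches the pass mark."""
    return total_score(scores) >= PASS_MARK


# Hypothetical student: 70% and 60% on the midterms, 45% on the final exam.
example = {"midterm_1": 70, "midterm_2": 60, "final_exam": 45}
example_total = total_score(example)   # 0.2*70 + 0.2*60 + 0.6*45
example_passes = passes(example)
```

Because the final exam carries 60% of the weight, the midterms alone (at most 40% of the total) cannot reach the pass mark.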
Language of instruction
English
Further comments
The course is taught annually.
The course is taught in blocks.
Note on teaching frequency: 26/0/0 tutorial 12 hours.
The course is also listed under the following terms: Autumn 2023.