Module Catalogues

Statistics and Data Analytics II

Module Title Statistics and Data Analytics II
Module Level Level 4
Module Credits 5.00
Academic Year 2021/22
Semester SEM2

Aims and Fit of Module

This module is an advanced course in statistics and data analysis, with focus on introducing students to frontier statistical learning techniques using applications in R computer-based package. This module will illustrate how such statistical tools can aid in data analysis and in solving problems in insurance and finance. The module covers many prominent topics in statistical learning, including resampling methods, model selection and regularisation, decision tree and random forest, neural networks, and support vector machine.

Learning outcomes

A. Explain and apply linear regression and classification for data analysis. B. Identify and implement resample methods, including various cross-validation methods and Bootstrap. C. Discuss and apply linear model selection and the concept of regularization. D. Explain the shrinkage methods, Lasso and Ridge regression. E. Classify and make use of regression and classification trees. F. Illustrate and utilise the concepts of Bagging, Random forest and Boosting in data analysis. G. Explain and apply basic neural networks for regression and classification purposes. H. Identify and implement support vector machine for data analysis.

Method of teaching and learning

This module is delivered through formal lectures and tutorials.