This module is an advanced course in statistics and data analysis, which focuses on introducing students to frontier statistical learning techniques with R/Python programming. This module will illustrate how such statistical tools can aid in data analysis and in solving problems in insurance and finance. The module covers many prominent topics in statistical learning, including resampling methods, model selection and regularization, decision tree and random forest, neural networks, and support vector machine.
A. Explain and apply linear regression and classification for data analysis. B. Identify and implement resample methods, including various cross-validation methods and Bootstrap. C. Explain the shrinkage methods, Lasso, and Ridge regression. D. Classify and make use of regression and classification trees E. Illustrate and utilize the concepts of Bagging, Random Forest, and Boosting in data analysis. F. Explain and apply basic neural networks for regression and classification purposes. G. Identify and implement support vector machine for data analysis.
This module is delivered through formal lectures and computer labs