Module Catalogues, Xi'an Jiaotong-Liverpool University   
Module Code: DTS303TC
Module Title: Big Data Security and Analytics
Module Level: Level 3
Module Credits: 5.00
Academic Year: 2022/23
Semester: SEM1
Originating Department: Shool of AI and Advanced Computing
Pre-requisites: N/A
• To introduce the environment and the main application domains where Big Data Analytics (BDA) takes place;

• To introduce general framework and process of BDA;

• To study technologies, and platforms and , tools that are currently used in BDA;

• To study methods and algorithms that support BDA;

• To gain an understanding of the best practice in BDA.
Learning outcomes 
A. Demonstrate a solid understanding of concepts, processes and issues related to Big Data Analytics (BDA);

B. Determine the appropriate use of analysis methods, algorithms, technologies, tools, and software packages to support data analysis involving practical scenarios;

C. Be proficient with at least one data analytics software package.

D. Be aware of issues related to comouter and data security
Method of teaching and learning 
1. Introduction to Big Data Analytics

A. Data and data types;

B. Big data and Types

C. Big data analytics

2. BDA Process and Tasks

3. BDA platforms and tools

A. Hadoop and MapReduce

B. Spark

C. NoSQL and MongoDB

4. BDA methods and algorithms

A. Data preparation

B. Descriptive Data analyses

C. Explorative data analyses

D. Predictive data analyses

E. Prescriptive Data analyses

5. Computer amd Data Security

A Data Assurance

B Machine learning and Data Mining

C State-of-the-art technologies
1. Introduction to Big Data Analytics (6 lectures)

• What is Big data analytics (Advanced analytic techniques operate on big data sets). Differentiate with related concepts: such as: data mining, data analysis, data visualisation, statistics, SQL and data warehouse.

• What is Big data

i) Defining big data via three Vs.

ii) Data sources, data types and the value of the big data

iii) The evolution of the big data and the future of the big data

• What is advanced Data analytics Focuses on inference, the process, the tools of deriving a conclusion based on business environment and organisational goals;

i) Current state of big data analytics

ii) Advanced analytics: exploratory data analysis (EDA) and confirmatory data analysis (CDA);

2. Process and framework (6 lectures)

• Big data pipeline: acquisition, extraction, aggregation, modelling and interpretation.

• Big data source, Hunting for Data,Setting the Goal, Big Data Sources Growing, A Wealth of Public Information

• Realising Value, The Case for Big Data, The Rise of Big Data Options, With Choice Come Decisions

3. Technologies (5 lectures)

Big Data Acquisition, The Storage Dilemma, Bringing Structure to Unstructured Data, Processing Power

Analysis Algorithms, Data Analytics, Big Data and Compliance

, Compliance, Auditing, and Protection, The Intellectual Property Challenge

4. Systems and software (8 lectures)

• Requirements for a big data analytics solution

• Platforms: Hadoop and Hadoop Distributed File System (HDFS), Map-Reduce, CEP (Complex Event Processing) Streaming Analytics, SQL – related. Extreme SQL No-SQL, Clouds

• Approaches: Choosing among In-house, Outsourced, or Hybrid Approaches

5. Computer and Data Security (14 hours)

Information Assurane, Use of Machine learnining methods and Data Mininng for computer and data security, State-of-the-art Technologies:

•Detection of Malicious Executables

•Data Mining Applied to Intrusion Detection

•Intrusion Detection Alarm Clustering

•Behavioral Features for Network Anomaly Detection

•Cost-Sensitive Modeling for Intrusion Detection

•Data Cleaning and Enriched Representations for Anomaly Detection in System Calls

•Decision-Theoretic, Semi-Supervised Model for Intrusion Detection
Delivery Hours  
Lectures Seminars Tutorials Lab/Prcaticals Fieldwork / Placement Other(Private study) Total
Hours/Semester 28      26    96  150 


Sequence Method % of Final Mark
1 Final Exam 70.00
2 Lab Assessment Task1(Groupwork) 15.00
3 Lab Assessment Task2(Groupwork) 15.00

Module Catalogue generated from SITS CUT-OFF: 6/3/2020 1:49:48 AM