Module Catalogues, Xi'an Jiaotong-Liverpool University   
 
Module Code: CSE313
Module Title: Big Data Analytics
Module Level: Level 3
Module Credits: 5.00
Academic Year: 2019/20
Semester: SEM1
Originating Department: Computer Science and Software Engineering
Pre-requisites: N/A
   
Aims
• To introduce the environment and the main application domains where Big Data Analytics (BDA) takes place;

• To introduce general framework and process of BDA;

• To study technologies, and platforms and , tools that are currently used in BDA;

• To study methods and algorithms that support BDA;

• To gain an understanding of the best practice in BDA.


Learning outcomes 
A. Demonstrate a solid understanding of concepts, processes and issues related to Big Data Analytics (BDA);


B. Identify applications of BDA that can help improve business operations;

C. Determine the appropriate use of analysis methods, algorithms, technologies, tools, and software packages to support data analysis involving practical scenarios;

D. Be proficient with at least one data analytics software package.

Method of teaching and learning 
1. Introduction to Big Data Analytics (4 lectures)

A. Data and data types;

B. Big data and Types

C. Big data analytics


2. BDA Process and Tasks (2 lectures)

3. BDA platforms and tools (4 lectures + 8 tutorials)

A. Hadoop and MapReduce

B. Spark

C. NoSQL and MongoDB


4. BDA methods and algorithms (18 lectures)

A. Data preparation

B. Descriptive Data analyses

C. Explorative data analyses

D. Predictive data analyses

E. Prescriptive Data analyses

5. Best business practice (4 lectures)

Start Small with Big Data, Thinking Big, Avoiding Worst Practices, Hands-on Big Data, Big Data Visualisation, ownership and security

Syllabus 
1. Introduction to Big Data Analytics (6 lectures)

• What is Big data analytics (Advanced analytic techniques operate on big data sets). Differentiate with related concepts: such as: data mining, data analysis, data visualisation, statistics, SQL and data warehouse.

• What is Big data

i) Defining big data via three Vs.

ii) Data sources, data types and the value of the big data

iii) The evolution of the big data and the future of the big data

• What is advanced Data analytics Focuses on inference, the process, the tools of deriving a conclusion based on business environment and organisational goals;

i) Current state of big data analytics

ii) Advanced analytics: exploratory data analysis (EDA) and confirmatory data analysis (CDA);


2. Process and framework (6 lectures)

• Big data pipeline: acquisition, extraction, aggregation, modelling and interpretation.

• Big data source, Hunting for Data,Setting the Goal, Big Data Sources Growing, A Wealth of Public Information

• Realising Value, The Case for Big Data, The Rise of Big Data Options, With Choice Come Decisions


3. Technologies (15 lectures)

Big Data Acquisition, The Storage Dilemma, Bringing Structure to Unstructured Data, Processing Power

Analysis Algorithms, Data Analytics, Big Data and Compliance

Advanced data analysis:

i) Classification

ii) Association analysis: Link analysis and PageRanking

iii) Clustering

iv) Advertisement on Web

v) Recommendation on Web

Security, Compliance, Auditing, and Protection, The Intellectual Property Challenge


4. Systems and software (8 lectures)

• Requirements for a big data analytics solution

• Platforms: Hadoop and Hadoop Distributed File System (HDFS), Map-Reduce, CEP (Complex Event Processing) Streaming Analytics, SQL – related. Extreme SQL No-SQL, Clouds

• Approaches: Choosing among In-house, Outsourced, or Hybrid Approaches


Delivery Hours  
Lectures Seminars Tutorials Lab/Prcaticals Fieldwork / Placement Other(Private study) Total
Hours/Semester 28      26    96  150 

Assessment

Sequence Method % of Final Mark
1 Written Examination 70.00
2 Lab Assessment Task1 15.00
3 Lab Assessment Task2 15.00

Module Catalogue generated from SITS CUT-OFF: 7/21/2019 3:17:18 PM