• Offered by Rsch Sch of Finance, Actuarial Studies & App Stats
• ANU College ANU College of Business and Economics
• Course subject Statistics
• Areas of interest Statistics
• Course convener
• AsPr Dale Roberts
• Mode of delivery In Person
• Co-taught Course
• Offered in Second Semester 2022
Big Data Statistics (STAT7017)

This course involves on campus teaching. For students unable to come to campus there will be a remote option. See the Class Summary for more details.

This research-led course provides an introduction to recent developments in Random Matrix Theory and Online Learning that addresses the challenges and opportunities posed by the availability of large amounts of data.

In the first instance, we will review some classic results from multivariate statistical theory, matrix analysis, and probability theory. Then we will present the salient statistical features of big data (e.g., heterogeneity, noise accumulation, spurious correlation, and incidental endogeneity) and show how this impacts on traditional statistical methods and theory.

We follow with an introduction to modern Random Matrix theory and its application in statistics. Applications presented may include topics such as high-dimensional statistical inference, large covariance matrices, large-scale statistical learning through subsampling, sparsification of large matrices, principal component analysis, and dimension reduction.

We conclude with an introduction to the theory of online learning (aka. sequential prediction) to handle the situation of streaming data.

Students will use and learn about the latest computational tools to work with big and streaming data sets. Example data sets may be drawn from areas such finance, web analytics, digital marketing, and satellite imagery data.

Learning Outcomes

Upon successful completion, students will have the knowledge and skills to:

1. Explain in detail how statistical features of big data impacts traditional statistical methods and theory;
2. Discuss in depth random Matrix theory and its application in statistics on large scale;
3. Critically discuss the theory of sequential prediction and management of streaming data; and
4. Demonstrate in detail the use of computational tools to work with big and streaming data sets.

Indicative Assessment

1. Typical assessment may include, but is not restricted to: exams, assignments, quizzes, presentations and other assessment as appropriate (100) [LO 1,2,3,4]

The ANU uses Turnitin to enhance student citation and referencing techniques, and to assess assignment submissions as a component of the University's approach to managing Academic Integrity. While the use of Turnitin is not mandatory, the ANU highly recommends Turnitin is used by both teaching staff and students. For additional information regarding Turnitin please visit the ANU Online website.

Students are expected to commit 130 hours of work in completing this course. This includes time spent in scheduled classes and self-directed study time.

Not applicable

Requisite and Incompatibility

To enrol in this course you must have completed STAT6039 or STAT6013, and have completed STAT6038 or STAT6014 or STAT7001. Incompatible with STAT3017.

Prescribed Texts

Information about the prescribed textbook will be available via the Class Summary.

Fees

Tuition fees are for the academic year indicated at the top of the page.

Commonwealth Support (CSP) Students
If you have been offered a Commonwealth supported place, your fees are set by the Australian Government for each course. At ANU 1 EFTSL is 48 units (normally 8 x 6-unit courses). More information about your student contribution amount for each course at Fees

Student Contribution Band:
1
Unit value:
6 units

If you are a domestic graduate coursework student with a Domestic Tuition Fee (DTF) place or international student you will be required to pay course tuition fees (see below). Course tuition fees are indexed annually. Further information for domestic and international students about tuition and other fees can be found at Fees.

Where there is a unit range displayed for this course, not all unit options below may be available.

Units EFTSL
6.00 0.12500

Course fees

Domestic fee paying students
Year Fee
2022 \$4200
International fee paying students
Year Fee
2022 \$6000
Note: Please note that fee information is for current year only.

Offerings, Dates and Class Summary Links

ANU utilises MyTimetable to enable students to view the timetable for their enrolled courses, browse, then self-allocate to small teaching activities / tutorials so they can better plan their time. Find out more on the Timetable webpage.

The list of offerings for future years is indicative only.
Class summaries, if available, can be accessed by clicking on the View link for the relevant class number.

Second Semester

Class number Class start date Last day to enrol Census date Class end date Mode Of Delivery Class Summary
5696 25 Jul 2022 01 Aug 2022 31 Aug 2022 28 Oct 2022 In Person View