Unit of study_

# STAT3888: Statistical Machine Learning

### 2024 unit information

Data Science is an emerging and inherently interdisciplinary field. A key set of skills in this area fall under the umbrella of Statistical Machine Learning methods. This unit presents the opportunity to bring together the concepts and skills you have learnt from a Statistics or Data Science major, and apply them to a joint project with NUTM3888 Metabolic Cybernetics where Statistics and Data Science students will form teams with Nutrition students to solve a real world problem using Statistical Machine Learning methods. The unit will cover a wide breadth of cutting edge supervised and unsupervised learning methods will be covered including principal component analysis, multivariate tests, discrimination analysis, Gaussian graphical models, log-linear models, classification trees, k-nearest neighbours, k-means clustering, hierarchical clustering, and logistic regression. In this unit, you will continue to understand and explore disciplinary knowledge, while also meeting and collaborating through project-based learning; identifying and solving problems, analysing data and communicating your findings to a diverse audience. All such skills are highly valued by employers. This unit will foster the ability to work in an interdisciplinary team, and this is essential for both professional and research pathways in the future.

## Unit details and rules

#### Science

Study level Undergraduate Mathematics and Statistics Academic Operations 6
 Prerequisites: ? STAT2X11 and (DATA2X02 or STAT2X12) None STAT3914 or STAT3014 STAT3012 or STAT3912 or STAT3022 or STAT3922

At the completion of this unit, you should be able to:

• LO1. apply disciplinary knowledge in statistics and data science to solve problems in an interdisciplinary context (nutrition)
• LO2. find, define, and delimit authentic problems in order to address them
• LO3. create an investigation strategy, explore solutions, discuss approaches, and predict outcomes
• LO4. apply, formulate, interpret, and compare statistical machine learning methods including (wherever relevant) evaluation of model appropriateness
• LO5. demonstrate integrity, confidence, personal resilience, and the capacity to manage challenges, both individually and in teams
• LO6. collaborate with diverse groups across cultural and disciplinary boundaries to develop solution(s) to the project problems
• LO7. communicate project outcomes effectively to a broad audience
• LO8. identify appropriate machine learning problems to a particular problem, and judge the appropriateness of model evaluation procedures.

## Unit availability

This section lists the session, attendance modes and locations the unit is available in. There is a unit outline for each of the unit availabilities, which gives you information about the unit including assessment details and a schedule of weekly activities.

The outline is published 2 weeks before the first day of teaching. You can look at previous outlines for a guide to the details of a unit.

Session MoA   Location Outline
Semester 2 2024
Normal day Camperdown/Darlington, Sydney
Session MoA   Location Outline
Semester 2 2020
Normal day Camperdown/Darlington, Sydney
Semester 2 2021
Normal day Camperdown/Darlington, Sydney
Semester 2 2021
Normal day Remote
Semester 2 2022
Normal day Camperdown/Darlington, Sydney
Semester 2 2022
Normal day Remote
Semester 2 2023
Normal day Camperdown/Darlington, Sydney

Find your current year census dates

### Modes of attendance (MoA)

This refers to the Mode of attendance (MoA) for the unit as it appears when you’re selecting your units in Sydney Student. Find more information about modes of attendance on our website.