STAT4027: Advanced Statistical Modelling

2026 unit information

Applied Statistics fundamentally brings statistical learning to the wider world. Some data sets are complex due to the nature of their responses or predictors or have high dimensionality. These types of data pose theoretical, methodological and computational challenges that require knowledge of advanced modelling techniques, estimation methodologies and model selection skills. In this unit you will investigate contemporary model building, estimation and selection approaches for linear and generalised linear regression models. You will learn about two scenarios in model building: when an extensive search of the model space is possible; and when the dimension is large and either stepwise algorithms or regularisation techniques have to be employed to identify good models. These particular data analysis skills have been foundational in developing modern ideas about science, medicine, economics and society and in the development of new technology and should be in the toolkit of all applied statisticians. This unit will provide you with a strong foundation of critical thinking about statistical modelling and technology and give you the opportunity to engage with applications of these methods across a wide scope of applications and for research or further study.

Unit details and rules

Managing faculty or University school:

Science

Details

Study level	Undergraduate
Academic unit	Mathematics and Statistics Academic Operations
Credit points	6

Enrolment rules

Prerequisites: ?	(STAT3X12 or STAT3X22 or STAT4022) and (STAT3X13 or STAT3X23 or STAT4023)
Corequisites: ?	None
Prohibitions: ?	None
Assumed knowledge: ?	A three year major in statistics or equivalent including familiarity with material in DATA2X02 and STAT3X22 (applied statistics and linear models) or equivalent

Learning outcomes

At the completion of this unit, you should be able to:

LO1. Apply inference methods to estimate the model parameters. These methods include maximum likelihood, expectation maximumisation, iterative re-weighted least square, M-estimation, quasi-likelihood method and generalised estimating equation.
LO2. Understand the idea of generalised linear models and exponential family to model counts, binary data, and data with a positive domain.
LO3. Apply the different modeling strategies to describe the location of a data distribution including generalised additive model, regime switching, quantile, mixture and state space model.
LO4. Analyse survival data with censoring using Kaplan Meier model and perform regression using proportional hazard with Weibull, piece-wise exponential hazard and Cox's proportional hazard models.
LO5. Perform regression for count data allowing for different levels of dispersion using mixture model and Poisson, negative binomial and generalised Poisson distributions as well as allowing for zero inflation using zero-inflated and hurdle models.
LO6. Perform regression for binary data using logit, probit and complementary log-log link functions. Understand the properties of these models and goodness-of-fit. Apply Fisher exact test to 2x2 contingency table and measure association between two binary variables.
LO7. Perform regression for multinominal data in contingency table with different experimental designs using log-linear model and two logit structures: multinominal and hierarchical. Explore the relationship with Poisson and binominal regressions. Interpret the types of assoication for different log-linear models. Study special cases of collapsing table, decomposable table, incomplete table, symmetric (and quasi-symmetric) table and marginal homogenous table.
LO8. Perform regression for ordinal data using order logit link.
LO9. Perform beta regression for rate data.

Unit availability

This section lists the session, attendance modes and locations the unit is available in. There is a unit outline for each of the unit availabilities, which gives you information about the unit including assessment details and a schedule of weekly activities.

The outline is published 2 weeks before the first day of teaching. You can look at previous outlines for a guide to the details of a unit.

Current year
Previous years

Session	MoA ?	Location	Outline ?
Semester 2 2026	Normal day	Camperdown/Darlington, Sydney	Outline unavailable

Session	MoA ?	Location	Outline ?
Semester 2 2020	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2021	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2021	Normal day	Remote	View
Semester 2 2022	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2022	Normal day	Remote	View
Semester 2 2023	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2024	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2025	Normal day	Camperdown/Darlington, Sydney	View

Find your current year census dates

Modes of attendance (MoA)

This refers to the Mode of attendance (MoA) for the unit as it appears when you’re selecting your units in Sydney Student. Find more information about modes of attendance on our website.

Disclaimer

Important: the University of Sydney regularly reviews units of study and reserves the right to change the units of study available annually. To stay up to date on available study options, including unit of study details and availability, refer to the relevant handbook.

To help you understand common terms that we use at the University, we offer an online glossary.