Unit of study_

COMP5046: Natural Language Processing

2026 unit information

This unit introduces the computational methods, methodological concepts, and theoretical ideas that underpin modern methods for processing natural language (such as English) using computers. NLP is used in a wide range of applications, including information retrieval and extraction; question answering; machine translation; code generation; dialogue; and classifying and clustering of documents. To achieve these, NLP systems perform a range of tasks using mathematical representations of language. In the context of these applications, the unit will explore common modelling methods, including heuristics, linear models, and neural networks, up to and including large language models (LLMs). The unit covers key ideas relevant to NLP from machine learning, statistics, linguistics, and data science. Students will implement NLP systems and evaluation metrics in labs and assignments. The unit will also investigate the annotation process for creating evaluation data for NLP systems. Students will annotate data as part of completing a real-world NLP task.

Unit details and rules

Managing faculty or University school:

Engineering

Study level Postgraduate
Academic unit Computer Science
Credit points 6
Prerequisites:
? 
None
Corequisites:
? 
None
Prohibitions:
? 
COMP4446
Assumed knowledge:
? 
Knowledge of an OO programming language

At the completion of this unit, you should be able to:

  • LO1. apply basic linguistic knowledge to identify properties of text
  • LO2. understand the internal architecture of language models including the purpose of each component
  • LO3. implement and train machine learning based systems for solving natural language tasks
  • LO4. evaluate the performance of natural language processing systems
  • LO5. mplement and debug a large NLP system in a collaborative manner
  • LO6. annotate data using appropriate quality control methods
  • LO7. identify ethical concerns in NLP systems and ways to mitigate those issues

Unit availability

This section lists the session, attendance modes and locations the unit is available in. There is a unit outline for each of the unit availabilities, which gives you information about the unit including assessment details and a schedule of weekly activities.

The outline is published 2 weeks before the first day of teaching. You can look at previous outlines for a guide to the details of a unit.

Session MoA ?  Location Outline ? 
Semester 1 2026
Normal evening Camperdown/Darlington, Sydney
Session MoA ?  Location Outline ? 
Semester 1 2020
Normal day Camperdown/Darlington, Sydney
Semester 1 2021
Normal day Remote
Semester 1 2022
Normal evening Camperdown/Darlington, Sydney
Semester 1 2022
Normal evening Remote
Semester 1 2023
Normal evening Camperdown/Darlington, Sydney
Semester 1 2023
Normal evening Remote
Semester 1 2024
Normal evening Camperdown/Darlington, Sydney
Semester 1 2025
Normal evening Camperdown/Darlington, Sydney

Find your current year census dates

Modes of attendance (MoA)

This refers to the Mode of attendance (MoA) for the unit as it appears when you’re selecting your units in Sydney Student. Find more information about modes of attendance on our website.

Disclaimer

Important: the University of Sydney regularly reviews units of study and reserves the right to change the units of study available annually. To stay up to date on available study options, including unit of study details and availability, refer to the relevant handbook.

To help you understand common terms that we use at the University, we offer an online glossary.