DATA1902: Informatics: Data and Computation (Advanced)

2026 unit information

This unit covers computation and data handling, integrating sophisticated use of existing productivity software (e.g. spreadsheets) with the development of custom software using the general-purpose Python language. It will focus on skills directly applicable to data-driven decision-making. Students will see examples from many domains, and be able to write code to automate the common processes of data science, such as data ingestion, format conversion, cleaning, summarization, and the creation and application of a predictive model. This unit includes the content of DATA1002, along with additional, more sophisticated topics suited to students with high academic achievement.

Unit details and rules

Managing faculty or University school:

Engineering

Details

Study level	Undergraduate
Academic unit	Computer Science
Credit points	6

Enrolment rules

Prerequisites:	None
Corequisites:	None
Prohibitions:	INFO1903 or DATA1002
Assumed knowledge:	This unit is intended for students with an ATAR at least sufficient for entry to the BSc/BAdvStudies(Advanced) stream, or for those who gained Distinction results or better in some unit in Data Science, Mathematics, or Computer Science. Students with a portfolio of high-quality relevant prior work can also be admitted.

Learning outcomes

At the completion of this unit, you should be able to:

LO1. automate a computational process, when given a clear account of the algorithm to be applied (to be done by writing Python programs with core techniques of procedural programming)
LO2. demonstrate knowledge of Python syntax and semantics, to trace and understand idiomatic code typical of data science activities, including features such as user-defined functions and exception raising and handling
LO3. understand automation of the computational processes needed for examples of the various activities in the data science pipeline: data ingestion and cleaning, data format conversion, data summarization, visual and tabular presentation of the results from summarization, creation of a predictive model of a given form, application of a predictive model to new data, evaluation of a predictive model (and also, automation of a pipeline that scripts use of existing tools for these activities)
LO4. understand Python programs that automatically perform the computational processes of data science, and be aware of the similarities and differences between tools
LO5. understand the main issues for data management in connection with data science activities, including the value of data, the importance of metadata, and issues when sharing data across time and users
LO6. understand how data sets are represented in computer files, in particular the many-to-many relationship between the physical representation and the logical representation, and the advantages and disadvantages of different representations
LO7. understand principles of charting and information presentation, be able to produce good charts using Python libraries, and be able to evaluate charts for effectiveness in communication
LO8. understand principles of machine learning and its role in data science, in particular the creation, use, and limitations of predictive models for regression and classification tasks, issues of over-fitting and under-fitting, and evaluation of models
LO9. understand the principles and cautions on the use of GenAI in the Data Science lifecycle
LO10. understand the basic mathematics related to machine learning, such as linear algebra and statistics
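
As an illustration only, and not taken from the unit's teaching material, the pipeline activities named in the learning outcomes (ingestion, cleaning, model creation, application and evaluation) can be sketched in a few lines of standard-library Python. The inline data, the column names `hours` and `score`, and every function name below are hypothetical, and the choice of a straight-line least-squares model is an assumption made for brevity.

```python
import csv
import io
import statistics

# Hypothetical inline data standing in for a CSV file; one row has a missing score.
RAW_CSV = """hours,score
1,52
2,55
3,
4,61
5,64
"""


def ingest(text):
    """Data ingestion: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Cleaning: keep only rows whose fields both convert to float."""
    cleaned = []
    for row in rows:
        try:
            cleaned.append((float(row["hours"]), float(row["score"])))
        except (ValueError, TypeError):
            # float("") raises ValueError for the missing score; skip that row.
            continue
    return cleaned


def fit_line(points):
    """Model creation: least-squares fit of score = intercept + slope * hours."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


def predict(model, x):
    """Model application: predict a score for a new value of hours."""
    intercept, slope = model
    return intercept + slope * x


def mean_absolute_error(model, points):
    """Model evaluation: average absolute gap between prediction and truth."""
    return statistics.mean([abs(predict(model, x) - y) for x, y in points])


if __name__ == "__main__":
    points = clean(ingest(RAW_CSV))
    model = fit_line(points)
    print("rows kept after cleaning:", len(points))
    print("intercept, slope:", model)
    print("predicted score for 3 hours:", predict(model, 3))
    print("error on the fitting data:", mean_absolute_error(model, points))
```

The error here is measured on the same rows used for fitting, which only shows that the code runs. A meaningful evaluation would use data held back from fitting, which is where the over-fitting and under-fitting issues in LO8 become visible.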

Unit availability

This section lists the sessions, attendance modes and locations in which the unit is available. There is a unit outline for each availability, which gives you information about the unit, including assessment details and a schedule of weekly activities.

The outline is published 2 weeks before the first day of teaching. You can look at previous outlines for a guide to the details of a unit.

Current year

Session	MoA	Location	Outline
Semester 2 2026	Normal day	Camperdown/Darlington, Sydney	View

Previous years

Session	MoA	Location	Outline
Semester 2 2020	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2021	Normal day	Remote	View
Semester 2 2022	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2022	Normal day	Remote	View
Semester 2 2023	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2024	Normal day	Camperdown/Darlington, Sydney	View
Semester 2 2025	Normal day	Camperdown/Darlington, Sydney	View

Find your current year census dates

Modes of attendance (MoA)

This refers to the Mode of attendance (MoA) for the unit as it appears when you’re selecting your units in Sydney Student. Find more information about modes of attendance on our website.

Important enrolment information

Departmental permission requirements

If you see the ‘Departmental Permission’ tag below a session, it means you need faculty or school approval to enrol. This may be because it’s an advanced unit, clinical placement, offshore unit or internship, or because there are limited places available.

You will be prompted to apply for departmental permission when you select this unit in Sydney Student.

Read our information on departmental permission.

Additional advice

This unit requires departmental permission to ensure the appropriate academic standard is met. Students must have obtained an ATAR of 95 or equivalent. Please attach your previous results (high school or non-University of Sydney tertiary study) to your permission request for review by the Faculty.

Disclaimer

Important: the University of Sydney regularly reviews units of study and reserves the right to change the units of study available annually. To stay up to date on available study options, including unit of study details and availability, refer to the relevant handbook.

To help you understand common terms that we use at the University, we offer an online glossary.