Unit of study

OCMP5310: Principles of Data Science

This unit focuses on understanding and applying relevant concepts, techniques, algorithms, and tools for the analysis, management and visualisation of data, with the goal of enabling the discovery of information and knowledge to guide effective decision making and to gain new insights from large data sets. To this end, it provides a broad introduction to data management, analysis, modelling and visualisation using the Python programming language. Topics covered include: development of custom software using the powerful, general-purpose Python scripting language; data collection, cleaning, pre-processing, and storage using various databases; exploratory data analysis to understand and profile complex data sets; mining unlabelled data to identify relationships, patterns, and trends; machine learning from labelled data to predict future outcomes; and communicating findings to varied audiences, including through effective data visualisations. Core data science content will be taught in the normal lecture and tutorial delivery mode. Python programming will be taught through an online learning platform in addition to the weekly face-to-face lectures and tutorials. The unit includes hands-on exercises covering the range of data science skills above.

Code OCMP5310
Academic unit Computer Science
Credit points 6
Prohibitions: COMP5310 or INFO3406
Assumed knowledge:
A good understanding of the relational data model and database technologies, as covered in ISYS2120 or COMP9120 (or an equivalent unit of study at another institution)

The learning outcomes for this unit will be available two weeks before the first day of teaching.

Unit outlines

Unit outlines will be available 1 week before the first day of teaching for the relevant session.

There are no unit outlines available online for previous years.