CS950-15 Foundations of Computational Data Analytics
Introductory description
This module introduces basic concepts and techniques for data analysis with diverse types, including tabular, text, time series, and geospatial data. Students will gain practical skills in data cleaning, dimensionality reduction, visualisation, and exploratory analysis, as well as methods and algorithms for managing and analysing real-world datasets efficiently.
Module aims
The module will provide students with a broad foundation in computational data analytics, equipping them with essential knowledge for further specialisation. Data Analytics is a core discipline within computer science, with increasing importance in the age of digital transformation and emerging technologies, with significant economic impact. Because of the highly interdisciplinary nature of data analytics, students will benefit from being able to pursue working in a wide range of application domains.
Outline syllabus
This is an indicative module outline only to give an indication of the sort of topics that may be covered. Actual sessions held may differ.
Overview of common data modalities: tabular, JSON, time series, text, networks, and geospatial data
Data cleaning and preprocessing: standardisation, type casting, outlier detection
Exploratory data analysis: descriptive statistics, pattern discovery
Improving data quality: handling missing data, noise filtering, class balancing
Dimensionality reduction: feature projection, embeddings, variance preservation
Data partitioning and sampling: train-test splits, cross-validation, stratified sampling
Data visualisation: matplotlib, seaborn, ggplot2, t-SNE, and geospatial mapping
Correlation and dependency analysis: covariance, mutual information, partial correlation
Scalable data analysis: 5 Vs, stream processing, MongoDB, PySpark, TensorFlow, cloud computing, data lakes and warehousing
Learning outcomes
By the end of the module, students should be able to:
- Apply data cleaning, preprocessing, and quality assessment techniques to prepare real-world behavioural datasets for analysis.
- Perform exploratory data analysis and visualisation to uncover patterns, anomalies, and structural insights in human behaviour.
- Implement methods for data integration, dimensionality reduction, and feature projection for behavioural modelling.
- Utilise scalable tools and frameworks to analyse and process large behavioural datasets.
Indicative reading list
Reading lists can be found in Talis
Research element
Coursework will include a research element.
Subject specific skills
in line with the learning objectives students will acquire skills in:
Applying data cleaning, preprocessing, and quality assessment techniques to prepare real-world behavioural datasets for analysis;
Performing exploratory data analysis and visualisation to uncover patterns, anomalies, and structural insights in human behaviour;
Implementing methods for data integration, dimensionality reduction, and feature projection for behavioural modelling;
Utilising scalable tools and frameworks to analyse and process large behavioural datasets.
Transferable skills
Being able to apply Data Analytics knowledge and understanding of specialist theoretical and methodological approaches, suggesting and incorporating interrelationships with other relevant disciplines in abstract and unpredictably complex contexts.
Students will obtain the cognitive skills to critically contribute to existing discourses and methodologies in Data Analytics, suggesting new ideas, and designing systematic studies in Data Analytics based on critical analysis and evaluation.
Students will obtain practical skills in organising and communicating information, improving interpersonal, team
and networking skills through engaging in classes and computer laboratories. Formative assessment will allow students to strategically enhance their own learning.
Data Analytics is an area with immediate relevance for increasing ethical awareness and its practical application regarding privacy concerns. The associated values will help understanding the importance of personal responsibility and ethical leadership.
Study time
| Type | Required |
|---|---|
| Lectures | 20 sessions of 1 hour (13%) |
| Supervised practical classes | 9 sessions of 1 hour (6%) |
| Private study | 29 hours (19%) |
| Assessment | 92 hours (61%) |
| Total | 150 hours |
Private study description
Private study, background reading and revision.
Costs
No further costs have been identified for this module.
You do not need to pass all assessment components to pass the module.
Assessment group C
| Weighting | Study time | Eligible for self-certification | |
|---|---|---|---|
| Foundations of Computational Data Analytics Coursework 1 | 20% | 25 hours | No |
|
The coursework will consist of developing computer programs to solve practical problems in computational data analytics. |
|||
| Foundations of Computational Data Analytics Coursework 2 | 30% | 30 hours | No |
|
The coursework is a 1500-2000 word essay that requires students to connect the module's content to behavioral science. |
|||
| Foundations of Computational Data Analytics Exam | 50% | 37 hours | No |
|
Written 2h exam covering the entire module content, timetabled in January.
|
|||
Assessment group R
| Weighting | Study time | Eligible for self-certification | |
|---|---|---|---|
| Foundations of Computational Data Analytics Resit Exam | 100% | No | |
|
|||
Feedback on assessment
Individual written feedback on coursework.
Past exam papers.
Courses
This module is Core for:
- Year 1 of TPSS-C803 Postgraduate Taught Behavioural and Data Science