Skip to main content Skip to navigation

CS950-15 Foundations of Computational Data Analytics

Department
Computer Science
Level
Taught Postgraduate Level
Module leader
Weiren Yu
Credit value
15
Module duration
10 weeks
Assessment
Multiple
Study location
University of Warwick main campus, Coventry

Introductory description

This module introduces basic concepts and techniques for data analysis with diverse types, including tabular, text, time series, and geospatial data. Students will gain practical skills in data cleaning, dimensionality reduction, visualisation, and exploratory analysis, as well as methods and algorithms for managing and analysing real-world datasets efficiently.

Module aims

The module will provide students with a broad foundation in computational data analytics, equipping them with essential knowledge for further specialisation. Data Analytics is a core discipline within computer science, with increasing importance in the age of digital transformation and emerging technologies, with significant economic impact. Because of the highly interdisciplinary nature of data analytics, students will benefit from being able to pursue working in a wide range of application domains.

Outline syllabus

This is an indicative module outline only to give an indication of the sort of topics that may be covered. Actual sessions held may differ.

Overview of common data modalities: tabular, JSON, time series, text, networks, and geospatial data

Data cleaning and preprocessing: standardisation, type casting, outlier detection

Exploratory data analysis: descriptive statistics, pattern discovery

Improving data quality: handling missing data, noise filtering, class balancing

Dimensionality reduction: feature projection, embeddings, variance preservation

Data partitioning and sampling: train-test splits, cross-validation, stratified sampling

Data visualisation: matplotlib, seaborn, ggplot2, t-SNE, and geospatial mapping

Correlation and dependency analysis: covariance, mutual information, partial correlation

Scalable data analysis: 5 Vs, stream processing, MongoDB, PySpark, TensorFlow, cloud computing, data lakes and warehousing

Learning outcomes

By the end of the module, students should be able to:

  • Apply data cleaning, preprocessing, and quality assessment techniques to prepare real-world behavioural datasets for analysis.
  • Perform exploratory data analysis and visualisation to uncover patterns, anomalies, and structural insights in human behaviour.
  • Implement methods for data integration, dimensionality reduction, and feature projection for behavioural modelling.
  • Utilise scalable tools and frameworks to analyse and process large behavioural datasets.

Indicative reading list

Reading lists can be found in Talis

Research element

Coursework will include a research element.

Subject specific skills

in line with the learning objectives students will acquire skills in:
Applying data cleaning, preprocessing, and quality assessment techniques to prepare real-world behavioural datasets for analysis;
Performing exploratory data analysis and visualisation to uncover patterns, anomalies, and structural insights in human behaviour;
Implementing methods for data integration, dimensionality reduction, and feature projection for behavioural modelling;
Utilising scalable tools and frameworks to analyse and process large behavioural datasets.

Transferable skills

Being able to apply Data Analytics knowledge and understanding of specialist theoretical and methodological approaches, suggesting and incorporating interrelationships with other relevant disciplines in abstract and unpredictably complex contexts.

Students will obtain the cognitive skills to critically contribute to existing discourses and methodologies in Data Analytics, suggesting new ideas, and designing systematic studies in Data Analytics based on critical analysis and evaluation.

Students will obtain practical skills in organising and communicating information, improving interpersonal, team
and networking skills through engaging in classes and computer laboratories. Formative assessment will allow students to strategically enhance their own learning.

Data Analytics is an area with immediate relevance for increasing ethical awareness and its practical application regarding privacy concerns. The associated values will help understanding the importance of personal responsibility and ethical leadership.

Study time

Type Required
Lectures 20 sessions of 1 hour (13%)
Supervised practical classes 9 sessions of 1 hour (6%)
Private study 29 hours (19%)
Assessment 92 hours (61%)
Total 150 hours

Private study description

Private study, background reading and revision.

Costs

No further costs have been identified for this module.

You do not need to pass all assessment components to pass the module.

Assessment group C
Weighting Study time Eligible for self-certification
Foundations of Computational Data Analytics Coursework 1 20% 25 hours No

The coursework will consist of developing computer programs to solve practical problems in computational data analytics.

Foundations of Computational Data Analytics Coursework 2 30% 30 hours No

The coursework is a 1500-2000 word essay that requires students to connect the module's content to behavioral science.

Foundations of Computational Data Analytics Exam 50% 37 hours No

Written 2h exam covering the entire module content, timetabled in January.


  • Answerbook Pink (12 page)
  • Students may use a calculator
Assessment group R
Weighting Study time Eligible for self-certification
Foundations of Computational Data Analytics Resit Exam 100% No
  • Answerbook Pink (12 page)
  • Students may use a calculator
Feedback on assessment

Individual written feedback on coursework.
Past exam papers.

Past exam papers for CS950

Courses

This module is Core for:

  • Year 1 of TPSS-C803 Postgraduate Taught Behavioural and Data Science