IB9PB-15 Machine Learning and Data Analytics
Introductory description
This module should provide you with the skills you need to design and manage a data science project, and utilize techniques from machine learning. Big Data Analytics is about taking the myriad of data created by all of us, and fashioning it into actionable information.
Module aims
The module will provide hands on experience to acquaint non-specialists with the current machine learning techniques in regression, classification and unsupervised learning using the R programming language.
Outline syllabus
This is an indicative module outline only to give an indication of the sort of topics that may be covered. Actual sessions held may differ.
Sessions 1 & 2
- base R and RStudio for reproducible research.
- Introduce some key parts of the
tidyverseincludingdplyrandggplot2. data.tables
Sessions 3 & 4 -- Regression
- Revision of regression
- Regression and model trees
- Penalised regression
Sessions 5 & 6 -- Classification
- k nearest neighbours
- decision trees and random forests
- support vector machines
- linear discriminant analysis.
Sessions 7 & 8 -- unsupervised learning
- k-means
- hierarchical agglomerative clustering
- natural language processing and latent Dirichlet analysis
Sessions 9 & 10 -- neural networks
- introducing production-ready systems H2O and TensorFlow for machine learning at scale.
- network analytics
Learning outcomes
By the end of the module, students should be able to:
- Understand research methods in big data analytics and computational social science
- Demonstrate understanding of results of research in big data analytics and computational social science
- Understand and identify links between big data resources and real world events
- Apply methods in data analytics and computational social science
Indicative reading list
Reading lists can be found in Talis
Subject specific skills
- Apply methods in data analytics and computational social science
- Perform example analyses
- Understand research methods in big data analytics
and computational social science - Demonstrate understanding of results of research in big data analytics and computational social science
- Understand and identify links between big data resources and real world events
Transferable skills
- Critically evaluate empirical research
- Preprocess data sets to allow their subsequent application to real world problems
- Visualise extensive data sets, applying methods which
both allow the visualisation consumer to ask their own questions, and methods which directly answer specific questions - Formulate hypotheses and apply statistical methods for their evaluation
- Demonstrate business relevant data analytics skills
- Write in an academically appropriate way
- Demonstrate confidence in discussing research results and its practical relevance in the real world
Study time
| Type | Required |
|---|---|
| Seminars | 10 sessions of 3 hours (20%) |
| Private study | 48 hours (32%) |
| Assessment | 72 hours (48%) |
| Total | 150 hours |
Private study description
self study and reflective learning
Costs
No further costs have been identified for this module.
You must pass all assessment components to pass the module.
Assessment group A
| Weighting | Study time | Eligible for self-certification | |
|---|---|---|---|
Assessment component |
|||
| Individual assignment | 100% | 72 hours | Yes (extension) |
Reassessment component is the same |
|||
Feedback on assessment
module leader feedback
There is currently no information about the courses for which this module is core or optional.