Statistical Data Science
The MSc in Statistical Data Science is excellent preparation for careers in any field requiring a strong background in data science, with particular emphasis on statistics.
Accreditation

The MSc in Statistical Data Science is excellent preparation for careers in any field requiring a strong background in data science, with particular emphasis on statistics.
This course trains data scientists for posts in any of the multitude of companies and organisations that make use of data. Core modules give a thorough grounding in modern applications of statistics to data science and there is the opportunity to choose additional topics to match your interests and future career plans.
The School has a strong reputation for world-class research and a well-established system of support and training with a high level of flexibility, and frequent contact between staff and research students. Postgraduate students develop analytical, communication and research skills. Developing computational skills and applying them to mathematical problems forms a significant part of the postgraduate training in the School, setting you up for a great career.
A good 2:2, 2:1 or First Class degree, with a substantial amount of mathematics at university level.
All applicants are considered on an individual basis and additional qualifications, professional qualifications and relevant experience may also be taken into account when considering applications.
Please see our International Student website for entry requirements by country and other relevant information. Due to visa restrictions, students who require a student visa to study cannot study part-time unless undertaking a distance or blended-learning programme with no on-campus provision.
The University requires all non-native speakers of English to reach a minimum standard of proficiency in written and spoken English before beginning a postgraduate degree. Certain subjects require a higher level.
For detailed information see our English language requirements web pages.
Please note that if you are required to meet an English language condition, we offer a number of pre-sessional courses in English for Academic Purposes through Kent International Pathways.
The following modules are indicative of those offered on this programme. This list is based on the current curriculum and may change year to year in response to new curriculum developments and innovation. Most programmes will require you to study a combination of compulsory and optional modules. You may also have the option to take modules from other programmes so that you may customise your programme and explore other subject areas that interest you.
The curriculum introduces (and revises for some students) the essentials of probability and classical (frequentist) statistical inference.
Probability: review of elementary probability, concept of random variable, discrete and continuous probability distributions, cumulative distribution function, expectation and variance, joint distributions, marginal and conditional distributions, generating functions and transformation of random variables.
Statistics: sampling distributions, point estimation, method of moment and maximum likelihood estimation, confidence intervals, hypothesis testing, association between variables and linear regression.
Linear model. Least squares. General linear model; simple and multiple regression, polynomial regression. Model selection, residuals, outliers, diagnostics. Analysis of variance. Generalised linear model.
Discrete data analysis. Review of Binomial, Poisson, negative binomial and multinomial distributions. Properties, estimation, hypothesis tests.
Contingency tables. Tests for independence. Measures of association. Logistic models.
Multidimensional tables. Log–linear models; fitting and model selection.
Bayes Theorem for density functions; Conjugate models; Predictive distribution; Bayes estimates; Sampling density functions; Gibbs and Metropolis-Hastings samplers; Winbugs/OpenBUGS; Bayesian hierarchical models; Bayesian model choice; Objective priors; Exchangeability; Choice of priors; Applications of hierarchical models.
Introduction: Machine learning and data visualisation with R.
Classification and prediction: Generalised linear model (GLM), linear discrimination analysis (LDA), k-nearest neighbors (KNN). R-based worked examples.
Resampling methods: Cross-validation (CV) and bootstrap. R-based worked examples.
Regression tree-based methods: Classification and regression trees (CART), bagging, random forests and boosting. R-based worked examples.
Support vector machines (SVM): Support vector classifier, regression SVM. R-based worked examples.
Machine Learning in Action:
(a) Biomedical and health data analysis;
(b) Bond default data analysis;
(c) Insurance data analysis;
(d) Financial data analysis;
(e) Other big data analysis.
Multivariate normal distribution, Inference from multivariate normal samples, principal component analysis, mixture models, factor analysis, clustering methods, discrimination and classification, graphical models, the use of appropriate software.
Statistics methods contribute significantly to areas such as biology, ecology, sociology and economics. The real data collected does not always follow standard statistical models. This module looks at modern statistical models and methods that can be utilised for such data, making use of R programs to execute these methods.
Indicative module content: Motivating examples; model fitting through maximum likelihood for specific examples; function optimization methods; profile likelihood; score tests; Wald tests; confidence interval construction; latent variable models; EM algorithm; mixture models; simulation methods; importance sampling; kernel density estimation; Monte Carlo inference; bootstrap; permutation tests; R programs.
In addition, for level 7 students: advanced EM algorithm methods, advanced simulation methods, writing R programs for advanced methods and applications.
This is a practical module to develop the skills required by a professional statistician (report writing, consultancy, presentation, wider appreciation of assumptions underlying methods, selection and application of analysis method, researching methods).
Software: R, SPSS and Excel (where appropriate/possible). Report writing in Word. PowerPoint for presentations.
In addition, for level 7 students:
The module enables students to undertake an independent piece of work in a particular area of statistics, or statistical finance/financial econometrics and to write a coherent account of the material.
There is no specific syllabus for this module.
Introduction Deeplearning with python.
Backgrounds of python software: Basic knowledge of python, as wellas python deep learning library, for example Keras, Tensorflow.
Fundamentals of deep learning: What is deep learning? Concept ofneural network.
Neural network: Feed Forward Neural Networks (FFNN), ConvolutionalNeural Networks (CNN), Recurrent Neural Networks (RNN), Deep SequenceModelling, Deep Generative Models.
Algorithms: Backpropagation, Stochastic Gradient Descent (SGD).
Applications: Image Recognition, Mortality Forecasting, Insuranceand Finance.
This module is designed to cover: Ethics and compliance of data science. Impact of international regulations. Appropriate handling of data. Simple random sampling. Sampling for proportions and percentages. Estimation of sample size. Stratified sampling. Systematic sampling. Cluster sampling. Data streams. Finding frequentist items. Estimating the number of distinct elements. Sparse recovery. Weight-based sampling. Real time analytics. Network data: Density, clustering coefficient, centrality and degree distribution.
The module enables students to undertake an independent piece of work in a particular area of statistics, or statistical finance/financial econometrics and to write a coherent account of the material.
There is no specific syllabus for this module.
Duration: 1 year full-time
You undertake a substantial project in statistical data science, supervised by an experienced researcher. Some projects are focused on the analysis of particular complex data sets, motivated by the supervisor's current research projects, while others focus more on methodology.
You gain experience of analysing real data problems through practical classes and exercises. The programme includes training in the computer language R.
Assessment is through coursework involving: complex theoretical questions, analysis of real-world data using appropriate computing packages over a range of areas of application; written unseen examinations; dissertation.
This programme aims to:
Kent’s Computing Service central facility runs Windows. Within the School, postgraduate students can use a range of UNIX servers and workstations. Packages available include R, MATLAB, SPSS, STAN and Python.
Staff publish regularly and widely in journals, conference proceedings and books. Among others, they have recently contributed to: Annals of Statistics; Biometrics; Biometrika; Journal of Royal Society, Series B; Statistics and Computing. Details of recently published books can be found within our staff research interests.
All students registered for a taught Master's programme are eligible to apply for a place on our Global Skills Award Programme. The programme is designed to broaden your understanding of global issues and current affairs as well as to develop personal skills which will enhance your employability.
There has been research in the area of statistical ecology at Kent for many years. We are part of the National Centre for Statistical Ecology (NCSE), which was established in 2005.
The research conducted in this area at Kent is mainly on Bayesian variable selection, Bayesian model fitting, Bayesian nonparametric methods, Markov chain Monte Carlo with applications.
Research is focused on statistical modelling and inference in biology and genetics with applications in complex disease studies. Over the past few decades, large amounts of complex data have been produced by high through-put biotechnologies. The grand challenges offered to statisticians include developing scalable statistical methods for extracting useful information from the data, modelling biological systems with the data, and fostering innovation in global health research.
This theme encompasses both theory and applications. Theory is involved with supervised and unsupervised learning, matrix factorisation, modelling of high-dimensional time series, differential privacy, deep learning and networks, shape analysis and statistics on manifolds, and neuroimaging. Applications in biology, industry, medicine and psychiatry. Often new computational methods are the key to analysing complex big data problems.
In order to describe the data, it is common in statistics to assume a specific probability model. Unfortunately, in many practical applications (for instance in economics, population genetics and social networks) it is not possible to identify a specific structure for the data. Nonparametric methods provide statistical tools for addressing inference in these situations.
At Kent there is particular interest in the use of nonparametric methods including quantile regression and Bayesian nonparametric approaches. Application areas include modelling of the business cycle and capacity utilisation, calculating sovereign credit ratings, modelling of stock return data, and predicting inflation.
Students often go into careers as professional statisticians or data scientists in industry, government, research and teaching but our programmes also prepare you for careers in other fields requiring a strong statistical background. You have the opportunity to attend careers talks from professional statisticians working in industry and to attend networking meetings with employers.
Our graduates have started careers in diverse areas such as the pharmaceutical industry, financial services and sports betting.
The taught programmes in Statistical Data Science provide exemption from the professional examinations of the Royal Statistical Society and qualification for Graduate Statistician status.
The 2024/25 annual tuition fees for this course are:
For details of when and how to pay fees and charges, please see our Student Finance Guide.
For students continuing on this programme fees will increase year on year by no more than RPI + 3% in each academic year of study except where regulated.* If you are uncertain about your fee status please contact information@kent.ac.uk.
The University will assess your fee status as part of the application process. If you are uncertain about your fee status you may wish to seek advice from UKCISA before applying.
For details of when and how to pay fees and charges, please see our Student Finance Guide.
For students continuing on this programme, fees will increase year on year by no more than RPI + 3% in each academic year of study except where regulated.*
The University will assess your fee status as part of the application process. If you are uncertain about your fee status you may wish to seek advice from UKCISA before applying.
Find out more about general additional costs that you may pay when studying at Kent.
We have a range of subject-specific awards and scholarships for academic, sporting and musical achievement.
Supporting your success
Kent ranked top 50 in The Complete University Guide 2024.
Fees and funding
Kent has risen 11 places in THE’s REF 2021 ranking, confirming us as a leading research university.
Accommodation