To main content
 
MSc in Data Sciences
Digitalization is a phenomenon which now permeates nearly all human endeavor and most aspects of daily life, encompassing the spheres of education, finance, health, industry, and culture.

It is now possible to collect data on the scale of Exabytes (1 billion gigabytes), from an incredibly wide range of digital sources, such as cameras, sensors (thermal, light, motion), smart phones, and medical devices. In addition to the data sources, we have enormous volumes of the routine transactional data of digitalized daily life, data that was previously uncollected or, if collected, ignored.

It is not just the data but the meta-data about the data, including the details of how, when, and where it was collected, that facilitates the identification of patterns, and, conversely, anomalies, in ways that are increasingly useful. This phenomenon of high-volume data collection is motivating the creation of advanced analytic techniques that can achieve results across disciplinary boundaries.

What began as “Big Data” has become “Data Science, enabling significant advances in applications ranging from health care (improved diagnostics and outcomes) to energy consumption (smart homes, factories, and cities). The ubiquity of digital devices is greatly expanding both the quality and quantity of data generated, with an effect amplified by the rapid emergence and utilization of the IPv6 protocols and the resulting plethora of “internet-of-things” applications.

The exponential growth in data sources and volume has presented non-trivial challenges to every step of data management: collection, storage, validation, preservation, transmission, access, analytics, and also raises new challenges to the related issues of anonymity, privacy, and security.

Data Science is the scientific discipline that covers the full range of the data life cycle. It includes both theoretical and practical methods for organizing, processing, and analyzing the data and transforming data into information and, increasingly, information into actionable “intelligence”.

In this regard, “intelligence” is feasible through major innovations in the application areas of Artificial Intelligence, Machine Learning, and Deep Learning, which are benefitting from new models of cognition and learning, and substantial improvements in computing resources and methods to manage computational complexity.

The result is that Data Science is achieving results that were previously conceived only speculatively or in the literature of science fiction. As but a single example, a machine recently taught itself chess over a weekend, and proceeded to defeat the top human players in the world.

Similar demonstrations are emerging in more practical application areas, such as medical imaging diagnostics, where AI analytics are achieving more accurate recognition rates than human experts.

The field of Data Science is a rapidly growing interdisciplinary specialty that is directly relevant to the national
development priorities for Kazakhstan.

The Master of Science in Data Science at Nazarbayev University will provide in-depth education in Data Science, incorporating key concepts from multiple fields. The program includes the following subject areas: Databases, Data Mining, Big Data, Business Analytics, Artificial Intelligence, Information Retrieval, Machine Learning, Deep Learning, Image and Video Processing, Bioinformatics, Cybersecurity, Data Analysis and Visualization, Mathematical and Statistical Modeling, Data Storage and Processing Infrastructures, and Cloud-based Solutions.

Aims and Objectives

The Master of Science in Data Science is a two-year full-time program (120 ECTS credits) at the School of Engineering and Digital Sciences (SEDS) of Nazarbayev University.

The first term of study is designed to provide a foundation in the fundamentals of the field, and to provide a baseline for all incoming admissions streams from several related but distinct fields of undergraduate study.

The program includes discipline core courses, and a set of topical electives that provide the continuity of study across the two years. The discipline courses are augmented by an arc of courses that provide milestones for program advancement and completion of the Master Thesis Project. This arc consists of a course in Research Methods, Research Seminar, Thesis Proposal, and then a final term to conclude and defend the work.

The program is aligned with professional society guidance from the ACM and IEEE disciplinary societies, and incorporates specific topical coverage as requested from Kazakh government and industry partners. The program provides a framework for collaboration with partners from education, government, and industry, so as to align opportunities for targeted research and collaboration on projects related to national development priorities as articulated in a series of national strategy documents such as Digital Kazakhstan, Kazakhstan 2030, the 100 Steps, and Kazakhstan 2050.

The Data Science program is unique in Kazakhstan, due to its interdisciplinary focus and a pedagogical approach which integrates research and emphasizes innovation. Starting from the first semester students interact closely with faculty. Graduates of the program are trained to become data science professionals prepared to enter careers in industry, government, or education.

Total Number of Credits: 120 ECTS
Year 1: Fall Semester (Semester 1)
TYPE COURSE CODE & TITLE ECTS
Core DS 501 FUNDAMENTALS OF DATA SCIENCE 6
Core DS 502 PROBABILITY AND STATISTICS FOR DATA SCIENCE 6
Core DS 507 DATABASE MANAGEMENT SYSTEMS 6
Innovation DS 551 PROCESS AND PROJECT MANAGEMENT 6
Research SEDS 591 RESEARCH METHODS 6

Year 1: Spring Semester (Semester 2)

TYPE COURSE CODE & TITLE ECTS
Core DS 504 DATA MINING AND DECISION SUPPORT 6
Core CSCI 545 BIG DATA ANALYTICS 6
Research SEDS 592 RESEARCH SEMINAR 6
Elective ELECTIVE 1 - SEDS 502 TEACHING PRACTICUM (RECOMMENDED) 6
Elective ELECTIVE 2 6

Year 2: Fall Semester (Semester 3)

TYPE COURSE CODE & TITLE ECTS
Research DS 693 THESIS PROPOSAL 6
Innovation DS 552 DATA DRIVEN INNOVATION 6
Elective ELECTIVE 3 - MATH 540 STATISTICAL LEARNING (RECOMMENDED) 6
Elective ELECTIVE 4 6
Elective ELECTIVE 5 6

Year 2: Spring Semester (Semester 4)

TYPE COURSE CODE & TITLE ECTS
Research DS 694 THESIS 30
Program Core Courses
Elective Courses Descriptions