Data Science and Artificial Intelligence, MSE

Penn’s Master of Science in Engineering (MSE) in Data Science & Artificial Intelligence (DATS & AI) prepares students for a wide range of data-driven and AI-powered careers, whether in technology and engineering, consulting, healthcare, science, policy-making, or understanding patterns in literature, art or communications.  Building on a shared foundation in statistics, optimization, machine learning, and algorithms, students can tailor their studies through two concentrations—Data Science, which includes coursework on data management and decision making, and Artificial Intelligence, which includes coursework on building systems that can perform tasks that require human intelligence.  The DATS & AI Program can typically be completed in one-and-a-half to two years. 

The degree also allows students to apply what they have learned to a number of different application areas through a thesis or practicum.  Potential areas of application can be explored through the many centers and institutes across campus – including trustworthy AI (the ASSET center), network science (the Warren Center for Network and Data Science), digital humanities (the Price Lab for Digital Humanities), biomedicine (the Institute for Biomedical Informatics), and public policy (the Penn Wharton Budget Model and the Annenberg Center for Public Policy) — as well as more traditional opportunities in Computer and Information Science and Electrical and Systems Engineering.


The degree and major requirements displayed are intended as a guide for students entering in the Fall of 2026 and later. Students should consult with their academic program regarding final certifications and requirements for graduation.


Curriculum

10 course units are required for the Data Science & Artificial Intelligence degree.
The ten required course units are divided into three categories: Core Courses, Concentrations, and Electives. (As long as the prerequisites for the courses are met, students can complete these courses in any sequence.)

Core Requirements (4 cu's)
Linear Algebra or Convex Optimization
CIS 5150Fundamentals of Linear Algebra and Optimization1
or MATH 5130 Computational Linear Algebra
or MATH 5140 Advanced Linear Algebra
or ESE 6050 Modern Convex Optimization
or STAT 5810 Convex Optimization for Statistics and Data Science
Statistics
ESE 5420Statistics for Data Science1
or STAT 5110 Statistical Inference
or STAT 5120 Mathematical Statistics
or STAT 5350 Forecasting Methods for Management
or STAT 5420 Bayesian Methods and Computation
Machine Learning
CIS 5190Applied Machine Learning1
or CIS 5200 Machine Learning
or ESE 5460 Principles of Deep Learning
Algorithms
CIS 5030Algorithms for Big Data1
or CIS 5020 Analysis of Algorithms
or CIS 6770 Advanced Topics in Algorithms and Complexity
Concentration (choose one)6
Students are required to select one of the following tracks.
Data Science Concentration
Big Data Analytics
Database and Information Systems
Modern Data Mining
4 CUs of Electives (Choose courses from any of the elective buckets)
AI Concentration
Artificial Intelligence
Computer Vision & Computational Photography
Natural Language Processing
Advanced Topics in Machine Perception
Advanced Topics in Natural Language Processing
4 CU's of Electives (Two of which should come from the ML/multimodal AI bucket)
Total Course Units10

Elective Buckets
Machine Learning, Multi-modal AI and Data Analysis

Artificial Intelligence
Deep Learning for Data Science
Trustworthy Machine Learning
Natural Language Processing
Big Data Analytics
Machine Perception
Computer Vision & Computational Photography
Advanced Topics in Machine Learning
Theory of Machine Learning
Advanced Topics in Natural Language Processing
Advanced Topics in Machine Perception
Graph Neural Networks
Machine Learning for Time-Series Data
Machine Learning for Data Science
Principles of Deep Learning
Learning for Dynamics and Control
Deep Generative Models
Learning in Robotics
Machine Learning and Its Applications in Materials Science
Modern Data Mining

AI and Data Science for Discovery

Biological Data Science II: Data Mining Principles for Epigenomics
Introduction to Neuroengineering
Brain-Computer Interfaces
Networked Neuroscience
Data Science for Biomedical Informatics
Foundations of Artificial Intelligence in Health
Advanced Methods and Health Applications in Machine Learning
Natural Language Processing for Health
Exploring Data Science Methods with Health Care Data
Introduction to Bioinformatics
Fundamentals of Computational Biology
Biomedical Image Analysis
Theoretical and Computational Neuroscience

Optimization, Systems and Control

Linear Systems Theory
Feedback Control Design and Analysis
Introduction to Optimization Theory
Modern Convex Optimization
Combinatorial Optimization
Learning for Dynamics and Control
Model Predictive Control

Social and Network Science

Ethical Algorithm Design
Econometrics I: Fundamentals
Econometrics II: Methods & Models
Econometrics III: Advanced Techniques of Cross-Section Econometrics
Econometrics IV: Advanced Techniques of Time-Series Econometrics
Applied Probability Models in Marketing

Surveys and Statistical Methods

Data and Analysis for Marketing Decisions
Business Analytics
Forecasting Methods for Management
Accelerated Regression Analysis for Business
Predictive Analytics for Business
Sample Survey Methods
Observational Studies
Bayesian Statistical Theory and Methods
Modern Regression for the Social, Behavioral and Biological Sciences

 Data-Centric Programming

Software Systems
Database and Information Systems
Advanced Programming
Internet and Web Systems
Programming and Problem Solving
GPU Programming and Architecture
GPU Computing for Machine Learning Systems
Software Engineering
Advanced Topics in Databases
Computer Systems Programming
Hardware/Software Co-Design for Machine Learning

 Robotics

RoboRacer Autonomous Racing Cars
Learning in Robotics
Introduction to Robotics
Advanced Robotics

 Simulation

Molecular Modeling and Simulations
Computational Science of Energy and Chemical Transformations
Multiscale Modeling of Chemical and Biological Systems
Finite Element Analysis
Thermodynamics: Foundations, Energy, Materials
Computational Mechanics

 Mathematical and Algorithmic Foundations

Advanced Linear Algebra
Analysis of Algorithms
Algorithms for Big Data
Theory of Machine Learning
Advanced Topics in Algorithms and Complexity
Algorithms and Computation
Numerical Methods and Modeling
Data-driven Modeling and Probabilistic Scientific Computing
Simulation Modeling and Analysis
Introduction to Optimization Theory
Data Mining: Learning from Massive Datasets
Modern Convex Optimization
Information Theory
Stochastic Models
Advanced Statistical Inference I
Bayesian Statistical Theory and Methods

Other Electives 

Introduction to Human Computer Interaction
Special Topics Only relevant data science / AI topics upon approval

Thesis / Practicum

Master's Independent Study (1 or 2 cu's of Practicum Total (1 cu per semester)) DATS Practicum and Thesis courses are not mandatory
Master's Thesis (2 cu's total needed (consecutive semesters)) DATS Practicum and Thesis courses are not mandatory

10 course units are required for the Data Science & Artificial Intelligence degree.
The ten required course units are divided into three categories: Core Courses, Concentrations, and Electives. (As long as the prerequisites for the courses are met, students can complete these courses in any sequence.)

Plan of Study Grid
First Year
FallCourse Units
Three courses (3 cu's) 1 3
 Course Units3.00
Spring
Three courses (3 CU's) 3
 Course Units3.00
Second Year
Fall
Three courses (3 CU'S) 3
 Course Units3.00
Spring
One Course (1 CU) 1
 Course Units1.00
 Total Course Units10.00