phone iconSpeak with our expert +1 617 539 7216

Learn more about the course

Get details on syllabus, projects, tools, and more

Name
Email
Mobile Number

By submitting this form, you consent to our Terms of Use & Privacy Policy and to be contacted by us via Email/Call/Whatsapp/SMS.

Data Science and Machine Learning: Making Data-Driven Decisions

Data Science and Machine Learning: Making Data-Driven Decisions

Build industry-valued AI, Data Science, and Machine Learning skills

Application closes 10th Jul 2025

Upskill in AI, Data Science & ML

  • List icon

    Live Mentorship from Industry Practitioners

    Join weekend live virtual sessions with AI, data science and machine learning professionals. Benefit from real-time guidance from experienced practitioners at global organizations.

  • List icon

    Modules on Responsible AI and Generative AI

    Deepen understanding of ethical AI with the Responsible AI module and explore innovations in Generative AI, covering tools, techniques, and real-world applications.

overview icon

Program Outcomes

Key takeaways for career success in AI, Data Science, and Machine Learning

Designed for learners to gain hands-on experience and build industry-valued skills

  • List icon

    Understand the intricacies of Data Science and Artificial Intelligence techniques and their applications to real-world problems

  • List icon

    Implement various Machine Learning techniques to solve complex problems and make data-driven business decisions

  • List icon

    Explore two major realms of Artificial Intelligence: Machine Learning and Deep Learning, and understand how they apply to domains such as Computer Vision and Recommendation Systems

  • List icon

    Choose how to represent your data effectively when making predictions

  • List icon

    Explore the practical applications of Recommendation Systems across various industries and business contexts

  • List icon

    Build an industry-ready portfolio of projects and demonstrate your ability to extract valuable business insights from data

Earn a certificate of completion from MIT IDSS

  • U.S. News & World Report, 2024

    U.S. #2

    U.S. News & World Report Rankings, 2024-2025

  • QS World University Rankings, 2025

    World #1

    QS World University Rankings, 2025

Key program highlights

Why choose the Data Science and Machine Learning program

  • List icon

    Learn from MIT faculty

    Learn from the vast knowledge of MIT AI, Data Science and Machine Learning faculty through recorded sessions.

  • List icon

    Collaborative peer networking

    Engage in a collaborative environment, networking with global AI, Data Science, and Machine Learning peers.

  • List icon

    Build your AI, Data Science, and Machine Learning Portfolio

    Showcase your AI and data science skills with 3 real-world projects and 50+ hands-on case studies in your e-portfolio.

  • List icon

    Personalized mentorship sessions

    Benefit from personalized weekend mentorship by experienced AI, Data Science and ML practitioners from leading global organizations.

  • List icon

    Dedicated Program support

    Connect with dedicated program managers to assist with queries and guide you throughout the course.

  • List icon

    Generative AI Masterclasses

    Get access to 3 masterclasses on Generative AI and its use cases by industry experts.

Skills you will learn

Python

Machine Learning

Deep Learning

Recommendation Systems

Computer Vision

Predictive Analytics

Generative AI

Prompt Engineering

Retrieval-Augmented Generation

Ethical AI

Python

Machine Learning

Deep Learning

Recommendation Systems

Computer Vision

Predictive Analytics

Generative AI

Prompt Engineering

Retrieval-Augmented Generation

Ethical AI

view more

  • Overview
  • Curriculum
  • Projects
  • Tools
  • Certificate
  • Faculty
  • Mentors
  • Reviews
  • Fees
optimal icon

This program is ideal for

Professionals ready to advance their skills in AI, Data Science, and Machine Learning

View Batch Profile

  • Building Expertise for AI-driven Roles

    Professionals looking to build expertise in AI, Data Science, and Machine Learning through hands-on projects and real-world applications.

  • Driving Actionable Insights

    Individuals seeking to enhance their ability to turn complex data into actionable insights for better business decision-making.

  • Leading AI Initiatives

    Professionals aiming to lead or contribute to AI and Data Science initiatives across industries.

  • Solving Business Challenges

    Professionals interested in applying advanced AI techniques like Generative AI, Deep Learning, and Recommendation Systems to solve business challenges.

Curriculum

Designed by MIT IDSS faculty in collaboration with industry experts, the curriculum covers the most relevant technologies in data science today, including machine learning, deep learning, recommendation systems, network analytics, graph neural networks, time series forecasting, ChatGPT, and Generative AI.

Pre-work: Introduction to Data Science and AI

The Pre-Work gets you onboarded into the world of Data Science, Artificial Intelligence, and Generative AI, where they came from, how they are used in the industry, and how business problems are typically solved using them, thereby preparing you to explore the transformative potential of data-driven decision making and equipping you with essential programming skills. This will allow you to get to zoom out and see the big picture before you start learning the details.

This course is a foundational step in your journey through the program. It'll get you acquainted with the Data Science and AI world and equip you with basic Python programming skills, thereby laying a foundation for the learning throughout the program. Completing this module ensures you're well-prepared to tackle the program with confidence and ease.


  1. Introduction to the World of Data 
  2. Introduction to Python 
  3. Introduction to Generative AI 
  4. Application of DS and AI 
  5. Data Science Lifecycle 
  6. Maths and Stats behind DS and AI 
  7. History of DS and AI

Week 0: Data Science and AI Applications

This module gives you an overview of the complete lifecycle of an AI application through a comprehensive case study analysis. By examining real-world scenarios, you will gain a holistic understanding of how AI is leveraged to address and solve complex business challenges. This detailed walkthrough will equip you with the insights needed to see the broader context of AI's role in driving business solutions, from conception to execution.

Weeks 1 and 2: Foundations of AI

This course will aim to build the programming and statistical foundations necessary for the rest of the program. It will cover the concepts, tools, and techniques required to effectively learn and implement the ideas presented in subsequent courses. 


(Numpy arrays and Functions, Pandas Series and DataFrames, Pandas Functions, Saving and loading datasets using Pandas. Data Visualization using Seaborn, Matplotlib, and Plotly. Introduction to Inferential Statistics, Fundamentals of Probability Distributions, The Central Limit Theorem, Hypothesis Testing, Univariate Analysis, Bivariate Analysis, Missing Value Treatment, Outlier Treatment)

Week 3: Masterclass 1: Data Analysis with Generative AI

Week 4: Making Sense of Unstructured Data

This course equips you with the tools and techniques to harness a vast amount of unstructured data and uncover hidden patterns that can enhance performance in various fields. 


(Supervised & Unsupervised Learning: Understanding Classification and Clustering Methods. K-Means Clustering, Dimensionality Reduction Techniques: PCA and t-SNE)

Week 5: Project 1 on Clustering and PCA and Masterclass 2: Learning from Text Data

Week 6: Regression and Prediction

In this course, you will delve into the world of predictive modeling using both classical and cutting-edge regression techniques. This course builds upon foundational knowledge to equip you with skills to analyze and predict using data, whether the focus is on understanding past data trends or forecasting future outcomes. The emphasis will be on identifying the relationship between inputs and outputs and practically applying these insights. The course will not only highlight traditional linear and non-linear regression methods but will also introduce modern approaches suited for high-dimensional datasets. You'll gain an understanding of causal inference, enabling you to differentiate between correlation and causation in your predictive models. 


(Linear and Non-Linear Regression, Causal Inference, Regression with High-Dimensional Data, Regularization Techniques, Model Evaluation, Cross-Validation, and Bootstrapping)

Week 7: Classification and Hypothesis Testing

This course dives into essential techniques of classification to determine the class of observations, utilizing tree-based algorithms like Decision Trees and Random Forests. Additionally, gain insights into Hypothesis Testing to make informed inferences about population parameters. Through this course, you will learn to efficiently address diverse data science challenges and enhance decision-making processes using statistical tests and classification models. 


(Introduction to Classification, Logistic Regression, Decision Trees, and Random Forest, Type 1 Error & Type 2 Error in Classification Problems, Hypothesis Testing)

Week 8: Project 2 on Machine Learning Classification and Masterclass 3: AI-Powered Text Labeling

Week 9: Deep Learning and Computer Vision

In this course, you will dive deep into the world of Deep Learning, a transformative technology that outperforms classical machine learning techniques by handling complex unstructured data. You will explore the fundamental concepts of representation learning and Neural Networks, understanding how they transcend the constraints of traditional feature engineering. Learn to build and implement Artificial Neural Networks using TensorFlow and Keras, enhancing your ability to solve intricate prediction problems with remarkable accuracy. 


(Introduction to Deep Learning, Neural Network Representations: One Hidden Layer, Hidden Neurons, Multiple Layers & Multi-class Predictions, Introduction to Computer Vision, ANN vs CNN, Basic terminologies related to CNN, CNN architecture, Transfer Learning)

Week 10: Recommendation Systems

In this comprehensive course, you will explore Recommender Systems, a powerful solution for tackling the problem of information overload faced by users in today's digital age. Learn the foundational principles and the necessity of Recommendation Systems and how they can personalize user experiences by delivering the most relevant content. Delve into various recommendation techniques, from simple solutions using statistical and Machine Learning methods to advanced Collaborative Filtering approaches, designed to analyze user data and provide customized recommendations. 


(Recommendation Systems - Overview & background, Collaborative Filtering & Singular Value Threshold)

Week 11: Ethical and Responsible AI

This course on Ethical and Responsible AI provides a focused overview of key ethical considerations throughout the AI lifecycle. You'll learn to identify biases, comprehend causality, and safeguard privacy within AI systems. The course also explores the interconnections and interdependencies across AI domains, equipping you to build systems that are ethically sound and aligned with societal values. 


(Introduction to AI Lifecycle, Introduction to Bias and Its Examples, Introduction to Causality and Privacy, Interconnections and Domains, Interdependency and Feedback in AI Systems)

Week 12: Project 3 on Recommendation System and Masterclass 4: AI on Proprietary Data

Optional Week 13: Masterclass 5: Agentic AI workflows

Prompt Engineering (Self-paced)

This course will introduce you to how to design and create prompts, various Prompting Techniques, help gain insights on how to interact effectively and elicit the desired responses from LLM models, and various applications and use cases where prompt engineering plays a vital role. 


(Operationalizing Generative AI, LLM Training and Inference, OpenAI Journey and GPT Training, Model Deployment and tokens, Introduction to prompt engineering, Prompt Engineering review and reusable templates, Zero-Shot and Few-Shot Prompting, Chain-of-Thought Prompting, Introduction to APIs for LLMs)

Networking and Graphical Models (Self-paced)

In this course, you'll delve into the fascinating world of networks, an integral component in areas like social networking and gene regulation. While prior modules focused on regression, classification, and recommendation systems, this module shifts the focus to interactions and correlations as primary data or interest points. The course provides a systematic exploration of methods for analyzing complex network structures and inferring unseen data. With an emphasis on graphical models, you'll understand their powerful role in modeling network processes and facilitating statistical computations. Professors Caroline Uhler and Guy Bresler will guide you through foundational network concepts, the applications of network data, and methods for constructing and understanding network behavior, thereby providing you with the tools to leverage networks and graphical models effectively. 


(Introduction to Networks, Network Analysis, Graphical Models)

Predictive Analytics (Self-paced)

This course focuses on the integral aspect of temporal data and its significance in predictive modeling. By understanding how data evolves, learners can build robust models to anticipate future trends. Dive into the dynamics of temporal data, learning to define inputs and outputs for superior prediction accuracy. Explore feature engineering techniques that transform temporal data into valuable insights. Evaluate prediction models to ensure they're ready for real-world deployment. This course includes hands-on examples and strategies such as Deep Feature Synthesis, preparing you to effectively apply predictive models in various problem domains. 


(Introduction to Predictive Analytics and Feature Engineering, Deep Feature Synthesis - Primitives and Feature Engineering, Model Selection Techniques, K-Fold Cross Validation)

ChatGPT and Generative AI – The Development Stack (Self-paced)

This course offers a hands-on, layered introduction to the core components behind today’s generative AI systems. You'll begin with the foundations of Generative AI and NLP, then explore how deep learning evolved into powerful transformer-based architectures like GPT. You'll understand how large language models are trained, deployed, and fine-tuned using techniques like prompt engineering, Retrieval-Augmented Generation (RAG), and reinforcement learning. With practical demos, notebooks, and solution walkthroughs, the course equips you with the skills to prototype and optimize your own AI assistants using modern tools. 


(Demystifying Generative AI, Overview of Natural Language Processing, Advancements in AI – From deep learning to transformers and LLMs, Understanding GPT and LLM Training, Hands-on Demonstration – Building a prototypical AI assistant using LLMs and RAG)

Projects and Case Studies

The program follows a learn-by-doing pedagogy, helping you build your skills through real-world case studies and hands-on practice. Below are samples of potential project topics and case studies you will work on.

  • 3

    hands-on projects

  • 50+

    case studies

project icon

Retail

Customer Personality Segmentation

About the Project

It focuses on customer segmentation, a common practice in retail to improve marketing strategies, customer retention, and resource allocation. By analyzing customer demographics, purchasing behavior, and interactions with marketing campaigns, the retail company aims to understand its customer base better and tailor its offerings to meet the preferences and needs of different customer segments.

Skills you will learn

  • Python
  • Exploratory Data Analysis
  • Data Pre-processing
  • K-means Clustering
project icon

EdTech (Educational Technology)

Potential Customers Prediction

About the Project

The problem statement involves predicting potential customers in this rapidly growing sector by analyzing leads and their interactions with the company, ExtraaLearn.

Skills you will learn

  • Python
  • Decision tree
  • Random forest
project icon

E-Commerce and Technology

Amazon Product Recommendation System

About the Project

This project involves developing a product recommendation system for Amazon, focusing on providing personalized suggestions based on users' previous product ratings. By utilizing techniques like collaborative filtering, the goal is to enhance user engagement and satisfaction, ultimately driving sales and improving the user experience on the platform.

Skills you will learn

  • Python
  • Knowledge/Rank-based
  • Similarity-Based Collaborative filtering
  • Matrix Factorization Based Collaborative Filtering
  • Clustering-based recommendation system
  • Content-based collaborative filtering
project icon

Healthcare

Hospital Loss Prediction

About the Project

This case study focuses on building a regression-based machine learning solution to predict the Length of Stay (LOS) of patients using data available at admission and from initial tests. The goal is to identify key factors influencing LOS, derive actionable insights, and support hospital policy planning to enhance infrastructure and revenue generation.

Skills you will learn

  • Exploratory Data Analysis
  • Data Preprocessing
  • Regression Modeling
  • Data Interpretation
  • Python Programming
project icon

Human Resources

HR Employee Attrition Prediction

About the Project

This case study involves developing a predictive model to identify employees at risk of attrition using organizational data. By uncovering patterns in employee behavior and characteristics, the model helps to optimize retention efforts and reduce costs by targeting incentives only to high-risk individuals.

Skills you will learn

  • Exploratory Data Analysis
  • Data Preprocessing
  • Logistic Regression
  • Linear Discriminant Analysis (LDA)
  • Quadratic Discriminant Analysis (QDA)
  • Python Programming
project icon

Geospatial Technology

Street View Housing Number Digit Recognition

About the Project

This case study focuses on building a deep learning solution to recognize house numbers from street-level images using the SVHN dataset. The model automates the transcription of numeric address data from image patches, supporting geospatial applications such as improving digital map accuracy and pinpointing building locations.

Skills you will learn

  • Exploratory Data Analysis
  • Data Preprocessing
  • Artificial Neural Networks (ANNs)
  • Convolutional Neural Networks (CNNs)
  • Python Programming
project icon

E-commerce

Book Recommendation System

About the Project

This case study explores the development of a book recommendation system that suggests titles based on user preferences. By leveraging various collaborative filtering techniques and user-item interaction data, the system delivers relevant suggestions to enhance user experience and drive sales. Widely applicable across major e-commerce platforms, such systems help reduce browsing time and increase purchase value.

Skills you will learn

  • Exploratory Data Analysis
  • Data Preprocessing
  • Knowledge/Rank-Based Recommendations
  • Similarity-Based Collaborative Filtering
  • Matrix Factorization
  • Python Programming

Languages and Tools covered

  • tools-icon

    Python

  • tools-icon

    NumPy

  • tools-icon

    Keras

  • tools-icon

    Tensorflow

  • tools-icon

    Matplotlib

  • tools-icon

    Skitlearn

  • And More...

Earn a certificate of completion from MIT IDSS

Certificate from the MIT Schwarzman College of Computing and IDSS upon successful completion of the program

  • World #1

    World #1

    MIT ranks #1 in World Universities – QS World University Rankings, 2025

  • U.S. #2

    U.S. #2

    MIT ranks #2 among National Universities – U.S. News & World Report Rankings, 2024–2025

certificate image

* Image for illustration only. Certificate subject to change.

Program Faculty

  • Munther Dahleh - Faculty Director

    Munther Dahleh

    Program Faculty Director, MIT Institute for Data, Systems, and Society (IDSS)

    Know More
  • John N. Tsitsiklis - Faculty Director

    John N. Tsitsiklis

    Clarence J. Lebel Professor, Dept. of Electrical Engineering & Computer Science (EECS) at MIT

    Know More
  • Ankur Moitra - Faculty Director

    Ankur Moitra

    Rockwell International Career Development Associate Professor, Mathematics and IDSS, MIT

    Know More
  • Caroline Uhler - Faculty Director

    Caroline Uhler

    Henry L. & Grace Doherty Associate Professor, EECS and IDSS, MIT

    Know More
  • David Gamarnik - Faculty Director

    David Gamarnik

    Nanyang Technological University Professor of Operations Research, Sloan School of Management and IDSS, MIT

    Know More
  • Devavrat Shah - Faculty Director

    Devavrat Shah

    Professor, EECS and IDSS, MIT

    Know More
  • Guy Bresler - Faculty Director

    Guy Bresler

    Associate Professor, EECS and IDSS, MIT

    Know More
  • Jonathan Kelner - Faculty Director

    Jonathan Kelner

    Professor, Mathematics, MIT

    Know More
  • Kalyan Veeramachaneni - Faculty Director

    Kalyan Veeramachaneni

    Principal Research Scientist at the Laboratory for Information and Decision Systems, MIT.

    Know More
  • Philippe Rigollet - Faculty Director

    Philippe Rigollet

    Professor, Mathematics and IDSS, MIT

    Know More
  • Stefanie Jegelka - Faculty Director

    Stefanie Jegelka

    X-Consortium Career Development Associate Professor, EECS and IDSS, MIT

    Know More
  • Tamara Broderick - Faculty Director

    Tamara Broderick

    Associate Professor, EECS and IDSS, MIT.

    Know More
  • Victor Chernozhukov - Faculty Director

    Victor Chernozhukov

    Professor, Economics and IDSS, MIT

    Know More

Program Mentors

Interact with dedicated and experienced industry experts who will guide you in your learning and career journey

  •  Bradford Tuckfield - Mentor

    Bradford Tuckfield

    Founder and Data Science Consultant Kmbara
    Kmbara Logo
  •  Vaibhav Verdhan - Mentor

    Vaibhav Verdhan

    Analytics Leader, Global Advanced Analytics AstraZeneca
    AstraZeneca Logo
  •  Mayan Murray - Mentor

    Mayan Murray

    Senior Data Scientist and UX Consultant IBM
    IBM Logo
  •  Vibhor Kaushik - Mentor

    Vibhor Kaushik

    Data Scientist Amazon
    Amazon Logo
  •  Amit Agarwal - Mentor

    Amit Agarwal

    Senior Data Scientist Oracle
    Oracle Logo
  •  Kemal Yilmaz - Mentor

    Kemal Yilmaz

    Senior Data Scientist Walmart Connect
    Walmart Connect Logo
  •  Xiaojun Su - Mentor

    Xiaojun Su

    Data Science Product Manager Unilever
    Unilever Logo
  •  Juan Castillo - Mentor

    Juan Castillo

    Machine Learning Engineer SEPHORA
    SEPHORA Logo
  •  Andrew Marlatt - Mentor

    Andrew Marlatt

    Data Scientist - Revenue Expansion Shopify
    Shopify Logo
  •  Rohit Dixit - Mentor

    Rohit Dixit

    Senior Data Scientist Siemens Healthineers
    Siemens Healthineers Logo
  •  Srikanth Pyaraka - Mentor

    Srikanth Pyaraka

    Data Science Product Manager Verizon
    Verizon Logo
  •  Angel Das - Mentor

    Angel Das linkin icon

    Data Science Consultant IQVIA Asia Pacific
    IQVIA Asia Pacific Logo
  •  Shirish Gupta - Mentor

    Shirish Gupta

    Lead Data Scientist Novartis
    Novartis Logo
  •  Vanessa Afolabi - Mentor

    Vanessa Afolabi

    Senior Data Scientist Loblaw Companies Limited
    Loblaw Companies Limited Logo
  •  Thinesh Pathmanathan - Mentor

    Thinesh Pathmanathan

    Data Scientist TD
    TD Logo
  •  Grivine Ochieng - Mentor

    Grivine Ochieng

    Lead Data Engineer Xetova
    Xetova Logo

Watch inspiring success stories

  • learner image
    Watch story

    "The people behind the program were amazing, I believe this was best part of the program"

    The favourite part was the hackathon competition, where we had to combine everything that we had learnt and build the model

    Arlindo Almada

    ,

  • learner image
    Watch story

    "The program helped me restructure my professional life after COVID"

    Francisco JosÉ Valencia Alaix

    ,

  • learner image
    Watch story

    ""

    Towo Adeyemi

    ,

Course fees

The course fee is 2,500 USD

Invest in your career

  • benifits-icon

    Learn from world-renowned MIT IDSS faculty and top industry leaders

  • benifits-icon

    Build an impressive portfolio with 3 projects and 50+ case studies

  • benifits-icon

    Get personalized assistance with a dedicated Program Manager from Great Learning

  • benifits-icon

    Earn a certificate of completion from MIT IDSS and 8.0 Continuing Education Units (CEUs)

project icon

Easy payment plans

Avail our EMI options & get financial assistance

Payment Partners

Check our different payment options with trusted partners

benifits-icon benifits-icon benifits-icon benifits-icon

*Subject to partner approval based on applicable regions & eligibility

timer
00 : 00 : 00

Unlock exclusive course sneak peek

Application Closes: 10th Jul 2025

Application Closes: 10th Jul 2025

Talk to our advisor for offers & course details

Application Process

  • steps icon

    1. Fill application form

    Apply by filling a simple online application form.

  • steps icon

    2. Application Screening

    A panel from Great Learning will review your application to determing your fit for the program.

  • steps icon

    3. Join program

    After a final review, you will receive an offer for a seat in the upcoming cohort of the program.

Batch start date

  • Online · 12th Jul 2025

    Admission closing soon

Got more questions? Talk to us

Connect with a program advisor and get your queries resolved

Speak with our expert +1 617 539 7216 or email to dsml.mit@mygreatlearning.com

career guidance

Delivered in Collaboration with:

MIT Professional Education is collaborating with online education provider Great Learning to offer Data Science and Machine Learning: Making Data-Driven Decisions. This program leverages MIT's leadership in innovation, science, engineering, and technical disciplines developed over years of research, teaching, and practice. Great Learning collaborates with institutions to manage enrollments (including all payment services and invoicing), technology, and participant support. Accessibility