Master of Professional Studies: Data Science

UMBC’s Master of Professional Studies (M.P.S.) in Data Science program prepares students from a wide range of disciplinary backgrounds for careers in data science. In the core courses, students will get a fundamental understanding of data science through classes that highlight machine learning, data analysis and data management. The core courses will also introduce students to ethical and legal implications surrounding data science.

Beyond the core courses, students will take three courses in domain specific pathways developed in collaboration with academic departments across the university. Through these pathways, students will be able to utilize the skills and techniques they learned in the core courses within their own field or area of expertise.

Required Core Courses

The goal of this class is to give students an introduction to and hands on experience with all phases of the data science process using real data and modern tools. Topics that will be covered include data formats, loading, and cleaning; data storage in relational and non-relational stores; data governance, data analysis using supervised and unsupervised learning using R and similar tools, and sound evaluation methods; data visualization; and scaling up with cluster computing, MapReduce, Hadoop, and Spark.

Prerequisite: Enrollment in the Data Science program. Other students may be admitted with instructor permission.

This course provides a broad introduction to the practical side of machine-learning and data analysis. This course examines the end-to-end processing pipeline for extracting and identifying useful features that best represent data, a few of the most important machine algorithms, and evaluating their performance for modeling data. Topics covered include decision trees, logistic regression, linear discriminant analysis, linear and non-linear regression, basic functions, support vector machines, neural networks, Bayesian networks, bias/variance theory, ensemble methods, clustering, evaluation methodologies, and experiment design.

Prerequisite: DATA 601: Introduction to Data Science and enrollment in the Data Science program. Non-Data Science students may be permitted with instructor permission.

The goal of this course is to introduce methods, technologies, and computing platforms for performing data analysis at scale. Topics include the theory and techniques for data acquisition, cleansing, aggregation, management of large heterogeneous data collections, processing, information and knowledge extraction. Students are introduced to map-reduce, streaming, and external memory algorithms and their implementations using Hadoop and its eco-system (HBase, Hive, Pig and Spark). Students will gain practical experience in analyzing large existing databases.

Prerequisite: Enrollment in the Data Science program and DATA 601. Other students may be admitted with program director’s permission.

This course introduces students to the data management, storage and manipulation tools common in data science. Students will get an overview of relational database management systems and various NoSQL database technologies, and apply them to real scenarios. Topics include: ER and relational data models, storage and concurrency preliminaries, relational databases and SQL queries, NoSQL databases, and Data Governance.

Prerequisite: Enrollment in the Data Science program. Other students may be admitted with instructor permission. Corequisite: DATA 601: Introduction to Data Science

This course provides a comprehensive overview of important legal and ethical issues pertaining to the full life cycle of data science. The student learns how to think through the ethics of making decisions and inferences based on data and how important cases and laws have shaped the data science field. Students will use real and hypothetical case studies across various domains to explore these issues.

Prerequisite: Enrollment in the Data Science program. Other students may be admitted with instructor permission. Corequisite: DATA 601: Introduction to Data Science

This is a semi-independent course that provides the advanced graduate student in the Data Science program the opportunity to apply the knowledge, skills and tools they’ve learned to a real-world data science project. Students will work with a real data set and go through the entire process of solving a real-world data science project. The project may be conducted with industry, government and academic partners, who can provide the data set, with guidance and feedback from the instructor.

Prerequisite: Completion of all other core courses.

In the rapidly evolving field of data science, technical expertise alone is not sufficient for success. Effective leadership is essential to navigate the complexities of data-driven decision-making and drive impactful outcomes. The course is designed as a practical stage-by-stage field guide for our students to their careers in data science. It provides valuable insights and strategies for individuals at different career stages, from aspiring data science tech leads to seasoned data science executives. Through a comprehensive examination of several case studies, students will develop a deep understanding of the leadership skills, capabilities, and virtues necessary for success in the field of data science.

Electives

Data science relies heavily on the principles of probability theory and inferential statistics for extracting meaningful insight from complex datasets. DATA 608 introduces students to the essential concepts and tools of probability theory and statistics that form the backbone of data-driven decision-making processes. The course emphasizes a combination of theoretical tools, and application-oriented analysis to enable students to utilize statistical methods effectively in real-world data science scenarios.

The contemporary literature on data science and machine learning primarily focuses on algorithms and methodologies and assume that the reader is already equipped with the relevant mathematics and statistics background. We have found students who want to delve into the foundations of basic machine learning and data science methods typically struggle with the required mathematical knowledge. This course brings the mathematical foundations of data science and machine learning to the fore and collects the necessary skillset in a single place so that the skills gap is minimized. The covered contents in this course consist of two major parts: The necessary and relevant concepts of linear algebra and the relevant calculus skills needed for solving optimization problems. For the linear algebra part, first an overview of the vector analysis, treatment of linear systems, matrices, subspaces, and determinants are presented. The main step in the linear algebra portion of the course deals with various matrix decomposition techniques. Specifically, the QR-decomposition, the LU-decomposition, the eigen decomposition, and the Singular Value decomposition are discussed in details and some of their applications (in data science and machine learning) are introduced. For the optimization portion of the course, first an overview of the necessary tools such as partial and directional derivatives, definition of critical points of multivariable functions, and the Hessian is provided. Then the Lagrange multiplier method for optimizations in the presence of constraints is introduced. At the end of the course, we will touch upon convex optimization and the gradient descent approach. If time permits, basic tools of the information theory will be discussed.

This course is designed to equip students with the essential skills to convey their findings in a compelling and accessible manner. Students will learn how to craft narratives that translate intricate data analyses into meaningful stories and engaging narratives for diverse audiences, including stakeholders, executives, and non-technical professionals.

This course reviews modern methods used in deep learning and neural network design from a practical perspective. Both a broad set of techniques that are commonly used in state-of-the-art neural network architectures and network styles prevalent in specific sub-domains like computer vision, natural language processing, and social network analysis are discussed. Students learn how to use these techniques in modern frameworks and how to apply these methods to new problems.

Prerequisite: Enrollment to MPS Data Program and DATA 602 or CMSC 678.

This course aims to teach the use of natural language processing (NLP) as a set of methods for exploring and reasoning about text as data. The focus will be on the applied side of NLP. Students will use existing NLP methods and libraries in Python to textual problems. Topics include language modeling, text classification, sentiment analysis, summarization, and machine translation.

Prerequisites: Enrollment in the MPS DS program; DATA 602 or CMSC 478/678.

This course introduces Generative AI (GenAI) by focusing on practical applications and hands-on experience with cutting-edge GenAI models. Students will learn to implement and apply GenAI models to generate text, images, music, and videos while addressing the ethical challenges inherent in GenAI.

Prerequisites: Enrollment to MPS Data Program and DATA 602 or CMSC 678.

This course provides a comprehensive introduction to applying data science techniques in the finance industry. It begins with an overview of financial data science and the various sources of financial data, moving through data cleaning, visualization, and analysis methods. Students will learn to apply supervised and unsupervised machine learning techniques, time series analysis, and natural language processing to real-world financial datasets. The course also covers essential financial models, portfolio optimization, anomaly detection, and algorithmic trading.

Prerequisites: Enrollment to MPS Data Program and DATA 602 or CMSC 678.

The special topics courses will cover emerging or specialized Data Science topics on an as-needed basis.

Supervision of student internship/co-op activities in the cybersecurity discipline. A short technical report that describes the activities conducted relevance to theoretical or operational concepts learned in other coursework and lessons gained through the internship/co-op experience is required at the end of the course. The course grade will be based on the technical report. The report will be submitted to the student’s Graduate Program Director by the last day of the semester.

Individualized research activities under faculty supervision related to data science.

Pathway Programs & Courses

The pathways will allow students who work in a particular domain to take classes specific to their industry. Each pathway will consist of three courses. See pathways at UMBC Main Campus, Shady Grove, and those that are available at both campuses.

UMBC Main Campus

Advanced Computing and Analytics

In collaboration with the Department of Computer Science and Electrical Engineering

  • CMSC 615 Introduction to Systems Engineering
  • CMSC 625 Modeling and Simulation of Computer Systems
  • CMSC 627 Wearable Computing
  • CMSC 628 Mobile Computing
  • CMSC 636 Data Visualization
  • CMSC 653 Information and Coding Theory
  • CMSC 655 Numerical Computations
  • CMSC 661 Principles of Database Systems
  • CMSC 668 Service-Oriented Computing
  • CMSC 671 Principles of Artificial Intelligence
  • CMSC 673 Introduction to Natural Language Processing
  • CMSC 675 Introduction to Neural Networks
  • CMSC 676 Information Retrieval
  • CMSC 678 Machine Learning
  • CMSC 691 Special Topics in Computer Science (Permission from Graduate Program Director)
  • Any other relevant graduate-level course in Computer Science with permission from the Graduate Program Director.

Clinical Informatics (with UMB)

Students can transfer in coursework from the Clinical Informatics program at the University of Maryland, Baltimore to serve as a nine-credit Clinical Informatics pathway within the M.P.S. All courses are online/asynchronous.

  • INFO 601: Foundations in Clinical and Health Informatics
  • INFO 602: Clinical Information Systems
  • INFO 604: Decision Support Systems in Healthcare

Learn more about the Clinical Informatics pathway with UMB.

Cybersecurity

In cooperation with the Cybersecurity M.S. program

  • CYBR 620 Introduction to Cybersecurity
  • CYBR 650: Managing Cybersecurity Operations
  • CYBR 658: Risk Analysis and Compliance

Click here to learn more about the cybersecurity pathway.

Data Science Analysis

In collaboration with the Department of Information Systems:

  • IS 661 – Biomedical Informatics Applications
  • IS 706 – Interfaces For Info. Visualization & Retrieval
  • IS 707 – Applications of Intelligent Technologies
  • IS 721 – Semi-Structured Data Management
  • IS 722 – Systems and Information Integration
  • IS 728 – Online Communities
  • IS 731 – Electronic Commerce
  • IS 733 – Data Mining
  • IS 777 – Data Analytics for Statistical Learning
  • Other courses may also qualify. Please consult the Program Director.

Learn more about the data science analysis pathway.

Economics/Econometrics

In collaboration with the Department of Economics:

  • PUBL 604 – Statistical Analysis
  • ECON 601 – Microeconomic Analysis
  • ECON 602 – Macroeconomic Analysis
  • ECON 611 – Advanced Econometric Analysis I
  • ECON 612 – Advanced Econometric Analysis II
  • ECON 652 – Economics of Health

Click here to learn more about the economics/econometrics pathway.

Healthcare Analytics

In cooperation with the Health IT program

  • HIT658: Health Informatics I
  • HIT759: Health Informatics II
  • HIT723: Public Health Informatics
  • HIT674: Process and Quality Improvement within Health IT
  • HIT751: Introduction to Healthcare Databases

Click here to read more about the healthcare analytics pathway.

Management Sciences

In collaboration with the College of Engineering and Information Technology, choose 3 of the following courses:

  • ENMG 650: Project Management Fundamentals
  • ENMG 654: Leading Teams and Organizations
  • ENMG 658: Financial Management
  • ENMG 659: Strategic Management
  • ENMG 660: Systems Engineering Principles
  • ENMG 661: Leading Global Virtual Teams
  • ENMG 663: Advanced Project Management Applications
  • ENMG 664: Quality Engineering & Management
  • ENMG 668: Project and Systems Engineering Management
  • ENMG 690: Innovation and Technology Entrepreneurship
  • SYST/ENMG 672: Decision and Risk Analysis

Click here to read more about the management sciences pathway.

Policy Analysis

In collaboration with the School of Public Policy

  • PUBL 601 Political and Social Context of the Policymaking Process
  • PUBL 603 Theory and Practice of Policy Analysis
  • PUBL 607 Statistical Applications in Evaluation Research
  • PUBL 608 Applied Multivariate Regression Analysis
  • PUBL 610 (special topics)

Click here to read more about the policy analysis pathway.

Project Management

In collaboration with the College of Engineering and Information Technology:

  • ENMG 650: Project Management
  • ENMG 661: Leading Virtual/Global Teams
  • ENMG 663: Advanced Project Management Applications

Note: Students pursuing the Project Management pathway are eligible for a certificate in Project Management upon completion. Read more about the Project Management pathway here.

Shady Grove

Bioinformatics (FAES @ NIH)

Students can transfer in coursework from Foundation for Advanced Education in Science (FAES) at the National Institutes of Health (NIH) to serve as a nine-credit Bioinformatics pathway within the MPS. Within Bioinformatics, FAES offers three-credit courses, as well as one-credit and two-credit courses. The one and two credit courses will need to be combined to be considered as equivalents to three-credit graduate level courses offered at UMBC. See more at professional.umbc.edu/faes.

Clinical Informatics (with UMB)

Students can transfer in coursework from the Clinical Informatics program at the University of Maryland, Baltimore to serve as a nine-credit Clinical Informatics pathway within the M.P.S. All courses are online/asynchronous.

  • INFO 601: Foundations in Clinical and Health Informatics
  • INFO 602: Clinical Information Systems
  • INFO 604: Decision Support Systems in Healthcare

Learn more about the Clinical Informatics pathway with UMB.

Cybersecurity

In cooperation with the Cybersecurity M.S. program

  • CYBR 620 Introduction to Cybersecurity
  • CYBR 650: Managing Cybersecurity Operations
  • CYBR 658: Risk Analysis and Compliance

Click here to learn more about the cybersecurity pathway.

Programs at Both Campuses

Aging Studies

In collaboration with the UMBC Erickson School of Aging

Required:

  • AGNG 600: Social and Economic Context of Aging
  • AGNG 604: Policy Foundations of Aging Services

Students who chose the “Aging Studies” pathway will take AGNG 600, AGNG 604, and one of the other courses listed below:

  • AGNG 620: An Overview of Dementia & Dementia Care Services
  • AGNG 621: Policy Foundations in Dementia Care Services
  • AGNG 624: Strategy, Marketing, and Service Delivery in Aging Services
  • AGNG 632: Diversity in Management of Aging Services

Read more about the aging studies pathway.

Clinical Informatics (with UMB)

Students can transfer in coursework from the Clinical Informatics program at the University of Maryland, Baltimore to serve as a nine-credit Clinical Informatics pathway within the M.P.S. All courses are online/asynchronous.

  • INFO 601: Foundations in Clinical and Health Informatics
  • INFO 602: Clinical Information Systems
  • INFO 604: Decision Support Systems in Healthcare

Learn more about the Clinical Informatics pathway with UMB.

Cybersecurity

In cooperation with the Cybersecurity M.S. program

  • CYBR 620 Introduction to Cybersecurity
  • CYBR 650: Managing Cybersecurity Operations
  • CYBR 658: Risk Analysis and Compliance

Click here to learn more about the cybersecurity pathway.

Looking for more info?