Skip to main content

CMPS 6790 Data Science

July 27, 2022

This course is designed for both graduate students and advanced undergraduate students interested in understanding of both the fundamental and advanced concepts, techniques, and technologies required for collecting, processing, and deriving insight into data. Data Science is an interdisciplinary set of topics that includes everything you need to create data driven answers and solutions to specific business, scientific, or sociological questions. Topics typically covered include an introduction to one or more data collection and management systems, e.g., SQL, web scraping, and various data repositories; exploratory and statistical data analysis, e.g., bootstrapping, measures of central tendency, hypothesis testing and machine learning techniques including linear regression and clustering; data and information visualization, e.g., plotting and interactive charts using various technologies; and presentation and communication of the results of these analyses. Students should be comfortable programming in Python and familiar with the fundamentals of algorithmic analysis and computer systems.