Semester.ly

Johns Hopkins University | EN.553.636

Introduction to Data Science

4.0

credits

Average Course Rating

(4.17)

Today the term Data Science is widely used covering a broad range of topics from mathematics and algorithms to actual data analysis and machine learning techniques. This course provides a thorough survey of relevant methods balancing the theory and the application aspects. Accordingly, the material and the discussions alternate between the methodology along with its underlying assumptions and the implementations along with their applications. We will cover several supervised methods for regression and classification, as well as unsupervised methods for clustering and dimensional reduction. To name a few in chronological order, the topics will include generalized linear regression, principal component analysis, nearest neighbor and Bayesian classifiers, support vector machines, logistic regression, decision trees, random forests, K-means clustering, Gaussian mixtures and Laplacian eigenmaps. The course uses Python and Jupyter Notebook and includes visualization techniques throughout the semester. Time permitting, an introduction to the Structured Query Language (SQL) is provided toward the end of the semester.

Fall 2022

(4.21)

Spring 2023

(4.12)

Fall 2022

Professor: Tamas Budavari

(4.21)

Spring 2023

Professor: Tamas Budavari

(4.12)

Lecture Sections

(01)

No location info
T. Budavari
16:30 - 17:20

(02)

No location info
T. Budavari
09:00 - 09:50

(03)

No location info
T. Budavari
09:00 - 09:50

(04)

No location info
T. Budavari
10:00 - 10:50

(05)

No location info
T. Budavari
16:30 - 17:20

(06)

No location info
T. Budavari
18:00 - 18:50

(07)

No location info
T. Budavari
18:00 - 18:50

(08)

No location info
T. Budavari
19:00 - 19:50