Search courses 👉
Professional Training

Data Analysis: Statistical Modeling and Computation in Applications

edX, Online
Length
16 weeks
Next course start
Start anytime See details
Course delivery
Self-Paced Online
Length
16 weeks
Next course start
Start anytime See details
Course delivery
Self-Paced Online
Visit this course's homepage on the provider's site to learn more or book!

Course description

Data Analysis: Statistical Modeling and Computation in Applications

Data science requires multi-disciplinary skills ranging from mathematics, statistics, machine learning, problem solving to programming, visualization, and communication skills. In this course, learners will combine these foundational and practical skills with domain knowledge to ask and answer questions using real data.

This course will start with a review of common statistical and computational tools such as hypothesis testing, regression, and gradient descent methods. Then, learners will study common models and methods to analyze specific types of data in four different domain areas:

  • Epigenetic Codes and Data Visualization
  • Criminal Networks and Network Analysis
  • Prices, Economics and Time Series
  • Environmental Data and Spatial Statistics

Learners will be guided to analyze a real data set from each of these areas of focus, and present their findings in written reports. They will also discuss relevant and practical issues with peers.

Upcoming start dates

1 start date available

Start anytime

  • Self-Paced Online
  • Online
  • English

Suitability - Who should attend?

Prerequisites

  • Undergraduate Python programming
  • Undergraduate multi-variable calculus, and linear algebra
  • Undergraduate probability theory and statistics
  • Undergraduate machine learning

Outcome / Qualification etc.

What you'll learn

  • Model, form hypotheses, perform statistical analysis on real data
  • Use dimension reduction techniques such as principal component analysis to visualize high-dimensional data and apply this to genomics data
  • Analyze networks (e.g. social networks) and use centrality measures to describe the importance of nodes, and apply this to criminal networks
  • Model time series using moving average, autoregressive and other stationary models for forecasting with financial data
  • Use Gaussian processes to model environmental data and make predictions Communicate analysis results effectively

Course delivery details

This course is offered through Massachusetts Institute of Technology, a partner institute of EdX.

10-15 hours per week

Expenses

  • Verified Track -$300
  • Audit Track - Free
Ads