Course description
Modern Distributed Systems
Modern IT infrastructure is built as distributed systems, an exciting concept that started with the first computers and evolved rapidly into its present form. From online video meetings to internet services, from social media platforms to online games, we all use and interact with distributed systems on a daily basis and increasingly depend on them. Designing and operating such large-scale distributed systems, however, is complex and typically involves making reasonable compromises. Both fundamental technical barriers and economic arguments explain why we cannot make these systems behave as if they were running on a single, perfectly reliable machine.
In this course, learners will be introduced to the essential functional and non-functional concerns of distributed systems and the common problems encountered while designing them, such as consistency, availability, elasticity, and scalability. A variety of practical solutions that leading tech companies have established in recent years will be reviewed. These provide reusable building blocks for creating new large-scale applications. These recent developments, especially around cloud computing, large-scale data processing, distributed machine learning, and other fields, are often not reflected in textbooks and are absent from many traditional curricula, but they are at the heart of this course.
The course will therefore provide learners with a fundamental understanding (theoretical and practical foundations) of how cloud, edge, and big data processing systems work and how they address common challenges for distributed systems such as performance, resilience, and scalability.
Learning progress is assessed through a variety of activities, including quizzes, design exercises, experiments, and open questions, with peer review of other students’ solutions. In the final project, learners will design a distributed system based on their own experience and interests and describe its functional and non-functional properties.
Suitability - Who should attend?
Prerequisites
Basic knowledge of software systems.
Basic programming skills in a mainstream programming language.
Outcome / Qualification etc.
What you'll learn
- Describe the principles of distributed systems.
- Contrast distributed systems with other forms of computation (e.g., single machine computation, parallel computing).
- Identify applications of distributed systems in science, engineering, business, and home use, and in particular the use of cloud and serverless applications, big data and graph processing applications, interactive and online gaming, etc.
- Analyze and design core architectures, components, and techniques in distributed systems.
- Solve practical problems related to modern uses of distributed systems.
Training Course Content
Introduction to Distributed Systems
- Parallel vs. Distributed Systems
- Challenges in Distributed Systems
- The CAP Theorem
- Example 1: Online Gaming
- Example 2: Scientific Computing
Functional Requirements
- Functional vs. Non-Functional Properties
- Naming
- Replication
- Consistency
- Consensus
Non-Functional Requirements
- Importance of Performance
- Measuring NFRs: Metrics
- Scalability and Elasticity
- Amdahl’s Law
- Gustafson’s Law
- Benchmarking
Resource Management and Scheduling
- Scheduling in the Small: Processor Scheduling
- Scheduling in the Large: Scheduling for Distributed Systems
- Workloads in DS
- Centralized Schedulers: Kubernetes, etc.
- Decentralized Schedulers: HTCondor, etc.
- Portfolio Scheduling
System Architectures and Programming Models
- Trade-offs between SAs and PMs
- Programming Models
- System Architectures: communication, big data, machine learning
- Layering
Distributed Ecosystems
- Introduction to massive processing
- Theory of ecosystems
- Super-distribution principle
- Distributed ecosystems in science and engineering: cloud, edge, big data
- Distributed ecosystems in online gaming
- The future of distributed ecosystems
Course delivery details
This course is offered through The Georgia Institute of Technology, a partner institute of edX.
3-5 hours per week
Expenses
- Verified Track - $149
- Audit Track - Free