Course description
Databases: Semistructured Data
"Databases" was one of Stanford's three inaugural massive open online courses in the fall of 2011. It has been offered continuously since then on a variety of platforms, first in synchronous and later in self-paced versions. The material is now offered as a set of five self-paced courses, which can be combined in different ways to learn about different aspects of databases.
Relational Databases and SQL is the most popular course in the Databases series. It is intended for learners who want a strong understanding of relational databases and who want to master SQL, the long-accepted standard query language for relational database systems. Additional courses focus on advanced concepts in relational databases and SQL, formal foundations and database design methodologies, and semistructured data.
All of the courses are based around video lectures and demos. Many of them include quizzes between video segments to check understanding, in-depth standalone quizzes, and/or a variety of automatically checked interactive exercises. Each course also includes an unmoderated discussion forum and pointers to readings and resources. Taught by Professor Jennifer Widom, the overall curriculum draws from Stanford's popular, longstanding Databases course. The courses are described briefly below.
Suitability - Who should attend?
Prerequisites
None
Training Course Content
Relational Databases and SQL
- Introduction to the relational model and concepts in relational databases and relational database management systems
- Comprehensive coverage of SQL, the long-accepted standard query language for relational database management systems
Advanced Topics in SQL (prerequisite: Relational Databases and SQL)
- Creating indexes for increased query performance
- Using transactions for concurrency control and failure recovery
- Database constraints: key, referential integrity, and "check" constraints
- Database triggers
- How views are created, used, and updated in relational databases
- Authorization in relational databases
OLAP and Recursion
- Star schemas, the data cube concept, and On-Line Analytical Processing (OLAP) features in relational databases including the Cube and Rollup operators
- The SQL standard for queries over recursively-defined relations
Modeling and Theory
- Relational algebra – the algebraic query language that provides the formal foundations of SQL
- Dependency theory and normal forms in relational databases as the basis of schema design
- The data-modeling component of the Unified Modeling Language (UML), and how UML diagrams are translated to relations
Semistructured Data
- The XML model for semistructured and self-describing data, including DTDs and some features of XML Schema
- The JSON model for human-readable structured or semistructured data
- The XPath language for processing XML data, and many features of the more advanced XQuery language
- An introduction to the XSLT rule-based language for querying and transforming XML data
Course delivery details
This course is offered through Stanford University, a partner institution of edX.
8-10 hours per week
Expenses
- Verified Track - $50
- Audit Track - Free