© 2009-2023 Strange Loop | Privacy Policy

Modern distributed systems are massively complex. When running in production, they can experience various unexpected subtle failures. Thus, deploying comprehensive runtime checkers (monitors) to detect and localize failures is crucial for high availability. Unfortunately, existing checkers are ad-hoc, simplistic, and tedious to write manually. In this session, you will learn three systematic techniques that can automatically generate effective, customized checkers for large distributed systems.
Ryan Huang is an Assistant Professor at Johns Hopkins University, where he works broadly on computer systems including distributed systems, operating systems, cloud, and mobile computing. His research focuses on designing principled methods to improve the reliability and performance of modern systems. His work received multiple best paper awards at top systems conferences. He is a recipient of the NSF CAREER Award (2020) and a Facebook research award. More information about him can be found at https://www.cs.jhu.edu/~huang.