Hard Choices, Tight Timelines: A Closer Look at Skip-level Tradeoff Decisions during Incidents

Tuesday, March 19, 2024 - 2:40 pm3:25 pm

Dr. Laura Maguire, Trace Cognitive Engineering, and Courtney Nash, The VOID

Abstract: 

Unexpected outages in software service delivery - also known as incidents—often require making rapid tradeoff decisions on the road to recovery. Tradeoffs can be relatively minor-—rolling back a recent change or temporarily disabling a certain feature—or they can represent significant threats to reliability or reputation, such as when facing a loss of customer data. While the resolution of incidents is unquestioningly in the hands of engineers, senior management also have an active role in making tradeoff decisions during significant incidents. As researchers interested in software incidents, we recognized a gap in the industry’s understanding of how different levels across the organization work together to resolve challenging incidents.

Our objective in this research is to examine the kinds of tradeoff decisions management faces during incidents, the patterns in how and when they become involved, and the strategies used to coordinate effectively with their incident response teams.

During this talk you’ll get a behind the scenes (and between the ears!) look at management tradeoff decisions and how this knowledge can be used to increase an organization's capacity to handle unexpected events.

Laura Maguire, Trace Cognitive Engineering

Dr. Laura Maguire is the Principal Research Engineer at Trace Cognitive Engineering where she works with software organizations to bring new insights to their hard problems and to support the design of software for complex, cognitively demanding work. She is a Fellow with the Cognitive Systems Engineering Lab at Ohio State University and a Thesis Supervisor at Lund University in Sweden. Laura is an engaging and thought-provoking presenter who routinely speaks to software companies around the world on data driven strategies for enhancing cognitive and team performance.

Courtney Nash, The VOID

Courtney Nash is a researcher focused on system safety and failures in complex sociotechnical systems. She created the VOID in 2021 to help shine a light on how we can more effectively learn from software incidents. An erstwhile cognitive neuroscientist, she has always been fascinated by how people learn, and the ways memory influences how they solve problems. Over the past two decades, she’s held a variety of editorial, program management, research, and management roles at Holloway, Fastly, O’Reilly Media, Microsoft, and Amazon.

BibTeX
@conference {295055,
author = {Laura Maguire and Courtney Nash},
title = {Hard Choices, Tight Timelines: A Closer Look at Skip-level Tradeoff Decisions during Incidents },
year = {2024},
address = {San Francisco, CA},
publisher = {USENIX Association},
month = mar
}

Presentation Video