DiffStream: Differential Output Testing for Stream Processing Programs
High-performance architectures for processing distributed data streams, such as Flink, Spark, and Storm, are increasingly deployed in emerging data-driven computing systems. Exploiting the parallelism afforded by such platforms while preserving the semantics of the desired computation is error-prone, which motivates the development of tools for specification, testing, and verification. We focus on the problem of differential output testing for distributed stream processing systems, that is, checking whether two implementations produce equivalent output streams in response to a given input stream. The notion of equivalence allows reordering of logically independent data items, and the main technical contribution of the paper is an optimal online algorithm for checking this equivalence. Our testing framework is implemented as a library called DiffStream in Flink. We present four case studies to illustrate how our framework can be used to (1) correctly identify bugs in a set of benchmark MapReduce programs, (2) facilitate the development of difficult-to-parallelize high-performance applications, and (3) monitor an application for a long period of time using minimal computational resources.
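To make the notion of equivalence up to reordering of independent items concrete, here is a minimal offline sketch in Python. It is not the paper's optimal online algorithm and not the DiffStream API; it is a simple quadratic check written for illustration. The function names and the choice of "same key means dependent" are assumptions introduced here.

```python
from typing import Callable, Sequence, Tuple

Event = Tuple[str, int]  # hypothetical event shape: (key, value)


def equivalent_up_to_reordering(
    left: Sequence[Event],
    right: Sequence[Event],
    dependent: Callable[[Event, Event], bool],
) -> bool:
    """Return True if `right` can be obtained from `left` by repeatedly
    swapping adjacent items that are NOT dependent on each other.

    Illustrative offline check only (quadratic time); `dependent` is
    assumed to be a symmetric relation supplied by the tester.
    """
    if len(left) != len(right):
        return False
    remaining = list(right)
    for item in left:
        for idx, other in enumerate(remaining):
            if other == item:
                # Matched the earliest unmatched copy; consume it.
                del remaining[idx]
                break
            if dependent(other, item):
                # A dependent item precedes `item` in `right` but follows
                # it in `left`, so the relative order differs.
                return False
        else:
            # `item` does not occur among the unmatched items of `right`.
            return False
    return True


def same_key(a: Event, b: Event) -> bool:
    """Assumed dependence relation: events with the same key must stay ordered."""
    return a[0] == b[0]


sequential_output = [("a", 1), ("b", 1), ("a", 2)]
parallel_output_ok = [("b", 1), ("a", 1), ("a", 2)]   # only independent items moved
parallel_output_bad = [("a", 2), ("b", 1), ("a", 1)]  # two "a" events swapped

result_ok = equivalent_up_to_reordering(sequential_output, parallel_output_ok, same_key)
result_bad = equivalent_up_to_reordering(sequential_output, parallel_output_bad, same_key)
```

Under these assumptions, `result_ok` is `True` because only events with different keys changed places, and `result_bad` is `False` because two events with the same key appear in a different relative order. An online checker, as described in the abstract, would instead consume both streams incrementally rather than holding them fully in memory.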
Fri 22 Oct
Displayed time zone: Central Time (US & Canada)
10:50 - 12:10
10:50, 15m, Talk | DiffStream: Differential Output Testing for Stream Processing Programs | SIGPLAN Papers | Konstantinos Kallas (University of Pennsylvania), Filip Niksic (Google), Caleb Stanford (University of Pennsylvania), Rajeev Alur (University of Pennsylvania)
11:05, 15m, Talk | Guided Linking: Dynamic Linking Without the Costs | SIGPLAN Papers | Sean Bartell (University of Illinois at Urbana-Champaign), Will Dietz (University of Illinois at Urbana-Champaign), Vikram S. Adve (University of Illinois at Urbana-Champaign, USA)
11:20, 15m, Talk | Regex Matching with Counting-Set Automata | SIGPLAN Papers | Lukáš Holík (Brno University of Technology), Ondřej Lengál (Brno University of Technology), Olli Saarikivi (Microsoft), Lenka Turoňová (Brno University of Technology), Margus Veanes (Microsoft), Tomáš Vojnar (Brno University of Technology)
11:35, 15m, Talk | Actor Concurrency Bugs: A Comprehensive Study on Symptoms, Root Causes, API Usages, and Differences (In-Person) | SIGPLAN Papers | Mehdi Bagherzadeh (Oakland University), Nicholas Fireman (Oakland University), Anas Shawesh (Oakland University), Raffi Khatchadourian (CUNY Hunter College)
11:50, 20m, Live Q&A | Discussion, Questions and Answers | SIGPLAN Papers