Multi-modal Program Inference: A Marriage of Pre-trained Language Models and Component-Based Synthesis (SPLASH 2021 - OOPSLA)

Sun 17 - Fri 22 October 2021 Chicago, Illinois, United States

Who

Kia Rahmani, Mohammad Raza, Sumit Gulwani, Vu Le, Daniel Morris, Arjun Radhakrishna, Gustavo Soares, Ashish Tiwari

Track

SPLASH 2021 OOPSLA

Time Zone

The program is currently displayed in (GMT-05:00) Central Time (US & Canada).

Use conference time zone: (GMT-05:00) Central Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Fri 22 Oct 2021 11:35 - 11:50 at Zurich D - Synthesis of models, tools and programs Chair(s): Jonathan Aldrich
Fri 22 Oct 2021 19:35 - 19:50 at Zurich D - Synthesis of models, tools and programs -- mirror Chair(s): Alex Potanin

Abstract

Multi-modal program synthesis refers to the task of synthesizing programs (code) from their specification given in different forms, such as a combination of natural language and examples. Examples provide a precise but incomplete specification, and natural language provides an ambiguous but more "complete" task description. Machine-learned pre-trained models (PTMs) are adept at handling ambiguous natural language, but struggle with generating syntactically and semantically precise code. Program synthesis techniques can generate correct code, often even from incomplete but precise specifications, such as examples, but they are unable to work with the ambiguity of natural languages. We present an approach that combines PTMs with component-based synthesis (CBS): PTMs are used to generate candidates programs from the natural language description of the task, which are then used to guide the CBS procedure to find the program that matches the precise examples-based specification. We use our combination approach to instantiate multi-modal synthesis systems for two programming domains: the domain of regular expressions and the domain of CSS selectors. Our evaluation demonstrates the effectiveness of our domain-agnostic approach in comparison to a state-of-the-art specialized system, and the generality of our approach in providing multi-modal program synthesis from natural language and examples in different programming domains.

Link to Preprint

https://arxiv.org/pdf/2109.02445.pdf

DOI

https://doi.org/10.1145/3485535

Kia Rahmani

Purdue University

United States

Mohammad Raza

Microsoft

United States

Sumit Gulwani

Microsoft

United States

Vu Le

Microsoft

United States

Daniel Morris

Microsoft

United States

Arjun Radhakrishna

Microsoft

United States

Gustavo Soares

Microsoft

United States

Ashish Tiwari

Microsoft

United States

Time Zone

The program is currently displayed in (GMT-05:00) Central Time (US & Canada).

Use conference time zone: (GMT-05:00) Central Time (US & Canada)Select other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Fri 22 Oct
Displayed time zone: Central Time (US & Canada) change

10:50 - 12:10	Synthesis of models, tools and programsOOPSLA at Zurich D +8h Chair(s): Jonathan Aldrich Carnegie Mellon University

10:50 15m Talk		Rewrite Rule Inference Using Equality SaturationVirtual OOPSLA Chandrakana Nandi Certora, inc., Max Willsey University of Washington, Amy Zhu University of Washington, Yisu Remy Wang University of Washington, Brett Saiki University of Washington, Adam Anderson University of Washington, Adriana Schulz University of Washington, Dan Grossman University of Washington, Zachary Tatlock University of Washington DOI
11:05 15m Talk		Semantic Programming by Example with Pre-trained ModelsVirtual OOPSLA Gust Verbruggen KU Leuven, Vu Le Microsoft, Sumit Gulwani Microsoft DOI
11:20 15m Talk		One Down, 699 to Go: or, Synthesising Compositional DesugaringsVirtual OOPSLA Sándor Bartha University of Edinburgh, James Cheney University of Edinburgh; Alan Turing Institute, Vaishak Belle University of Edinburgh; Alan Turing Institute DOI
11:35 15m Talk		Multi-modal Program Inference: A Marriage of Pre-trained Language Models and Component-Based SynthesisIn-Person OOPSLA Kia Rahmani Purdue University, Mohammad Raza Microsoft, Sumit Gulwani Microsoft, Vu Le Microsoft, Daniel Morris Microsoft, Arjun Radhakrishna Microsoft, Gustavo Soares Microsoft, Ashish Tiwari Microsoft DOI Pre-print
11:50 20m Live Q&A		Discussion, Questions and Answers OOPSLA

18:50 - 20:10	Synthesis of models, tools and programs -- mirrorOOPSLA at Zurich D Chair(s): Alex Potanin Victoria University of Wellington

18:50 15m Talk		Rewrite Rule Inference Using Equality SaturationVirtual OOPSLA Chandrakana Nandi Certora, inc., Max Willsey University of Washington, Amy Zhu University of Washington, Yisu Remy Wang University of Washington, Brett Saiki University of Washington, Adam Anderson University of Washington, Adriana Schulz University of Washington, Dan Grossman University of Washington, Zachary Tatlock University of Washington DOI
19:05 15m Talk		Semantic Programming by Example with Pre-trained ModelsVirtual OOPSLA Gust Verbruggen KU Leuven, Vu Le Microsoft, Sumit Gulwani Microsoft DOI
19:20 15m Talk		One Down, 699 to Go: or, Synthesising Compositional DesugaringsVirtual OOPSLA Sándor Bartha University of Edinburgh, James Cheney University of Edinburgh; Alan Turing Institute, Vaishak Belle University of Edinburgh; Alan Turing Institute DOI
19:35 15m Talk		Multi-modal Program Inference: A Marriage of Pre-trained Language Models and Component-Based SynthesisIn-Person OOPSLA Kia Rahmani Purdue University, Mohammad Raza Microsoft, Sumit Gulwani Microsoft, Vu Le Microsoft, Daniel Morris Microsoft, Arjun Radhakrishna Microsoft, Gustavo Soares Microsoft, Ashish Tiwari Microsoft DOI Pre-print
19:50 20m Live Q&A		Discussion, Questions and Answers OOPSLA