Semantic Programming by Example with Pre-trained Models - Gust Verbruggen

Gust Verbruggen, Senior AI researcher and member of the PROSE team at Microsoft, discusses his paper "Semantic Programming by Example with Pre-trained Models," which introduces a framework for integrating inductive program synthesis with large language models.


The project emerged from an attempt to extend Flash Fill-style program synthesis beyond purely syntactic string transformations. Motivated by the limitations of symbolic systems, especially their inability to access semantic knowledge unless it is manually encoded, Verbruggen and collaborators explored how GPT-3 could serve as a semantic oracle within the PROSE framework. The result is a neurosymbolic architecture that preserves the efficiency and guarantees of symbolic synthesis while selectively delegating semantic subproblems to a language model.
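To make the syntactic/semantic split concrete, here is a minimal Python sketch of a program that mixes both kinds of operators. It is an illustration only, not the PROSE API or the paper's implementation: the names `toy_oracle`, `split_field`, `semantic_map`, and `build_program` are hypothetical, and a hard-coded lookup table stands in for the language model.

```python
from typing import Callable, Dict, List, Tuple

# Assumption: a fixed table plays the role of the language model.
CITY_TO_COUNTRY: Dict[str, str] = {"Paris": "France", "Tokyo": "Japan", "Lima": "Peru"}


def toy_oracle(examples: List[Tuple[str, str]], query: str) -> str:
    """Pretend few-shot model call; this stub ignores the examples."""
    return CITY_TO_COUNTRY.get(query, "?")


def split_field(text: str, sep: str, index: int) -> str:
    """Syntactic operator: pick one field after splitting on a separator."""
    return text.split(sep)[index].strip()


def semantic_map(examples: List[Tuple[str, str]],
                 oracle: Callable[[List[Tuple[str, str]], str], str]) -> Callable[[str], str]:
    """Semantic operator: a mapping defined by examples and answered by the oracle."""
    return lambda value: oracle(examples, value)


def build_program(examples: List[Tuple[str, str]]) -> Callable[[str], str]:
    """Compose syntactic extraction with one semantic lookup."""
    city_to_country = semantic_map(examples, toy_oracle)

    def program(row: str) -> str:
        city = split_field(row, ",", 0)   # purely syntactic
        year = split_field(row, ",", 1)   # purely syntactic
        return f"{city_to_country(city)} ({year})"  # only this step needs semantics

    return program


program = build_program([("Paris", "France")])
print(program("Tokyo, 2021"))  # Japan (2021)
```

Only the city-to-country step reaches the oracle; the splitting and formatting stay ordinary string operations.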


In This Episode -


• Limitations of both program synthesis and LLMs

• Programming by example

• Syntactic versus semantic

• Integrating GPT-3 as semantic operators

• Semantic map, position, and condition operators

• Deductive backpropagation in PROSE

• Deferred query execution for efficiency

• Greedy clustering to control search explosion

• Ranking programs to minimize semantic calls


References

• https://www.microsoft.com/en-us/research/group/prose/

• https://www.microsoft.com/en-us/research/project/prose-framework/

• https://www.dagstuhl.de/en/seminars/seminar-calendar

• Sumit Gulwani's Flash Fill talk: https://youtu.be/421gU482xFE


About the Paper -


"Semantic Programming by Example with Pre-trained Models"

Gust Verbruggen, Vu Le, Sumit Gulwani

Proceedings of the ACM on Programming Languages (OOPSLA), 2021


This paper presents a framework for augmenting inductive program synthesis with semantic operators powered by large language models. By decomposing tasks into syntactic and semantic subproblems, the system delegates only the irreducibly semantic components to a pre-trained model, while maintaining symbolic guarantees elsewhere. A deferred query execution strategy allows efficient learning without excessive model calls.
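One way to picture deferred query execution is a semantic operator that only records examples while a program is being learned and contacts the model later, when the program runs on a new input. The Python sketch below is a toy reading of that idea under stated assumptions, not the paper's algorithm: `DeferredSemanticMap`, `counting_model`, and the prompt format are hypothetical, and the shortcut of answering directly from recorded examples is an added simplification.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class DeferredSemanticMap:
    """Holds examples gathered during learning; calls the model only in run()."""
    examples: List[Tuple[str, str]] = field(default_factory=list)

    def learn(self, source: str, target: str) -> None:
        # No model call here: the pair is stored as a future few-shot example.
        self.examples.append((source, target))

    def run(self, value: str, model: Callable[[str], str]) -> str:
        # Assumption: reuse a recorded example when the input was already seen.
        for source, target in self.examples:
            if source == value:
                return target
        prompt = "".join(f"{s} -> {t}\n" for s, t in self.examples) + f"{value} ->"
        return model(prompt)


calls: List[str] = []


def counting_model(prompt: str) -> str:
    """Hypothetical model stub that logs every prompt it receives."""
    calls.append(prompt)
    return "Japan" if prompt.endswith("Tokyo ->") else "?"


op = DeferredSemanticMap()
op.learn("Paris", "France")
op.learn("Lima", "Peru")
calls_during_learning = len(calls)        # learning made no model calls
answer = op.run("Tokyo", counting_model)  # the first model call happens here
```

In this sketch the cost of the model is paid once per unseen input at run time, rather than once per candidate program during search.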


https://dl.acm.org/doi/10.1145/3485477


About the Guest -


Gust Verbruggen is a researcher at KU Leuven and a member of Microsoft’s PROSE team. His work focuses on program synthesis, data wrangling, and neurosymbolic integration, particularly in real-world automation settings such as spreadsheets and code refactoring tools.

• https://www.microsoft.com/en-us/research/people/gverbruggen/

• https://scholar.google.com/citations?user=TmU3sKMAAAAJ&hl=en


Credits -


• Host & Music: Bryan Landers, Technical Staff, Ndea

• Editor: Alejandro Ramirez

• https://x.com/ndea

• https://x.com/bryanlanders

• https://ndea.com


