Program synthesis for automated data wrangling applies synthesis techniques to generate executable scripts or queries (for example, SQL statements, pandas code, or regular expressions) that transform raw, messy data into a clean, analysis-ready format. The specification is typically given as input-output examples (e.g., a user demonstrates the transformation on a few rows), natural language descriptions, or constraints on the desired output schema. The synthesizer searches for a program consistent with that specification, automating much of the tedious, error-prone work of data cleaning and feature engineering by translating user intent into executable code.
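
To make the input-output-example case concrete, the following is a minimal sketch of enumerative programming by example, assuming a tiny string-cleaning DSL. The primitive set and the names `PRIMITIVES`, `run`, and `synthesize` are hypothetical and chosen only for illustration. Practical synthesizers use far richer DSLs and prune the search rather than enumerating exhaustively.

```python
from itertools import product

# Hypothetical mini-DSL: each primitive is a (name, function) pair over strings.
PRIMITIVES = [
    ("strip", str.strip),
    ("lower", str.lower),
    ("upper", str.upper),
    ("title", str.title),
    ("first_word", lambda s: s.split()[0] if s.split() else s),
    ("last_word", lambda s: s.split()[-1] if s.split() else s),
]


def run(program, value):
    """Apply a sequence of primitives to a string, left to right."""
    for _, fn in program:
        value = fn(value)
    return value


def synthesize(examples, max_len=3):
    """Return the first shortest primitive sequence that maps every
    example input to its expected output, or None if none is found."""
    for length in range(1, max_len + 1):
        # Brute-force enumeration of all sequences of the given length.
        for program in product(PRIMITIVES, repeat=length):
            if all(run(program, inp) == out for inp, out in examples):
                return program
    return None


# The user demonstrates the desired transformation on two messy rows.
examples = [("  alice SMITH ", "Smith"), ("bob  jones", "Jones")]
program = synthesize(examples)
names = [name for name, _ in program]
print(names)  # ['title', 'last_word']

# The synthesized program generalizes to a row the user never demonstrated.
print(run(program, " carol DAVIS"))  # Davis
```

Two examples are enough here because the search space is tiny. With a larger DSL, many programs typically fit the same examples, so real systems also need a ranking strategy to choose among them.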
