Prog. Synthesis

Automated programming, code generation, and program induction

20 papers indexed

cs.PLcs.AIcs.LGTheoreticalRecentJul 20, 2026

Decode-Time Grammars: Constrained LLM Generation over a Refinement Order of Grammar Fragments

Shuoming Zhang, Ruiyuan Xu, Haofeng Li, Qiuchu Yu +6 more

This paper introduces decode-time grammars to ensure grammatical and semantic correctness of code generated by large language models, particularly for low-resource programming surfaces.

View →

cs.CLcs.SERecentMay 29, 2026

Combinatorial Synthesis: Scaling Code RLVR via Atomic Decomposition and Recombination

Jiasheng Zheng, Boxi Cao, Boxi Yu, Yuzhong Zhang +5 more

The paper introduces Atomic Decomposition and Recombination (ADR), a novel framework that generates genuinely novel and challenging verifiable code tasks, significantly improving the scalability of Re…

View →

cs.AIRecentJun 1, 2026

Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization

Yusuke Ohtsubo, Kota Dohi, Koichiro Yawata, Koki Takeshita +1 more

The paper proposes a visual program synthesis framework using a VLM to generate accurate training data for semiconductor inspection, mitigating the sim-to-real gap by applying input binarization to st…

View →

cs.PLTheoreticalRecentJul 20, 2026

Weakly Non-Negative Supermartingales for Omega-Regular Verification

Toru Takisaka, Hongjie Qing, Libo Zhang

The paper introduces lazy Streett supermartingales and their lexicographic extension to certify almost-sure satisfaction of omega-regular properties with polynomial templates under a broad class of sa…

View →

cs.CRcs.AIcs.LGRecentMay 22, 2026

An Empirical Evaluation of LLM-Generated Code Security Across Prompting Methods

Mohammed Kharma, Ahmed Sabbah, Mohammad Alkhanafseh, Mohammad Hammoudeh +1 more

The paper empirically evaluates the security quality of LLM-generated code across various prompting methods, finding that while prompting alters the structure of weaknesses, it is insufficient to reli…

View →

cs.PLcs.LGTheoreticalRecentJul 8, 2026

GradInf: Gradient Estimation as Probabilistic Inference

Gaurav Arya, Mathieu Huot, Moritz Schauer, Alexander K. Lew +1 more

This paper introduces gradient inference, a new approach to developing sound and efficient gradient estimators for probabilistic programs by reducing gradient estimation to a related probabilistic inf…

View →

cs.PLcs.DSTheoreticalRecentJul 9, 2026

Potential Functions as Types

Harrison Grodin, Ethan Chu, Runming Li, Jan Hoffmann +1 more

This paper presents Calf, a dependent type theory for cost verification, which synthesizes the physicist's and banker's views on amortized analysis using potential functions and credit annotations.

View →

cs.CRRecentApr 18, 2026

False Security Confidence in Benign LLM Code Generation

Xiaolei Ren

The paper introduces False Security Confidence (FSC), a new metric to measure the inherent prevalence of security vulnerabilities in code generated by LLMs that are otherwise functionally correct, eve…

View →

cs.CRRecentMar 26, 2026

ALPS: Automated Least-Privilege Enforcement for Securing Serverless Functions

Changhee Shin, Bom Kim, Seungsoo Lee

ALPS is an automated, vendor-agnostic framework that enforces least privilege in serverless functions by analyzing code and generating precise security policies, achieving high coverage and significan…

View →

cs.SEcs.PLEmpiricalRecentJun 19, 2026

KBSpec: LLM-driven Formal Specification Generation with Evolving Domain Knowledge Base

Wenhan Wang, Zeyu Sun

Proposed method, KBSpec, uses external and internal knowledge to improve formal specification generation by LLMs, increasing verification pass rates and producing more high-completeness specifications…

View →

cs.AIcs.SEEmpiricalRecentJun 18, 2026

AutoACSL: Synthesizing ACSL Specifications by Integrating LLMs with CPG-Based Static Analysis

Han Zhou, Yu Luo, Dianxiang Xu

AutoACSL is a framework that uses Code Property Graphs and large language models to generate formal specifications for C programs with improved success ratio and full proof ratio.

View →

cs.SEcs.AIRecentMay 31, 2026

FVSpec: Real-World Property-Based Tests as Lean Challenges

Quinn Dougherty, Max von Hippel, Hazel Shackleton, Mike Dodds

The paper introduces FVSpec, a large-scale benchmark that translates thousands of real-world Python property-based tests into formal Lean 4 specifications to evaluate AI models for formal software ver…

View →

cs.LGcs.AIcs.NERecentMay 28, 2026

Evolving Features vs Evolving Entire Trees with GP for Interpretable Survival Analysis

Thalea Schlender, Peter A. N. Bosman, Tanja Alderliesten

This paper proposes using genetic programming (GP) to jointly evolve both the feature sets and the structure of survival trees, resulting in highly interpretable and high-performing shallow models for…

View →

cs.SEcs.AIRecentMay 28, 2026

Inferring Code Correctness from Specification

Tambon Florian, Papadakis Mike

The paper introduces TRAILS~, a novel method that improves code correctness validation by grounding LLM reasoning in concrete (input, output) pairs derived from specifications, achieving state-of-the-…

View →

cs.AIcs.LGRecentMay 30, 2026

MOSAIC: Modular Orchestration for Structured Agentic Intelligence and Composition

Yifan Bao, Xinyu Xi, Xinyu Liu, Wen Ge +7 more

MOSAIC introduces a structured agentic framework that treats automated data science as a staged, context-grounded model selection problem, improving performance and traceability over traditional AutoM…

View →

cs.CRcs.AIcs.SERecentMay 15, 2026

Detecting Privilege Escalation in Polyglot Microservices via Agentic Program Analysis

Penghui Li, Hong Yau Chong, Yinzhi Cao, Junfeng Yang

The paper introduces Neo, an agentic program analysis framework that successfully detects zero-day privilege escalation vulnerabilities in complex, polyglot microservices by combining LLMs with advanc…

View →

cs.DCEmpiricalRecentJun 30, 2026

Performance Analysis in Parallel Programming Education: A Comparative Usability Study

Anna-Lena Roth, David James, Jonas Posner, Michael Kuhn

The paper introduces EduMPI, a learning support tool for simplifying cluster usage and performance analysis of MPI parallel programs for students.

View →

cs.ARcs.AIcs.LGEmpiricalRecentJun 11, 2026

VHDLSuite: Unified Pipeline for LLM VHDL Generation with Data Synthesis and Evaluation

Yijun Shen, Minghao Shao, Yichen Zhao, Zhuoyan Yu +3 more

The paper introduces VHDLSuite, an infrastructure for evaluating Large Language Models in VHDL, including a data pipeline, benchmark, and evaluation framework.

View →

cs.PLcs.OSEmpiricalRecentJul 27, 2026

KernelScript: Cross-Boundary Typed DSL for eBPF Applications

Cong Wang, Siyuan Sun, Yusheng Zheng

The paper introduces KernelScript, a DSL for eBPF that types maps, program handles, and execution domains to unify cross-boundary relationships, prevent bugs at compile time, and reduce diff size.

View →

cs.CRcs.CLRecentMay 1, 2026

SRTJ: Self-Evolving Rule-Driven Training-Free LLM Jailbreaking

Jindong Li, Ying Liu, Yali Fu, Jinjing Zhu +3 more

The paper proposes SRTJ, a Self-Evolving Rule-Driven Training-Free Jailbreak framework that systematically discovers and refines attack strategies using rule composition and feedback to achieve robust…

View →