Martin Vechev

5 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×4AI×4ML×2NLP×1

Frequent co-authors

Mark Vero4×

Fabian Kaczmarczyck2×

Ivan Petrov2×

Ilia Shumailov2×

Jamie Hayes2×

Niels Heinen2×

Research Timeline

2026

SecPI: Secure Code Generation with Reasoning Models via Security Reasoning Internalization

The paper introduces SecPI, a fine-tuning pipeline that teaches reasoning language models (RLMs) to autonomously internalize structured security reasoning, significantly improving secure code generation without requiring explicit security prompts at inference.

Every Bit, Everywhere, All at Once: A Binomial Multibit LLM Watermark

The paper proposes a novel binomial multibit LLM watermarking scheme that encodes every bit of a payload at every token position, achieving superior message accuracy and robustness compared to existing methods.

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

The paper introduces Honeyval, a comprehensive evaluation framework, to rigorously test LLM-powered HTTP honeypots, demonstrating that these systems provide substantially longer and harder-to-detect interactions compared to traditional methods.

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

The paper introduces Honeyval, a comprehensive evaluation framework, to rigorously test LLM-powered HTTP honeypots, demonstrating that these honeypots provide substantially longer and harder-to-detect interactions compared to traditional methods.

Learning from Saturated Data: Signals Beyond Correctness for LLM Training

The paper proposes using fine-grained quality signals, such as pairwise self-judgments and token-level entropy, instead of simple binary correctness to improve LLM performance on saturated datasets, showing significant gains on simple tasks but requiring careful calibration for complex ones.

Highlighted terms show continued research focus across papers

Papers

cs.CLRecentMay 31, 2026

Learning from Saturated Data: Signals Beyond Correctness for LLM Training

Hanno Hiss, Jasper Dekoninck, Martin Vechev

View →

cs.CRcs.AIcs.LGRecentMay 28, 2026