Papers similar to 2606.00109

~ similar to 2606.00109· 19 results

cs.CRRecentApr 2, 2026

Diffusion-Guided Adversarial Perturbation Injection for Generalizable Defense Against Facial Manipulations

Yue Li, Linying Xue, Kaiqing Lin, Hanyu Quan +4 more

The paper proposes AEGIS, a novel diffusion-guided method for injecting adversarial perturbations into the latent space to create generalizable and robust defenses against advanced facial deepfake man…

View →

cs.AIRecentMay 28, 2026

Robust and Generalizable Safety Steering for Text-to-Image Diffusion Transformers

Zihao Xue, Yan Wang, Zhen Bi, Long Ma +6 more

The paper proposes SafeDIG, a robust safety steering framework that adapts Diffusion Transformers for text-to-image generation by treating safety control as position-aware sparse feature transfer, ens…

View →

cs.CRcs.LGRecentMay 13, 2026

DiffusionHijack: Supply-Chain PRNG Backdoor Attack on Diffusion Models and Quantum Random Number Defense

Ziyang You, Liling Zheng, Xiaoke Yang, Xuxing Lu

The paper introduces DiffusionHijack, a supply-chain backdoor attack that compromises the PRNG used by diffusion models to deterministically control generated images, which is successfully mitigated b…

View →

cs.CVcs.AIcs.LGRecentMay 28, 2026

Controllable Lung Nodule Synthesis via Histogram-Regularized Latent Diffusion Models

Arunkumar Kannan, Yanbo Zhang, Han Liu, Michael Baumgartner +4 more

The paper introduces a histogram-regularized latent diffusion model to synthesize highly realistic and subtype-specific pulmonary nodules in 3D CT volumes, addressing the limitations of existing metho…

View →

cs.AIRecentJun 1, 2026

Bridging the Sim-to-Real Gap in Semiconductor Visual Program Synthesis via Input Binarization

Yusuke Ohtsubo, Kota Dohi, Koichiro Yawata, Koki Takeshita +1 more

The paper proposes a visual program synthesis framework using a VLM to generate accurate training data for semiconductor inspection, mitigating the sim-to-real gap by applying input binarization to st…

View →

cs.CVRecentJun 1, 2026

Improving Combined Detection and Classification of TEM Defects via Mask-Conditioned Latent Diffusion Augmentation

Ni Li, Nuohao Liu, Ryan Jacobs, Ajay Annamareddy +4 more

The paper proposes using a mask-conditioned latent diffusion model to generate synthetic, labeled TEM images for data augmentation, achieving small but measurable performance improvements in defect de…

View →

cs.CVcs.LGRecentJun 1, 2026

Hallucination-Aware Diffusion Sampling for Inverse Problems via Robust Prior Updates

Pengfei Jin, Yiqi Tian, Kailong Fan, Bingjie Qi +1 more

The paper introduces Robust Prior Update (RPU), a module that improves the faithfulness of diffusion-based inverse solvers by stabilizing the prior update step, thereby reducing measurement-conditione…

View →

cs.AIRecentMay 31, 2026

Brain-Atlas-Guided Generative Counterfactual Attention for Explainable Cognitive Decline Diagnosis Using Multimodal Connectomes

Xiongri Shen, Jiaqi Wang, Zhenxi Song, Yi Zhong +4 more

The paper proposes a novel Generative Counterfactual Attention-guided Network (GCAN) that uses multimodal connectomes and brain atlas knowledge to provide explainable and highly accurate diagnosis of…

View →

cs.AIcs.ARcs.CLRecentJun 2, 2026

StepPRM-RTL: Stepwise Process-Reward Guided LLM Fine-Tuning for Enhanced RTL Synthesis

Prashanth Vijayaraghavan, Apoorva Nitsure, Luyao Shi, Ehsan Degan +1 more

StepPRM-RTL is a novel framework that enhances LLM-based RTL code generation for digital hardware designs.

View →

cs.ARcs.PFRecentMay 30, 2026

Regular-Activation Concentration: Characterizing Column-Level Output Sparsity Across Diffusion Model Architectures

Dazhi Yang, Shafayat Mowla Anik, Byeong Kil Lee, Jeeho Ryoo

The paper systematically characterizes column-level activation sparsity across various diffusion model architectures, demonstrating that element-level sparsity metrics significantly overestimate the a…

View →

cs.AIcs.CLcs.LGRecentMay 28, 2026

SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations

Qinpei Luo, Ruichun Ma, Xinyu Zhang, Lili Qiu

The paper introduces SchGen, the first large language model capable of generating editable PCB schematics from natural language by using a novel semantically grounded code representation.

View →

cs.LGcs.AIcs.CERecentJun 1, 2026

On the Generalization in Topology Optimization via Sensitivity-Conditioned Bernoulli Flow Matching

Mohammad Rashed, Duarte F. Valoroso Madeira, Babak Gholami, Caglar Guerbuez +2 more

The paper proposes using pseudo-sensitivities, derived from adjoint sensitivity fields, as an optimal conditioning signal in a Bernoulli flow-matching framework to significantly improve the out-of-dis…

View →

cs.CRcs.CVRecentMay 7, 2026

Secure Seed-Based Multi-bit Watermarking for Diffusion Models from First Principles

Enoal Gesny, Eva Giboulot

The paper introduces a theoretically grounded evaluation framework for watermarking generative models, proposing a novel method (SSB) that allows for systematic design across all security-robustness-f…

View →

eess.IVcs.AIRecentMay 29, 2026

A physics-informed foundation model for quantitative diffusion MRI

Zihan Li, Jialan Zheng, Ziyu Li, Xun Yuan +17 more

The paper introduces PIGMENT, a physics-informed foundation model that enables reliable quantitative mapping of brain microstructure from extremely sparse or challenging diffusion MRI scans.

View →

cs.CVcs.AIeess.IVRecentJun 1, 2026

Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization

Jingyun Liang, Min Wei, Shikai Li, Yizeng Han +4 more

The paper proposes a novel render-free framework that conditions video diffusion models directly on compressed 3D human mesh tokens, enabling robust 3D-aware human motion control without relying on re…

View →

cs.LGcs.AIcs.CVRecentMay 27, 2026

Geometry-Correct Diffusion Posterior Sampling with Denoiser-Pullback Curvature Guidance and Manifold-Aligned Damping

Seunghyeok Shin, Minwoo Kim, Dabin Kim, Hongki Lim

The paper introduces a novel diffusion posterior sampling method that stabilizes and accelerates data-consistent sampling by replacing hand-tuned guidance weights with a per-noise-level, curvature-gui…

View →

eess.IVcs.AIRecentMay 28, 2026

A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging

Antony Jerald, Hemant K Aggarwal, Brian Nett, Avinash Gopal +3 more

The paper proposes a unified deep learning framework to synthesize contrast-phase-specific virtual monochromatic 50 keV images from single-energy CT (SECT) data, overcoming the hardware limitations of…

View →

stat.MLcs.AIcs.LGRecentMay 29, 2026

Routing on the Stiefel Manifold: When Does Adaptive Subspace Selection Help for Cross-Domain EEG Decoding?

Isabella Costa Maia, Pedro L. C. Rodrigues, Salem Said, Marco Congedo

The paper introduces dynamic Stiefel routing, a novel method that adaptively selects specialized subspace projection filters on the Stiefel manifold to improve cross-domain EEG decoding without requir…

View →

cs.CVcs.AIRecentJun 1, 2026

Fast and Lightweight Novel View Synthesis with Differentiable Multiplane Image

Kaidi Zhang, Guanxu Zhu

The paper proposes a fast and lightweight novel view synthesis method using a differentiable Multiplane Image (MPI) representation, achieving significant speed and size improvements over state-of-the-…

View →