Papers similar to 2605.30631 · 16 results

cs.CV · Recent · Jun 1, 2026

Improving Combined Detection and Classification of TEM Defects via Mask-Conditioned Latent Diffusion Augmentation

Ni Li, Nuohao Liu, Ryan Jacobs, Ajay Annamareddy +4 more

The paper proposes using a mask-conditioned latent diffusion model to generate synthetic, labeled TEM images for data augmentation, achieving small but measurable performance improvements in defect de…


cs.LG, cs.CV · Recent · Jun 1, 2026

Entropy Minimization without Model Collapse: Mitigating Prediction Bias in Medical Imaging

Tim Nielen, Sameer Ambekar, Johannes Kiechle, Daniel M. Lang +1 more

This paper identifies prediction bias, a failure mode of entropy minimization in test-time adaptation, and proposes Distribution Shift Bias Reduction (DSBR) to stabilize adaptation and prevent model c…


cs.CV, cs.AI, cs.CL · Recent · May 29, 2026

Generating Reports or Repeating Templates? Measuring and Mitigating Template Collapse in 3D CT Report Generation

Tom Maye-Lasserre, Yitong Li, Bailiang Jian, Morteza Ghahremani +2 more

The paper addresses 'Template Collapse' in 3D CT report generation, where models generate generic reports, by proposing CLarGen, a decoupled framework that significantly improves clinical accuracy and d…


cs.CV, cs.AI, cs.LG · Recent · May 27, 2026

VDSB-GWSyn: Diffusion Schrödinger Bridge for Controllable and Anatomically Feasible Guidewire Synthesis in Coronary Angiography

Haoyuan Tang, Zhuo Zhang, Jialin Li, Shuai Xiao +1 more

The paper proposes VDSB-GWSyn, a Diffusion Schrödinger Bridge framework, to synthesize controllable and anatomically feasible guidewire images on coronary angiography (CAG) scans, significantly improv…


cs.CR · Recent · Apr 2, 2026

Diffusion-Guided Adversarial Perturbation Injection for Generalizable Defense Against Facial Manipulations

Yue Li, Linying Xue, Kaiqing Lin, Hanyu Quan +4 more

The paper proposes AEGIS, a novel diffusion-guided method for injecting adversarial perturbations into the latent space to create generalizable and robust defenses against advanced facial deepfake man…


cs.CV, cs.LG · Recent · Jun 1, 2026

Hallucination-Aware Diffusion Sampling for Inverse Problems via Robust Prior Updates

Pengfei Jin, Yiqi Tian, Kailong Fan, Bingjie Qi +1 more

The paper introduces Robust Prior Update (RPU), a module that improves the faithfulness of diffusion-based inverse solvers by stabilizing the prior update step, thereby reducing measurement-conditione…


cs.CR · Recent · May 1, 2026

Repurposing Image Diffusion Models for Adversarial Synthetic Structured Data: A Case Study of Ground Truth Drift

Adam Arthur, Christopher Schwartz

The paper demonstrates that off-the-shelf image diffusion models, like Stable Diffusion, can be repurposed to generate synthetic structured data, posing a threat of ground truth drift in closed eviden…


cs.CV, cs.AI, cs.LG · Recent · May 30, 2026

Improving Visual Representation Alignment Generation with GRPO

Shentong Mo, Sukmin Yun

The paper proposes VRPO, a reinforcement learning-based optimization strategy that replaces static alignment losses in diffusion models, significantly improving both convergence and image fidelity.


eess.IV, cs.AI · Recent · May 28, 2026

A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging

Antony Jerald, Hemant K Aggarwal, Brian Nett, Avinash Gopal +3 more

The paper proposes a unified deep learning framework to synthesize contrast-phase-specific virtual monochromatic 50 keV images from single-energy CT (SECT) data, overcoming the hardware limitations of…


cs.LG, cs.CV · Recent · Jun 1, 2026

Measurement Geometry and Design for Trustworthy Generative Inverse Problems

Pengfei Jin, Na Li, Quanzheng Li

The paper proposes a measurement-geometry framework to quantify how well fixed measurement operators can distinguish between images generated by a prior, thereby guiding the design of more trustworthy…


cs.CV, cs.AI, cs.HC · Recent · May 30, 2026

CodeCytos: AI-assisted spatial molecular imaging analysis via code-augmented agent action space

Hung Q. Vo, Huy Q. Vo, Son T. Ly, Zhihao Wan +5 more

CodeCytos is a novel coding-based reasoning agent framework that enables dynamic, programmable interaction with spatial molecular imaging data, significantly improving the automation and customization…


cs.CV, cs.AI, cs.LG · Recent · May 30, 2026

DASH: Dual-Branch Score Distillation for Guidance-Calibrated Compact Diffusion Models

Abdullah Al Shafi, Kazi Saeed Alam, Sk Imran Hossain, Engelbert Mephu Nguifo

DASH introduces a dual-branch distillation framework to effectively compress class-conditional diffusion models by independently supervising both score branches, significantly preserving guidance fide…


cs.CV, cs.AI · Recent · May 30, 2026

Pre-Deployment Robustness Stress Testing for CT Segmentation Systems Using Clinically Motivated Multi-Corruption Augmentation

CholMin Kang, Jonghyun Chung, Amanpreet Kaur, Nagesh Gulkotwar +1 more

The paper proposes RAMP, a multi-corruption augmentation framework, which significantly improves the robustness and reliability of CT segmentation deep learning models when deployed in real-world, deg…


cs.CV, cs.AI · Recent · Jun 1, 2026

Fast and Lightweight Novel View Synthesis with Differentiable Multiplane Image

Kaidi Zhang, Guanxu Zhu

The paper proposes a fast and lightweight novel view synthesis method using a differentiable Multiplane Image (MPI) representation, achieving significant speed and size improvements over state-of-the-…


cs.CV · Recent · Jun 1, 2026

Equilibrated Diffusion: Frequency-aware Textual Embedding for Equilibrated Image Customization

Liyuan Ma, Xueji Fang, Guo-Jun Qi

Equilibrated Diffusion introduces a frequency-aware approach to image customization, disentangling style and subject content embeddings to achieve superior subject fidelity and text adherence.


cs.CV, cs.AI, cs.LG · Recent · May 27, 2026

Residualized Temporal Sparse Autoencoders for Interpreting Diffusion Models

Calvin Yeung, Prathyush Poduval, Ali Zakeri, Zhuowen Zou +1 more

The paper introduces residualized temporal Sparse Autoencoders (SAEs) to analyze the full spatiotemporal structure of activations generated during the iterative denoising process of diffusion models,…
