Papers similar to 2606.00140

~ similar to 2606.00140· 16 results

cs.AIRecentMay 27, 2026

Orthogonal Concept Erasure for Diffusion Models

Yuhao Sun, Lingyun Yu, Haoxiang Xu, Fengyuan Miao +2 more

The paper proposes Orthogonal Concept Erasure (OCE), a novel multiplicative parameter update method that achieves precise concept erasure in diffusion models while independently preserving overall gen…

View →

cs.CRRecentJun 1, 2026

CoreUnlearn: Rethinking Concept Unlearning through Disentangled Component-Level Erasure in Text-guided Diffusion Models

Mengnan Zhao, Lihe Zhang, Baocai Yin

CoreUnlearn introduces a novel framework that disentangles and removes undesirable concepts from text-guided diffusion models by targeting specific, erasure-critical components of the concept embeddin…

View →

cs.CVcs.CRRecentMar 27, 2026

Neighbor-Aware Localized Concept Erasure in Text-to-Image Diffusion Models

Zhuan Shi, Alireza Dehghanpour Farashah, Rik de Vries, Golnoosh Farnadi

The paper proposes Neighbor-Aware Localized Concept Erasure (NLCE), a training-free framework that effectively removes specific concepts from text-to-image models while minimizing the unintended degra…

View →

cs.CVcs.CRRecentApr 17, 2026

Beyond Text Prompts: Precise Concept Erasure through Text-Image Collaboration

Jun Li, Lizhi Xiong, Ziqiang Li, Weiwei Jiang +3 more

The paper introduces TICoE, a text-image collaborative framework that achieves precise and faithful concept removal from text-to-image generative models, surpassing existing methods in both precision…

View →

cs.CRcs.CVRecentMay 10, 2026

On the Generation and Mitigation of Harmful Geometry in Image-to-3D Models

Yule Liu, Yilong Yang, Jiale Teng, Hanze Jia +10 more

The paper systematically measures the risk of current image-to-3D models generating harmful geometries, finding that these models are effective at reconstruction and existing safeguards are insufficie…

View →

cs.CVcs.AIEmpiricalRecentJun 10, 2026

Reroute, Don't Remove: Recoverable Visual Token Routing for Vision-Language Models

Cheng-Yu Yang, Shao-Yuan Lo, Yu-Lun Liu

肖代替了视觉令牌的永久删除，通过可恢复的路由来改进视觉语言模型的性能

View →

cs.AIRecentMay 29, 2026

Geodesic Flow Matching for Denoising High-Dimensional Structured Representations

Karim Habashy, Chris Eliasmith

The paper introduces Geodesic Flow Matching, a manifold-aware denoising technique that adapts Riemannian transport dynamics to accurately clean high-dimensional structured representations like Spatial…

View →

cs.CVcs.AIeess.IVRecentJun 1, 2026

Towards 3D-Aware Video Diffusion Models: Render-Free Human Motion Control with Mesh Tokenization

Jingyun Liang, Min Wei, Shikai Li, Yizeng Han +4 more

The paper proposes a novel render-free framework that conditions video diffusion models directly on compressed 3D human mesh tokens, enabling robust 3D-aware human motion control without relying on re…

View →

cs.CVRecentJun 1, 2026

From Extrinsic to Intrinsic: Geodesic-Guided Representation Learning for 3D Geometric Data

Yuming Zhao, Junhui Hou, Qijian Zhang, Jia Qin +1 more

The paper introduces PRISM, a novel representation learning framework that learns isometric embeddings by explicitly modeling the intrinsic geodesic metric of 3D surfaces, achieving superior performan…

View →

cs.LGRecentJun 1, 2026

Why Are DMD Students Lazy? Understanding the Copying Behavior in Few-Step Distillation

Shucheng Li, Iolo Jones, Alexander Tong, Michael M. Bronstein

This paper investigates the phenomenon of 'copying' in Distribution Matching Distillation (DMD), finding that high-dimensional distillation causes student models to spontaneously reproduce the teacher…

View →

cs.ROcs.CVRecentJun 1, 2026

RoboDream: Compositional World Models for Scalable Robot Data Synthesis

Junjie Ye, Rong Xue, Basile Van Hoorick, Runhao Li +5 more

RoboDream introduces an embodiment-centric world model that synthesizes photorealistic, physically feasible robot demonstrations by decoupling motion generation from environment synthesis, significant…

View →

cs.SDcs.AIcs.IRRecentMay 29, 2026

Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation

Ioannis Prokopiou, Pantelis Vikatos, Maximos Kaliakatsos-Papakostas, Theodoros Giannakopoulos +1 more

The paper proposes an inference-time activation steering framework, utilizing orthogonalization, to achieve fine-grained, deterministic control over discrete musical attributes like Pitch and Duration…

View →

cs.CVRecentJun 1, 2026

Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models

Guangzhao He, Rundong Luo, Wei-Chiu Ma, Hadar Averbuch-Elor

The paper introduces Staged Executable Inverse Graphics (SEIG), an agentic framework that uses general-purpose Vision-Language Models (VLMs) to reconstruct editable 3D scenes directly into executable…

View →

cs.CRcs.CLRecentJun 2, 2026

PsychoPass: Geometric Profiling of Multi-Turn Adversarial LLM Conversations

Muberra Ozmen, Subhabrata Majumdar

The paper introduces PsychoPass, a framework that analyzes the geometric trajectory of multi-turn conversations in embedding space to detect adversarial intent early, before harmful content is generat…

View →

cs.CVRecentJun 1, 2026

Spatial-Temporal Decoupled Reference Conditioning for Identity-Preserving Text-to-Video Generation

Yuheng Chen, Teng Hu, Yuji Wang, Qingdong He +2 more

The paper proposes ST-DRC, a Spatial-Temporal Decoupled Reference Conditioning framework that effectively balances high-level semantic control and low-level identity fidelity for text-to-video generat…

View →

cs.CLRecentJun 1, 2026

Geometric Latent Reasoning Induces Shorter Generations in LLMs

Shashi Kumar, Yacouba Kaloga, Petr Motlicek, Ina Kodrasi +1 more

The paper introduces Geometric Latent Reasoning (GLR), a method that models reasoning as continuous paths in the embedding space, showing that this continuous approach allows LLMs to solve problems us…

View →