~ similar to 2606.02498· 20 results
The paper proposes a novel ResNet-34 encoder with a lightweight decoder for highly accurate and computationally efficient segmentation of complex fetal brain structures in MRI.
The paper proposes a novel Global Context-aware Squeeze and Excite Residual UNet (GCSER-UNet) network, which significantly enhances brain tumor segmentation accuracy on benchmark MRI datasets.
Tim Nielen, Sameer Ambekar, Johannes Kiechle, Daniel M. Lang +1 more
This paper identifies prediction bias, a failure mode of entropy minimization in test-time adaptation, and proposes Distribution Shift Bias Reduction (DSBR) to stabilize adaptation and prevent model c…
Xiongri Shen, Jiaqi Wang, Zhenxi Song, Yi Zhong +4 more
The paper proposes a novel Generative Counterfactual Attention-guided Network (GCAN) that uses multimodal connectomes and brain atlas knowledge to provide explainable and highly accurate diagnosis of…
Kjersti Engan, Neel Kanwal, Anita Yeconia, Ladislaus Blacy +3 more
The paper introduces FHRFormer, a masked transformer-based autoencoder designed to accurately reconstruct missing and forecast fetal heart rate (FHR) time-series data, thereby enabling robust AI-based…
Tengfei Zhang, Ziheng Zhao, Lisong Dai, Xiaoman Zhang +4 more
This paper introduces MedReCo and MedReCo-VLM, a framework that enables entity-aware cross-image reasoning for medical imaging, allowing AI to compare current scans with prior studies and analogous ca…
Hwa Hui Tew, Junn Yong Loo, Fang Yu Leong, Julia K. Lau +5 more
The paper introduces Dual-Spectral Flow Matching (DSFM), a novel generative framework that uses wavelet and cosine transforms to synthesize highly realistic, non-stationary fMRI time series for improv…
KidsNanny is a two-stage multimodal content moderation pipeline that achieves high accuracy and efficiency in detecting child safety threats, particularly excelling in text-embedded content.
Zihan Li, Jialan Zheng, Ziyu Li, Xun Yuan +17 more
The paper introduces PIGMENT, a physics-informed foundation model that enables reliable quantitative mapping of brain microstructure from extremely sparse or challenging diffusion MRI scans.
Jeyeon Eo, Joo Young Kim, Ran Ju, Minyoung Jung +1 more
BuddyBench introduces a novel, privacy-constrained multi-task benchmark that integrates longitudinal learning trajectories, standardized clinical assessments, and randomized trial data to advance pedi…
The paper addresses 'Template Collapse' in 3D CT report generation—where models generate generic reports—by proposing CLarGen, a decoupled framework that significantly improves clinical accuracy and d…
The paper proposes EEG-FuseFormer, a transformer-based framework that fuses features from CNN-LSTM and ResNet-18 to achieve high accuracy in predicting seizure onset from EEG signals.
Baris Karacan, Vaibhav Bhargava, Barbara Di Eugenio, Natalie Parde +20 more
The paper introduces a supervised fine-tuning pipeline using large language models to accurately categorize sentence-level clinical provenance across multi-disciplinary hospital notes, demonstrating t…
This paper presents an open-source computer vision pipeline for classifying vehicle body types from naturalistic roadway video.
The paper proposes GraD-IBD, a graph-based model that reformulates longitudinal ICD diagnosis codes into temporally directed graphs to efficiently and accurately detect the risk of Inflammatory Bowel…
Adrián Cánovas-Rodriguez, Miguel A. González-Illán, Maria Fernanda García-Cruz, Pedro Nortes Tortosa +4 more
The paper proposes an attention-enhanced deep learning framework using EfficientNet and CBAM to achieve high accuracy (93.3%) in classifying peach leaf damage, demonstrating improved robustness under…
The paper demonstrates that clinical vision-language models (VLMs) pose a significant privacy risk by allowing de-identified images to be re-linked to original reports, and proposes a targeted differe…
The paper introduces Geodesic Flow Matching, a manifold-aware denoising technique that adapts Riemannian transport dynamics to accurately clean high-dimensional structured representations like Spatial…
GLiGuard introduces a compact, schema-conditioned bidirectional encoder that achieves state-of-the-art performance in LLM content moderation across multiple safety dimensions while drastically reducin…
Thierry Judge, Nicolas Duchateau, Andreas Østvik, Khuram Faraz +12 more
The paper introduces a novel simulation strategy that integrates speckle decorrelation measures from real videos to create a photorealistic dataset, enabling a deep learning algorithm that achieves st…