~ similar to 2605.31284· 19 results
Lukas Johanns, Marilin Moor, Davide Panzeri, Yu Zhou +8 more
Agentic-J is a containerized, multi-agent AI assistant designed to enable biologists to perform complex, reproducible biological microscopy image analysis by specifying tasks in natural language.
The paper proposes a unified framework to systematically redefine instance matching for Panoptic Quality evaluation, moving beyond the standard One-to-One matching to accommodate complex scenarios lik…
Hung Q. Vo, Huy Q. Vo, Son T. Ly, Zhihao Wan +5 more
CodeCytos is a novel coding-based reasoning agent framework that enables dynamic, programmable interaction with spatial molecular imaging data, significantly improving the automation and customization…
Christian Gehrmann, Jonas Ricker, Simon Damm, Deruo Cheng +4 more
The paper introduces SAMSEM, a generalized and scalable model based on SAM2, which significantly improves metal line segmentation across diverse and unseen integrated circuit (IC) samples.
Zihan Li, Jialan Zheng, Ziyu Li, Xun Yuan +17 more
The paper introduces PIGMENT, a physics-informed foundation model that enables reliable quantitative mapping of brain microstructure from extremely sparse or challenging diffusion MRI scans.
Yiming Wang, Baiqi Wu, Qingming Li, Jiahao Chen +2 more
The paper proposes FLAME, a novel framework that detects AI-generated image forgeries by identifying intrinsic energy anomalies caused by the diffusion process, achieving state-of-the-art localization…
Adrián Cánovas-Rodriguez, Miguel A. González-Illán, Maria Fernanda García-Cruz, Pedro Nortes Tortosa +4 more
The paper proposes an attention-enhanced deep learning framework using EfficientNet and CBAM to achieve high accuracy (93.3%) in classifying peach leaf damage, demonstrating improved robustness under…
This paper systematically studies the robustness of vision foundation models to common image perturbations, finding that most models are generally non-robust and proposing a fine-tuning method to impr…
FLORO is a multimodal geospatial foundation model that learns transferable remote sensing representations from a small, diverse corpus, achieving strong performance across various sensor types and res…
Yu Xue, Haoxuan Qu, Zhuoling Li, Yihang Lou +3 more
The paper introduces ToolFG, a novel tool-integrated MLLM framework that enhances fine-grained image classification by enabling models to autonomously use external tools to gather verifiable visual cu…
This paper introduces BBOmix, an open-source benchmark for unsupervised representation learning on real-world biological data.
Guangsheng Zhang, Huan Tian, Leo Zhang, Tianqing Zhu +3 more
This paper systematically revisits and expands the threat model for backdoor attacks on semantic segmentation, proposing a unified framework (BADSEG) that demonstrates severe, previously overlooked vu…
The paper introduces SpatialBench-Long, a comprehensive benchmark designed to test AI agents' ability to perform end-to-end scientific reasoning and derive biological claims from complex, raw spatial…
The paper proposes FedSAP, a framework that stabilizes federated prototype learning by delaying global alignment and enforcing inter-class structure, significantly improving representation quality und…
This paper investigates the application of Parameter-Efficient Fine-Tuning (PEFT) methods, specifically adapters and LoRA, to large pretrained models for instance segmentation, demonstrating that thes…
The paper introduces Strong Stochastic Flow Maps (SSFMs), a novel framework that directly learns the strong solution map of additive-noise Stochastic Differential Equations (SDEs), enabling few-step s…
The paper introduces CAFOSat, a large-scale, strongly annotated, and infrastructure-aware dataset designed to improve the accuracy of mapping Concentrated Animal Feeding Operations (CAFOs) from high-r…
This study empirically benchmarks classical and quantum machine learning models for image recognition, finding that while quantum models offer superior accuracy and resource efficiency at high dimensi…
The paper introduces a simple, token-efficient vision-language model for generating comprehensive pathology synoptic reports from multiple whole-slide images (WSIs), achieving high performance while s…