Papers similar to 2606.14686

~ similar to 2606.14686· 19 results

cs.CVcs.AIRecentJun 1, 2026

Attention mechanisms and transfer learning for robust peach leaf damage classification under domain shift

Adrián Cánovas-Rodriguez, Miguel A. González-Illán, Maria Fernanda García-Cruz, Pedro Nortes Tortosa +4 more

The paper proposes an attention-enhanced deep learning framework using EfficientNet and CBAM to achieve high accuracy (93.3%) in classifying peach leaf damage, demonstrating improved robustness under…

View →

cs.CVcs.AIcs.LGRecentMay 30, 2026

CAFOSat: A Strongly Annotated Dataset for Infrastructure-Aware CAFO Mapping Using High-Resolution Imagery

Oishee Bintey Hoque, Nibir Chandra Mandal, Mandy L Wilson, Samarth Swarup +2 more

The paper introduces CAFOSat, a large-scale, strongly annotated, and infrastructure-aware dataset designed to improve the accuracy of mapping Concentrated Animal Feeding Operations (CAFOs) from high-r…

View →

cs.CVcs.AIRecentMay 28, 2026

A Novel Global Context-aware Deep Neural Network for Enhanced Brain Tumor Segmentation using Magnetic Resonance Images

Sourjya Mukherjee, Ananya Bhattacharjee, R. Murugan

The paper proposes a novel Global Context-aware Squeeze and Excite Residual UNet (GCSER-UNet) network, which significantly enhances brain tumor segmentation accuracy on benchmark MRI datasets.

View →

cs.CVRecentJun 1, 2026

Cross-Domain Dead Tree Detection via Knowledge Distillation in Aerial Imagery

Anis Ur Rahman, Mete Ahishali, Einari Heinaro, Samuli Junttila

The paper introduces a knowledge distillation framework to adapt a dead tree detection model trained on one geographical area (Finland) to multiple diverse forest types (Poland, Germany, Estonia), ach…

View →

cs.CVcs.AIRecentMay 29, 2026

Simple Token-Efficient Vision-Language Model for Case-level Pathology Synoptic Report Generation

Zhiyuan Yang, Jiahao Cheng, Vincent Quoc-Huy Trinh, Mahdi S. Hosseini

The paper introduces a simple, token-efficient vision-language model for generating comprehensive pathology synoptic reports from multiple whole-slide images (WSIs), achieving high performance while s…

View →

cs.CVRecentJun 1, 2026

ToolFG: Towards Well-Grounded Fine-Grained Image Classification

Yu Xue, Haoxuan Qu, Zhuoling Li, Yihang Lou +3 more

The paper introduces ToolFG, a novel tool-integrated MLLM framework that enhances fine-grained image classification by enabling models to autonomously use external tools to gather verifiable visual cu…

View →

cs.CVcs.LGeess.IVRecentJun 3, 2026

An Open-Source Two-Stage Computer Vision Pipeline for Fine-Grained Vehicle Classification using Vision Transformers

Gandhimathi Padmanaban, Fred Feng

This paper presents an open-source computer vision pipeline for classifying vehicle body types from naturalistic roadway video.

View →

cs.CVcs.AIcs.LGRecentMay 27, 2026

Do We Really Need Quantum Machine Learning?: A Multidimensional Empirical Study

Sudip Vhaduri, Ryan Gammon, Sayanton Dibbo

This study empirically benchmarks classical and quantum machine learning models for image recognition, finding that while quantum models offer superior accuracy and resource efficiency at high dimensi…

View →

cs.CVcs.CRRecentApr 29, 2026

Privacy-Preserving Clothing Classification using Vision Transformer for Thermal Comfort Estimation

Tatsuya Chuman, Yousuke Udagawa, Hitoshi Kiya

This paper introduces a novel Vision Transformer (ViT)-based method for privacy-preserving clothing classification that accurately estimates clothing insulation for secure occupant-centric control sys…

View →

cs.AIRecentMay 31, 2026

Brain-Atlas-Guided Generative Counterfactual Attention for Explainable Cognitive Decline Diagnosis Using Multimodal Connectomes

Xiongri Shen, Jiaqi Wang, Zhenxi Song, Yi Zhong +4 more

The paper proposes a novel Generative Counterfactual Attention-guided Network (GCAN) that uses multimodal connectomes and brain atlas knowledge to provide explainable and highly accurate diagnosis of…

View →

cs.CVcs.AIRecentMay 31, 2026

Data Collection for Training Quality-Control AI in Carpet Manufacturing

Akbar Erkinov

The paper proposes an end-to-end, deployable blueprint for an in-line machine-vision system that not only inspects carpet defects in real-time but also systematically collects and labels defect data t…

View →

cs.CVRecentJun 1, 2026

Chroma Clues: Leveraging Color Statistics to Detect Synthetic Images

Lea Uhlenbrock, Davide Cozzolino, Christian Riess

This paper proposes using color statistics, specifically through novel color transformations, to detect AI-generated synthetic images by exploiting the color-imitation weaknesses of current generative…

View →

cs.CVcs.CRcs.SIRecentMay 14, 2026

Can Visual Mamba Improve AI-Generated Image Detection? An In-Depth Investigation

Mamadou Keita, Wassim Hamidouche, Hessen Bougueffa Eutamene, Abdelmalik Taleb-Ahmed +2 more

This study systematically evaluates Vision Mamba models for detecting AI-generated images, finding that while they show promise, their current strengths and limitations must be understood relative to…

View →

cs.CRcs.AIcs.CVRecentApr 6, 2026

SE-Enhanced ViT and BiLSTM-Based Intrusion Detection for Secure IIoT and IoMT Environments

Afrah Gueriani, Hamza Kheddar, Ahmed Cherif Mazari, Seref Sagiroglu +1 more

The paper proposes an SE ViT-BiLSTM hybrid model for enhanced intrusion detection in IIoT and IoMT environments, achieving superior performance on real-world datasets, especially after data balancing.

View →

cs.CRRecentApr 22, 2026

Image-Based Malware Type Classification on MalNet-Image Tiny: Effects of Multi-Scale Fusion, Transfer Learning, Data Augmentation, and Schedule-Free Optimization

Ahmed A. Abouelkhaire, Waleed A. Yousef, Issa Traor

The paper investigates improving 43-class malware type classification on MalNet-Image Tiny by evaluating the combined effects of multi-scale feature fusion, transfer learning, advanced data augmentati…

View →

cs.CVcs.AIcs.LGRecentMay 28, 2026

Genetically Aligned Patient Representations Improve Hematological Diagnosis

Muhammed Furkan Dasdelen, Fatih Ozlugedik, Ilaria Looser, Rao Muhammad Umer +2 more

The paper introduces a novel framework that aligns single white blood cell images with genetic data (karyotype and somatic mutations) to significantly improve the diagnosis of blood cancers, outperfor…

View →

cs.CVcs.AIcs.LGRecentJun 1, 2026

A Structured Benchmark for Text-Guided Anomaly Detection: When Language Stops Conditioning the Decision

Stefano Samele, Eugenio Lomurno, Teodora Jovanovic, Sanjay Shivakumar Manohar +2 more

The paper introduces a structured benchmark (TGAD) showing that current text-guided anomaly detection models often overstate their language conditioning, as performance significantly degrades when the…

View →

cs.CVRecentJun 1, 2026

GloResNet: A lightweight 3D CNN with global topological features for preterm brain injury prediction

Boyu Yuan, Jiamiao Lu, Weichuan Zhang, Benqing Wu +4 more

The paper proposes GloResNet, a lightweight 3D CNN that effectively predicts brain injury in preterm infants using T2-weighted MRI, achieving an average accuracy of 75.18%.

View →

cs.LGcs.CVRecentJun 1, 2026

Entropy Minimization without Model Collapse: Mitigating Prediction Bias in Medical Imaging

Tim Nielen, Sameer Ambekar, Johannes Kiechle, Daniel M. Lang +1 more

This paper identifies prediction bias, a failure mode of entropy minimization in test-time adaptation, and proposes Distribution Shift Bias Reduction (DSBR) to stabilize adaptation and prevent model c…

View →