~ similar to 2606.12235· 18 results
Lukas Einhaus, Natalie Maman, Julian Hoever, Andreas Erbslöh +1 more
The paper proposes a novel convolutional block and optimization algorithm to implement resource-efficient 1D-CNNs for atrial fibrillation detection on tiny smart sensor systems, achieving high accurac…
The paper proposes Family-Grouped Hierarchical Federated Learning (Family-FL) combined with a highly optimized Tiny CNN-LSTM model to enable privacy-preserving ECG monitoring on ultra-resource-constra…
The elasticAI.explorer is an extensible, unified Python framework that simplifies hardware-aware Neural Architecture Search (NAS) by decoupling search space definition from model implementation and de…
Haihang Xia, Xinyu Zhao, Xuecheng Wang, John Goodenough +4 more
This paper proposes and validates a novel hardware architecture, ITP-STDP, to significantly reduce the energy consumption and hardware overhead associated with training Spiking Neural Networks (SNNs).
The paper proposes a novel, highly secure real-time ECG monitoring framework that uses a patient's own ECG signal characteristics to generate unique, dynamic encryption keys, ensuring confidential dat…
This paper presents a unified framework for end-to-end co-design of neural network processors.
This paper presents a hardware-oriented description of GoldenFloat, a static-split floating-point family, and its concrete artefacts.
The paper introduces Chimera, a highly efficient and scalable MCU designed for ultra-low-power edge AI inference, achieving 3.1 TOPS/W by integrating a dedicated transformer accelerator and a QoS-guar…
The paper systematically characterizes the fault response of the Intel NCS2 accelerator to electromagnetic fault injection, revealing a major degradation mode that is undetectable by standard inferenc…
The paper proposes a Ferroelectric Charge-Domain Compute Cell (FCDC) using HZO memcapacitors to perform attention computation, achieving significant energy efficiency gains, especially for long-reside…
The paper introduces ArrythML, a highly efficient autoencoder-based TinyML model that enables accurate, low-power arrhythmia detection directly on resource-constrained embedded wearable devices.
Junyi Yang, Shuai Dong, Zhengnan Fu, Hongyang Shang +1 more
The paper proposes a highly reconfigurable 256x128 in-memory computing array that significantly improves efficiency and performance for analog computing by introducing novel components for ADC, weight…
This study empirically benchmarks classical and quantum machine learning models for image recognition, finding that while quantum models offer superior accuracy and resource efficiency at high dimensi…
O-POPE is a novel outer-product engine that accelerates floating-point GEMM by repurposing FPU pipeline registers as buffers, achieving high utilization and improved energy efficiency.
This paper analyzes the impact of long-term and short-term transistor aging on Deep Neural Network (DNN) inference accuracy and proposes an aging-aware retraining methodology to maintain performance e…
Di Zhu, Yu Yvonne Wu, Hong Jia, Aaqib Saeed +2 more
VitalAgent is a novel tool-augmented agentic framework that significantly improves physiological monitoring from wearable health data by enabling both reactive question answering and proactive, long-t…
The paper introduces BLADEI, a hardware-accelerated framework that screens FPGA configuration bitstreams for anomalies in real-time, overcoming the latency bottleneck of traditional software-based det…
Davood Fattahi, Runze Yan, Saurabh Kataria, Zhaoliang Chen +1 more
This paper proposes a unified framework for inference-time augmentation to improve the robustness of physiological signal classification in real-world deployments.