ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:

20 results for “Transformer framework”

CS papers only

Hybrid search: Keyword + semantic, ranked by combined score.ⓘ

Want pure semantic search? Try claim verification →

cs.LGRecentJun 1, 2026

EEG-FuseFormer: A Transformer-Driven Feature Fusion Framework for Seizure Onset Prediction

Vigneshwar Hariharan, Chithra Reghuvaran, Arlene John, Nhat Pham +3 more

The paper proposes EEG-FuseFormer, a transformer-based framework that fuses features from CNN-LSTM and ResNet-18 to achieve high accuracy in predicting seizure onset from EEG signals.

View →
cs.CRRecentApr 11, 2026

EncFormer: Secure and Efficient Transformer Inference over Encrypted Data

Yufan Zhu, Chao Jin, Khin Mi Mi Aung, Xiaokui Xiao

EncFormer is a novel two-party framework that significantly improves the efficiency and scalability of private Transformer inference by optimizing the combination of Fully Homomorphic Encryption (FHE)…

View →
cs.LGcs.AIcs.CCRecentMay 28, 2026

Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't

Anej Svete, William Merrill, Ryan Cotterell, Ashish Sabharwal

The paper analyzes the expressivity of padded transformers, proving that their computational power is primarily determined by model depth and numeric precision, rather than attention type or width.

View →
cs.CLRecentJun 1, 2026

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

Shihao Rao, Liang Li, Jiapeng Liu, Tong Lin +5 more

The paper introduces DocFormBench, a new benchmark for content-aware document formatting, and proposes DocFormFlow, a workflow that improves formatting accuracy and efficiency by decoupling target loc…

View →
cs.AIcs.CERecentMay 27, 2026

VFEAgent: A Multimodal Agent Framework for End-to-End Automated Finite Element Analysis

Jiachen Zhang, Junyi Lao, Chenghao Liu, Siyuan Liu +4 more

VFEAgent is a novel multi-agent framework that automates the entire Finite Element Analysis (FEA) workflow, achieving high success rates in generating complete and physically valid simulations directl…

View →
cs.AIRecentMay 28, 2026

Uncertainty-Aware Transfer Learning for Cross-Building Energy Forecasting: Toward Robust and Scalable District-Level Energy Management

Shadmehr Zaregarizi, Khashayar Yavari

The paper proposes an uncertainty-aware transfer learning framework using the Temporal Fusion Transformer (TFT) to achieve robust and scalable energy forecasting across different buildings, demonstrat…

View →
cs.LGcs.CLRecentMay 31, 2026

CART: Context-Anchored Recurrent Transformer -- A Parameter-Efficient Architecture with Learned Stability

Chad A. Capps

CART introduces a parameter-efficient recurrent transformer architecture that reuses a core block multiple times, but its performance does not surpass a dense baseline, suggesting that weight sharing…

View →
cs.LGcs.CLeess.SPRecentMay 31, 2026

Beyond Sinusoids: A Morlet Wavelet Framework for Transformer Positional Encoding

Athanasios Zeris

The paper introduces Morlet Positional Encoding (MoPE), a novel wavelet-based positional encoding that models position and locality simultaneously, outperforming standard sinusoidal and RoPE methods.

View →
cs.CVcs.AIRecentJun 1, 2026

Parameter-Efficient Fine-Tuning of Large Pretrained Models for Instance Segmentation Tasks

Nermeen Abou Baker, David Rohrschneider, Uwe Handmann

This paper investigates the application of Parameter-Efficient Fine-Tuning (PEFT) methods, specifically adapters and LoRA, to large pretrained models for instance segmentation, demonstrating that thes…

View →
cs.AIRecentMay 28, 2026

Formalizing Mathematics at Scale

Ahmad Rammal, Niket Patel, Fabian Gloeckle, Amaury Hayat +4 more

The paper introduces AutoformBot, a multi-agent system that successfully autoformalizes a large corpus of open-access graduate-level mathematics textbooks into a verified library in Lean 4, demonstrat…

View →
cs.CLRecentJun 1, 2026

PortBERT: Navigating the Depths of Portuguese Language Models

Raphael Scheible-Schmitt, Henry He, Armando B. Mendes

The paper introduces PortBERT, a family of RoBERTa-based language models for Portuguese, which achieves competitive performance while explicitly balancing efficiency and accuracy.

View →
cs.LGcs.AIRecentMay 27, 2026

A Multi-dimensional Framework for Evaluating Generalization in EEG Foundation Models

Aditya Kommineni, Emily Zhou, Kleanthis Avramidis, Tiantian Feng +1 more

The paper proposes a multi-dimensional evaluation framework to assess EEG foundation models under realistic low-resource conditions, finding that while these models excel in long-context tasks, their…

View →
cs.CRcs.AIRecentApr 7, 2026

CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments

Gustav Keppler, Moritz Gstür, Veit Hagenmeyer

The paper introduces CritBench, a novel framework to evaluate LLM cybersecurity capabilities specifically within IEC 61850 Digital Substation Operational Technology (OT) environments, finding that whi…

View →
cs.CRRecentApr 13, 2026

Optimizing IoT Intrusion Detection with Tabular Foundation Models for Smart City Forensics

Asma Al-Dahmani, Abdulla Bin Safwan, Mohammad Obeidat, Belal Alsinglawi

The paper demonstrates that using the transformer-based foundation model TabPFNv2.5 can significantly speed up IoT intrusion detection compared to traditional ensemble methods while maintaining high a…

View →
cs.CRcs.LGcs.SERecentMar 31, 2026

Efficient Software Vulnerability Detection Using Transformer-based Models

Sameer Shaik, Zhen Huang, Daniela Stan Raicu, Jacob Furst

This paper proposes using transformer-based models on program slices to accurately detect C/C++ software vulnerabilities by capturing both local and global contextual information.

View →
cs.CEcs.LGphysics.comp-phRecentMay 27, 2026

Adapting Automotive Aerodynamics Surrogates to New Vehicle Families via Transfer Learning

Seunghwan Keum, Alok Warey

The paper demonstrates that Low-Rank Adaptation (LoRA) is an effective and superior method for adapting large, pretrained Transformer surrogates for automotive aerodynamics to new vehicle families usi…

View →
cs.LGcs.AIRecentMay 30, 2026

Dive into Waves: Morlet Spectral Transformer for Cross-Subject Emotion Decoding from EEG

Jiaxin Qing, Lexin Li

The paper proposes the Morlet Spectral Transformer (MST), a novel architecture that effectively decodes cross-subject emotion from EEG by designing specialized spectral and spatial representations, ou…

View →
cs.AIcs.CLRecentMay 28, 2026

Notation Matters: A Benchmark Study of Token-Optimized Formats in Agentic AI Systems

Lorenz Kutschka, Bernhard Geiger

This study benchmarks token-optimized formats (TOON and TRON) against JSON in end-to-end agentic AI systems, finding that TRON significantly reduces token overhead with minimal performance degradation…

View →