"Understanding of transformer frameworks"

20 results for “Understanding of transformer frameworks”

CS papers only

Hybrid search: keyword + semantic, ranked by combined score.

cs.CR, cs.LG, cs.SE | Recent | Mar 31, 2026

Efficient Software Vulnerability Detection Using Transformer-based Models

Sameer Shaik, Zhen Huang, Daniela Stan Raicu, Jacob Furst

This paper proposes using transformer-based models on program slices to accurately detect C/C++ software vulnerabilities by capturing both local and global contextual information.

cs.LG, cs.AI, cs.CC | Recent | May 28, 2026

Revisiting Padded Transformer Expressivity: Which Architectural Choices Matter and Which Don't

Anej Svete, William Merrill, Ryan Cotterell, Ashish Sabharwal

The paper analyzes the expressivity of padded transformers, proving that their computational power is primarily determined by model depth and numeric precision, rather than attention type or width.

cs.LG, cs.CL, eess.SP | Recent | May 31, 2026

Beyond Sinusoids: A Morlet Wavelet Framework for Transformer Positional Encoding

Athanasios Zeris

The paper introduces Morlet Positional Encoding (MoPE), a novel wavelet-based positional encoding that models position and locality simultaneously, outperforming standard sinusoidal and RoPE methods.

cs.AI | Recent | May 28, 2026

Uncertainty-Aware Transfer Learning for Cross-Building Energy Forecasting: Toward Robust and Scalable District-Level Energy Management

Shadmehr Zaregarizi, Khashayar Yavari

The paper proposes an uncertainty-aware transfer learning framework using the Temporal Fusion Transformer (TFT) to achieve robust and scalable energy forecasting across different buildings, demonstrat…

cs.CR, cs.LG | Recent | Apr 4, 2026

Explainability-Guided Adversarial Attacks on Transformer-Based Malware Detectors Using Control Flow Graphs

Andrew Wheeler, Kshitiz Aryal, Maanak Gupta

This paper proposes an explainability-guided adversarial attack that successfully evades transformer-based malware detectors by perturbing the most influential components of the control flow graph rep…

cs.CR, cs.AI | Recent | Apr 7, 2026

CritBench: A Framework for Evaluating Cybersecurity Capabilities of Large Language Models in IEC 61850 Digital Substation Environments

Gustav Keppler, Moritz Gstür, Veit Hagenmeyer

The paper introduces CritBench, a novel framework to evaluate LLM cybersecurity capabilities specifically within IEC 61850 Digital Substation Operational Technology (OT) environments, finding that whi…

cs.CV, cs.AI | Recent | Jun 1, 2026

Parameter-Efficient Fine-Tuning of Large Pretrained Models for Instance Segmentation Tasks

Nermeen Abou Baker, David Rohrschneider, Uwe Handmann

This paper investigates the application of Parameter-Efficient Fine-Tuning (PEFT) methods, specifically adapters and LoRA, to large pretrained models for instance segmentation, demonstrating that thes…

cs.CR, cs.CV | Recent | Apr 16, 2026

Robustness of Vision Foundation Models to Common Perturbations

Hongbin Liu, Zhengyuan Jiang, Cheng Hong, Neil Zhenqiang Gong

This paper systematically studies the robustness of vision foundation models to common image perturbations, finding that most models are generally non-robust and proposing a fine-tuning method to impr…

cs.CR | Recent | Apr 11, 2026

EncFormer: Secure and Efficient Transformer Inference over Encrypted Data

Yufan Zhu, Chao Jin, Khin Mi Mi Aung, Xiaokui Xiao

EncFormer is a novel two-party framework that significantly improves the efficiency and scalability of private Transformer inference by optimizing the combination of Fully Homomorphic Encryption (FHE)…

cs.LG, cs.CC | Recent | Jun 1, 2026

Rethinking the Role of Positional Encoding: Sliding-Window Transformers without PE Remain Turing Complete

Qian Li, Xinyu Mao, Shang-Hua Teng

The paper demonstrates that positional encodings are not necessary for transformers to achieve universal computation, showing that the inherent mechanism of sliding context windows already provides su…

cs.CV | Recent | Jun 1, 2026

Edge Prediction for Roof Wireframe Reconstruction with Transformers

Gustav Hanning, Ludvig Dillén, Jonathan Astermark, Johanna Lidholm +1 more

The paper proposes a Transformer-based end-to-end architecture to reconstruct 3D house roof wireframes from sparse point clouds and semantic data, achieving state-of-the-art results on the S23DR Chall…

cs.CR | Recent | Apr 13, 2026

Optimizing IoT Intrusion Detection with Tabular Foundation Models for Smart City Forensics

Asma Al-Dahmani, Abdulla Bin Safwan, Mohammad Obeidat, Belal Alsinglawi

The paper demonstrates that using the transformer-based foundation model TabPFNv2.5 can significantly speed up IoT intrusion detection compared to traditional ensemble methods while maintaining high a…

cs.SE, cs.CL, eess.SY | Recent | May 29, 2026

Knowledge Boundary Probing and Demand-Guided Intervention for LLM-Based Power System Code Generation

Hui Wu, Xiaoyang Wang, Zhong Fan

The paper addresses the reliability of open-weight LLMs for power system code generation by identifying structured API-knowledge boundary errors and proposing a boundary-aware intervention that signif…

cs.LG, cs.CL | Recent | May 31, 2026

CART: Context-Anchored Recurrent Transformer -- A Parameter-Efficient Architecture with Learned Stability

Chad A. Capps

CART introduces a parameter-efficient recurrent transformer architecture that reuses a core block multiple times, but its performance does not surpass a dense baseline, suggesting that weight sharing…

cs.CE, cs.LG, physics.comp-ph | Recent | May 27, 2026

Adapting Automotive Aerodynamics Surrogates to New Vehicle Families via Transfer Learning

Seunghwan Keum, Alok Warey

The paper demonstrates that Low-Rank Adaptation (LoRA) is an effective and superior method for adapting large, pretrained Transformer surrogates for automotive aerodynamics to new vehicle families usi…

cs.LG | Recent | Jun 1, 2026

EEG-FuseFormer: A Transformer-Driven Feature Fusion Framework for Seizure Onset Prediction

Vigneshwar Hariharan, Chithra Reghuvaran, Arlene John, Nhat Pham +3 more

The paper proposes EEG-FuseFormer, a transformer-based framework that fuses features from CNN-LSTM and ResNet-18 to achieve high accuracy in predicting seizure onset from EEG signals.

cs.AR | Recent | Jun 1, 2026

CHIMERA: A Flexible and Scalable 3.1 TOPS/W AI-MCU with Transformer Accelerator and 563 Gb/s Shared-L2 Memory Subsystem with QoS Guarantees

Lorenzo Leone, Philip Wiese, Gamze İslamoğlu, Michael Rogenmoser +3 more

The paper introduces Chimera, a highly efficient and scalable MCU designed for ultra-low-power edge AI inference, achieving 3.1 TOPS/W by integrating a dedicated transformer accelerator and a QoS-guar…

cs.CR, cs.AI, cs.LG | Recent | Mar 27, 2026

Machine Learning Transferability for Malware Detection

César Vieira, João Vitorino, Eva Maia, Isabel Praça

This study evaluates various data preprocessing pipelines to improve the transferability and generalization of Machine Learning models for detecting malware in Portable Executable (PE) files across di…

cs.LG, cs.AI, math.OC | Recent | May 29, 2026

Agentic Transformers Provably Learn to Search via Reinforcement Learning

Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi

This paper demonstrates that transformer-based policies can provably learn complex tree search mechanisms, such as depth-first search, purely through reinforcement learning in a stochastic environment…
