AEGIS is a novel multi-agent framework that grounds vulnerability reasoning by reconstructing per-variable dependency chains over a Code Property Graph, achieving state-of-the-art performance on the PrimeVul dataset.
Large Language Models (LLMs) are increasingly adopted for vulnerability detection, yet their reasoning remains fundamentally unsound. We identify a root cause shared by both major mitigation paradigms (agent-based debate and retrieval augmentation): reasoning in an ungrounded deliberative space that lacks a bounded, hypothesis-specific evidence base. Without such grounding, agents fabricate cross-function dependencies, and retrieval heuristics supply generic knowledge decoupled from the repository's data-flow topology. Consequently, the resulting conclusions are driven by rhetorical persuasiveness rather than verifiable facts. To ground this deliberation, we present AEGIS, a novel multi-agent framework that shifts detection from ungrounded speculation to forensic verification over a closed factual substrate. Guided by a "From Clue to Verdict" philosophy, AEGIS first identifies suspicious code anomalies (clues), then dynamically reconstructs per-variable dependency chains for each clue via on-demand slicing over a repository-level Code Property Graph. Within this closed evidence boundary, a Verifier Agent constructs competing dialectical arguments for and against exploitability, while an independent Audit Agent scrutinizes every claim against the trace, exercising veto power to prevent hallucinated verdicts. Evaluation on the rigorous PrimeVul dataset demonstrates that AEGIS establishes a new state-of-the-art, achieving 122 Pair-wise Correct Predictions. To our knowledge, this is the first approach to surpass 100 on this benchmark. It reduces the false positive rate by up to 54.40% compared to leading baselines, at an average cost of $0.09 per sample without any task-specific training.
AEGIS: Adversarial Entropy-Guided Immune System -- Thermodynamic State Space Models for Zero-Day Net…
AEGIS introduces a novel physics-based system that analyzes encrypted network tr…
AEGIS: Scaling Long-Sequence Homomorphic Encrypted Transformer Inference via Hybrid Parallelism on M…
AEGIS is a novel system that significantly improves the scalability of running l…
Beyond RAG for Cyber Threat Intelligence: A Systematic Evaluation of Graph-Based and Agentic Retriev…
The paper systematically evaluates advanced retrieval-augmented generation (RAG)…
Diffusion-Guided Adversarial Perturbation Injection for Generalizable Defense Against Facial Manipul…
The paper proposes AEGIS, a novel diffusion-guided method for injecting adversar…
Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models
This paper introduces a novel framework, the Reasoning Safety Monitor, to detect…
Agent Audit: A Security Analysis System for LLM Agent Applications
Agent Audit is a novel security analysis system that comprehensively audits LLM…
ORACAL: A Robust and Explainable Multimodal Framework for Smart Contract Vulnerability Detection wit…
ORACAL, a novel multimodal framework, achieves state-of-the-art smart contract v…
Software Vulnerability Detection Using a Lightweight Graph Neural Network
The paper proposes VulGNN, a lightweight Graph Neural Network (GNN) model, which…