Hao Hu

14 indexed papers

Top categories

AI ×7, Crypto ×6, NLP ×5, Architecture ×2, Signal Processing ×1, ML ×1, Algorithms ×1, Prog. Lang. ×1

Frequent co-authors

Fei Cheng (2×)

Muhao Chen (2×)

Research Timeline

2026

Zero-Shot Vulnerability Detection in Low-Resource Smart Contracts Through Solidity-Only Training

The paper introduces Sol2Vy, a framework that enables cross-language knowledge transfer from Solidity to Vyper, allowing effective vulnerability detection in low-resource smart contracts without needing labeled Vyper training data.

Towards Secure Retrieval-Augmented Generation: A Comprehensive Review of Threats, Defenses and Benchmarks

This paper provides the first comprehensive, end-to-end survey dedicated to the security of Retrieval-Augmented Generation (RAG) systems, systematically mapping threats, defenses, and benchmarks across the entire pipeline.

Cooking Up Risks: Benchmarking and Reducing Food Safety Risks in Large Language Models

The paper introduces FoodGuardBench, a comprehensive benchmark and a specialized guardrail model (FoodGuard-4B) to rigorously test and mitigate the severe food safety risks posed by large language models.

Enabling AI ASICs for Zero Knowledge Proof

The paper introduces MORPH, a framework that reformulates Zero-Knowledge Proof (ZKP) computations to efficiently utilize AI ASICs like TPUs, achieving up to 10x higher throughput on NTT.

VIPER-MCP: Detecting and Exploiting Taint-Style Vulnerabilities in Model Context Protocol Servers

VIPER-MCP is a novel, end-to-end automated framework that detects and dynamically confirms the exploitability of taint-style vulnerabilities in Model Context Protocol (MCP) servers, achieving high-fidelity vulnerability discovery in real-world systems.

SAMark: A Self-Anchored Text Watermarking with Paragraph-Level Paraphrase Robustness

SAMark introduces a self-anchored text watermarking framework that achieves high robustness (up to 90.2% TP@FP1%) against challenging paragraph-level paraphrasing attacks by establishing a step-independent green region in semantic space.

EviLink: Multi-Path Schema Linking with Uncertainty-Guided Evidence Acquisition for Large-Scale Text-to-SQL

EviLink addresses the ambiguity of schema linking in Text-to-SQL by treating it as an uncertainty-aware inference over multiple plausible SQL paths, significantly improving recall and efficiency.

Tailoring the Curriculum: Student-Centered Reasoning Distillation via Dynamic Data-Model Compatibility

This paper introduces the Data-Model Compatibility (DMC) metric to quantify how suitable a dataset is for reasoning distillation, showing that optimizing data selection using DMC significantly improves the performance of smaller student models.

BenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agents

The paper introduces BenchTrace, a novel benchmark designed to rigorously evaluate the self-evolution and reflection capabilities of LLM agents, revealing that current models struggle with accurate failure diagnosis and generalizing learned lessons.

GTA: Generating Long-Horizon Tasks for Web Agents at Scale

The paper introduces GTA, a scalable framework for generating realistic, multi-hop web-agent tasks with dense, executable trajectories, addressing the current lack of process-level supervision in web agent research.

Refining Word-Based Grammatical Error Annotation for L2 Korean

This paper refines word-based grammatical error annotation for L2 Korean by adapting existing resources to better reflect Korean morphology and error types, improving the evaluation of Korean Grammatical Error Correction (K-GEC) systems.

COMPASS: Cognitive MCTS-Guided Process Alignment for Safe Search Agents

COMPASS introduces a Cognitive MCTS-Guided Process Alignment framework to ensure robust safety for LLM search agents by identifying and supervising risky intermediate steps in multi-step reasoning.

GPU-Accelerated Effective Resistance Analysis for 3D IC Power Delivery Network

This paper proposes a GPU-accelerated framework for analyzing effective resistance in 3D IC power delivery networks, achieving significant speedup with negligible error.

Toward Generalizable Cognitive Impairment Detection with Speech-Based Multimodal Large Language Models

This paper proposes a multimodal cognitive impairment detection framework using large language models that integrates speech audio and transcripts, achieving high classification accuracy.

Papers

eess.SP | cs.LG | Empirical | Recent | Jul 23, 2026

Toward Generalizable Cognitive Impairment Detection with Speech-Based Multimodal Large Language Models

Yingchao Huang, Xin Wang, Yuhan Su, Shanshan Yao

This paper proposes a multimodal cognitive impairment detection framework using large language models that integrates speech audio and transcripts, achieving high classification accuracy.

cs.AR | Empirical