ArXivCSExplorer
☆☆Bookmarks🏆RSSHow to UseFAQ
Built with and by Teycir Ben Soltane•
How to Use•FAQ•GitHub•arXiv.org•
Share:

20 results for “Basic knowledge of human-computer interaction and document highlighting behavior.”

CS papers only

Hybrid search: Keyword + semantic, ranked by combined score.ⓘ

Want pure semantic search? Try claim verification →

cs.CLRecentMay 30, 2026

I-WebGenBench : Evaluating Interactivity in LLM-Generated Scientific Web Applications

Dasen Dai, Biao Wu, Meng Fang, Shuoqi Li +1 more

The paper introduces I-WebGenBench, a framework and benchmark that converts static scientific papers into executable, interactive web systems, allowing users to dynamically explore the paper's mechani…

View →
cs.IRcs.AIRecentMay 30, 2026

SkillPager: Query-Adaptive Intra-Skill Navigation via Semantic Node Retrieval

Zicai Cui, Zihan Guo, Weiwen Liu, Weinan Zhang

SkillPager is a novel two-stage framework that efficiently selects minimal, execution-sufficient context from large procedural skill documents by leveraging typed semantic nodes, significantly reducin…

View →
cs.IRcs.CLcs.HCEmpiricalRecentJun 10, 2026

Factions Within, Uncertain Across: Within-Document Reader Sub-Groups in Social Highlighting

Kazuki Nakayashiki, Keisuke Watanabe

This paper investigates whether a group of people highlighting the same document forms a single consensus or is internally structured into reader sub-groups.

View →
cs.IRcs.CLcs.HCEmpiricalRecentJun 10, 2026

The Long Tail, Not the Front Page: Cold-Start Prediction of Crowd Highlight Salience

Kazuki Nakayashiki, Keisuke Watanabe

This paper predicts the aggregate crowd salience of a document from its text before its marks accumulate.

View →
cs.CLcs.AIcs.CVRecentMay 31, 2026

Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing

Minglai Yang, Xinyan Velocity Yu, Pengyuan Li, Xinyu Guo +21 more

The paper introduces Dr. DocBench, a difficulty-aware, comprehensive benchmark designed to rigorously test expert-level and challenging document parsing capabilities for VLMs, demonstrating that curre…

View →
cs.CVRecentJun 1, 2026

Detecting Pen-In-Air States from Video: A Proof-of-Concept Toward Complementary Handwriting Analysis

Lauren Sismeiro, Remy Plastre, Binbin Xu, Frederic Puyjarinet +1 more

This paper demonstrates a proof-of-concept method using top-view video to detect 'Pen-Up' states in handwriting, showing it can reliably complement traditional digitizing tablets for developmental dis…

View →
cs.AIRecentMay 28, 2026

Cookie-Bench: Continuous On-screen Key Interaction Evaluation for Web Generation

Haoyue Yang, Zhangxiao Shen, Fan Ding, Hangting Lou +7 more

The paper introduces Cookie-Bench, a novel, autonomous, and reference-free evaluation framework that significantly improves the assessment of interactive web generation capabilities for frontier LLMs.

View →
cs.CVcs.AIcs.CLRecentJun 1, 2026

Multimodal Approaches for Visually-Rich Document Type Classification: A Comparative Analysis

Catyana Heyne, Jürgen Frikel, Filippo Riccio

The paper systematically compares multimodal transformer and LLM approaches for document type classification, finding that specialized multimodal Transformers outperform LLM-based models, especially w…

View →
cs.LGcs.AIcs.CRRecentMay 12, 2026

No More, No Less: Task Alignment in Terminal Agents

Sina Mavali, David Pape, Jonathan Evertz, Samira Abedini +4 more

The paper introduces the Task Alignment Benchmark (TAB) to evaluate terminal agents' ability to selectively follow relevant environmental instructions while ignoring misleading distractors, revealing…

View →
cs.IRRecentJun 3, 2026

SearchLog: A Web Browser Extension for Capturing Search Logs in Laboratory Studies

Jiaman He, Riccardo Xia, Dana McKay, Damiano Spina +1 more

The paper presents SearchLog, a web browser extension for collecting natural search logs during lab-based studies.

View →
cs.SEcs.AIRecentMay 29, 2026

Benchmarking Multimodal LLMs on Code Generation for Complex Interactive Webpages

Fan Wu, Lishuai Dong, Cuiyun Gao, Yujia Chen +3 more

The paper introduces WebIGBench, a novel benchmark designed to rigorously evaluate multimodal LLMs' ability to generate code for complex, interactive webpages, addressing the limitations of existing s…

View →
cs.CLcs.CVRecentMay 30, 2026

Do Text Edits Generalize to Visual Generation? Benchmarking Cross-Modal Knowledge Editing in UMMs

Xin Gao, Cheng Yang, Chufan Shi, Taylor Berg-Kirkpatrick

The paper introduces UniKE, a benchmark showing that successful knowledge edits in text-only multimodal models do not reliably transfer to image generation, revealing a significant modality gap.

View →
cs.CLRecentJun 1, 2026

What to Format and How: A Benchmark and Workflow Approach for Document Formatting

Shihao Rao, Liang Li, Jiapeng Liu, Tong Lin +5 more

The paper introduces DocFormBench, a new benchmark for content-aware document formatting, and proposes DocFormFlow, a workflow that improves formatting accuracy and efficiency by decoupling target loc…

View →
cs.AIRecentMay 27, 2026

LiveBrowseComp: Are Search Agents Searching, or Just Verifying What They Already Know?

HuiMing Fan, Xiao Wang, Zheng Chu, Qianyu Wang +4 more

The paper argues that current search agents often verify existing knowledge rather than genuinely searching, and introduces LiveBrowseComp, a new benchmark to measure true evidence-driven discovery.

View →
cs.CLcs.AIcs.LGRecentJun 4, 2026

Operation-Guided Progressive Human-to-AI Text Transformation Benchmark for Multi-Granularity AI-Text Detection

Sondos Mahmoud Bsharat, Jiacheng Liu, Xiaohan Zhao, Tianjun Yao +8 more

The paper introduces OpAI-Bench, a novel benchmark designed to study how AI authorship signals evolve and accumulate during the progressive co-editing process between humans and AI.

View →
cs.AIRecentMay 28, 2026

MEMENTO: Leveraging Web as a Learning Signal for Low-Data Domains

Ashutosh Ojha, Vinay Aggarwal, Ashutosh Srivastava, Siddharth Yedlapati +2 more

MEMENTO proposes a novel framework that treats the open web as a continuous learning signal, enabling agents to acquire task-specific expertise and reusable research strategies in low-data domains wit…

View →
cs.CCcs.HCstat.CORecentJun 1, 2026

Attention Dynamics and Adaptive Decision Support in C5ISR: A Recurrence Quantification Analysis of Visual and Multimodal Attention Guidance Effects on Mission Performance

Hyun-Gee Jei, Caleb J. Armstrong, Farzan Sasangohar

This study analyzed eye-tracking data in a simulated military environment, finding that multimodal adaptive decision support tools significantly improve mission performance compared to visual-only too…

View →
cs.AIRecentJun 1, 2026

Tracking the Behavioral Trajectories of Adapting Agents

Jonah Leshin, Manish Shah, Ian Timmis

The paper introduces a framework to quantitatively measure evolving agent behaviors (traits) by analyzing changes in their configuration text files, achieving high accuracy in classifying behavioral s…

View →
cs.CLcs.AIcs.CVRecentJun 4, 2026

Benchmarking Open-Source Layout Detection Models for Data Snapshot Extraction from Institutional Documents

AJ Carl P. Dy, Aivin V. Solatorio

This paper introduces a new benchmark dataset and evaluation framework for 'data snapshot extraction,' focusing on identifying and localizing semantically meaningful analytical artifacts within operat…

View →
cs.CLcs.AIRecentMay 29, 2026

KnowledgeGain: Evaluating and Optimizing Science News Generation for Reader Learning

Dominik Soós, Meng Jiang, Jian Wu

The paper introduces KnowledgeGain, a novel metric that measures the actual knowledge gained by readers from science news, and demonstrates its use in optimizing news generation to improve reader lear…

View →