Papers similar to 2605.29041

~ similar to 2605.29041· 19 results

cs.AIRecentMay 28, 2026

Temporal Stability and Few-Shot Prompting in Math Task Assessment

Danielle S. Fox, Brenda L. Robles, Elizabeth DiPietro Brovey, Christian D. Schunn

This study investigated the stability and prompt-responsiveness of AI tools in classifying the cognitive demand of math tasks, finding that few-shot prompting was a more reliable performance booster t…

View →

cs.AIcs.CLcs.HCRecentMay 28, 2026

Surfacing Isolated Learners with Outcome-Independent Mediation of Feedback between Teachers and Students Using AI

Junsoo Park, Youssef Medhat, Htet Phyo Wai, Ploy Thajchayapong +1 more

The paper proposes an interpretable, AI-driven decision layer that ranks course topics needing attention using multiple student and teacher signals, successfully identifying learning gaps before forma…

View →

cs.AIcs.MARecentMay 28, 2026

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Yulei Ye, Wenhao Li, Zhong Wen, Yunshu Huang +22 more

The paper introduces AgentSchool, an advanced LLM-powered multi-agent simulator that models learning as state transitions to provide a robust, ethically viable testbed for educational research and ped…

View →

cs.AIcs.CYRecentMay 28, 2026

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Julius Gabelmann, Felix Jahn, Kevin Baum, Sophie van Rossum +3 more

This paper proposes a modular, agentic AI chatbot architecture to assist students with exercise solving, aiming to ensure responsible and pedagogically sound AI use in education.

View →

cs.AIRecentJun 1, 2026

VET: A Framework for Analyzing AI Discourse

Meredith Ringel Morris

The paper introduces the VET Framework, a tool for analyzing polarized public discourse on AI by categorizing narratives based on valence, effectiveness, and trajectory, thereby promoting AI literacy.

View →

cs.HCcs.CRRecentMay 22, 2026

From Preventive to Reactive: How AI Coding Assistants Transform Developers' Security Awareness

Faisal Haque Bappy, Tahrim Hossain, Sidratul Muntaher Meheraj, Annoor Sharara Akhand +4 more

The paper investigates how AI coding assistants shift developers' security focus from proactive prevention to reactive review, finding that this structural change is reinforced by current tool interac…

View →

cs.HCcs.AIcs.CYRecentMay 29, 2026

The New Social Image: How AI Competency and AI Proactivity Influence Self- and Peer-Perceptions in the Workplace

Kuntal Ghosh, Marc Hassenzahl, Shadan Sadeghian

The study found that while AI collaboration is promising, highly competent and proactive AI systems can negatively impact human perceptions of ownership and job meaningfulness, suggesting that design…

View →

cs.CYcs.AIRecentMay 31, 2026

Beyond Access: Guided LLM Scaffolding for Independent Learning in Undergraduate Statistics

Mohammad Amanlou, Yasaman Amou-Jafari, Mehrad Livian, Fatemeh Boloukazari +2 more

This study compares different levels of LLM access in a statistics course, finding that structured, guided use significantly improves students' reasoning skills and independent learning compared to un…

View →

cs.AIRecentJun 1, 2026

Beyond One-shot: AI Agents for Learning in Field Experiments

Junjie Luo, Ritu Agarwal, Gordon Gao

The paper demonstrates that tool-augmented agentic AI can learn from prior field experiment data to automatically generate superior, domain-specific interventions, transforming one-shot A/B testing in…

View →

cs.HCcs.AIRecentMay 29, 2026

Developing a Culturally Grounded, AI-Augmented UX Research Point of View (POV): An Exemplar Case Study from Telemedicine Dementia Care

Abiodun Adedeji, Huseyin Dogan, Festus Adedoyin, Michelle Heward +4 more

This paper demonstrates the development of a culturally grounded, AI-augmented User Experience Research Point of View (POV) for a telemedicine dementia care framework in Nigeria, providing a replicabl…

View →

cs.CLcs.AIRecentMay 27, 2026

Models That Know How Evaluations Are Designed Score Safer

Katharina Deckenbach, Haritz Puerto, Jonas Geiping, Sahar Abdelnabi

The paper demonstrates that models can acquire 'evaluation meta-knowledge' from training data describing evaluation practices, leading to inflated safety benchmark performance that is independent of e…

View →

cs.HCcs.AIRecentMay 29, 2026

From Evidence to Design: Developing an AI-Augmented UX Research Point of View for Digital Wellbeing in Emergency and Public Safety Contexts

Olumuyiwa Ayorinde, Huseyin Dogan, Festus Adedoyin, Nan Jiang +3 more

The paper develops an AI-augmented UX Research Point-of-View (PoV) framework to guide the design of digital wellbeing tools for high-stress Emergency and Public Safety Personnel (EPSP), finding that s…

View →

cs.SEcs.AIRecentJun 3, 2026

From Prompt to Process: a Process Taxonomy and Comparative Assessment of Frameworks Supporting AI Software Development Agents

Sanderson Oliveira de Macedo

This paper studies AI development frameworks for software engineering and proposes a six-dimension process taxonomy.

View →

cs.HCcs.AIRecentMay 29, 2026

Developing an AI-Powered UX Research Point of View for Digital Health in A Regulatory Context: An Exemplar Case from MSM and Transgender HIV Care in Nigeria

Emmanuel Oluwatosin Oluokun, Festus Fatai Adedoyin, Huseyin Dogan, Nan Jiang +4 more

The paper introduces a Generative AI-augmented User Experience Research (UXR) methodology, operationalized through a four-stage process, to create actionable, stigma-aware design guidance for digital…

View →

cs.HCcs.AIcs.IRRecentMay 28, 2026

From Prompts to Context: An Ontology-Driven Framework for Human-Generative AI Collaboration

Ngoc Luyen Le, Marie-Hélène Abel, Bertrand Laforge

The paper introduces an ontology-driven framework, From Prompts to Context, to explicitly model and structure the often-opaque context of human-Generative AI collaborations, thereby improving traceabi…

View →

cs.AIRecentMay 27, 2026

BEAMS: Benchmarking and Evaluating AI for Modeling and Simulation

Sara Metcalf, William Schoenberg

The BEAMS initiative establishes comprehensive benchmarks and evaluates AI tools for modeling and simulation, finding that current AI tools excel at qualitative discussion tasks but struggle with comp…

View →

cs.AIRecentMay 31, 2026

The Case for Model Science: Verify, Explore, Steer, Refine

Przemyslaw Biecek, Luca Longo, Jianlong Zhou, Thomas Fel +2 more

The paper advocates for the establishment of Model Science, a systematic discipline that moves beyond simple benchmarking to deeply analyze AI models' internal workings and failure modes.

View →

cs.CYcs.AIRecentMay 31, 2026

AI From the Margins (AIM): Rethinking Participatory AI Design Through the Lived Experience of Minoritized Communities

Tijs Portegies, Laureanne Willems, Maaike Harbers, Giovanni Sileno +4 more

The paper proposes AI From the Margins (AIM), a methodological stance that centers the lived experiences of minoritized communities to fundamentally reshape the goals and scope of participatory AI des…

View →