Papers similar to 2605.30200

~ similar to 2605.30200· 19 results

cs.CLcs.AIcs.HCRecentMay 29, 2026

Effects of Varying LLM Access on Essay Writing Behavior

Julia Christenson, Karin de Langis, Shirley Anugrah Hayati, Dongyeop Kang

The study found that constraining LLM access, rather than banning it, can preserve students' sense of authorship and encourage more strategic writing behaviors while still providing scaffolding benefi…

View →

cs.CLRecentMay 30, 2026

IDEAFix: Evaluation Framework for Creative Defixation Prompting in LLMs

F. Carichon, S. Sharma, M. Girard, R. Rampa +1 more

The paper introduces IDEAFix, a systematic evaluation framework designed to analyze how structured prompting and task design influence the divergent thinking and originality of idea generation in LLMs…

View →

stat.OTcs.AIEmpiricalRecentJun 9, 2026

Flaws in the LLM Automation Narrative

George Perrett, Javae Elliott, Jennifer Hill, Marc Scott

This paper evaluates the performance of a Large Language Model (LLM) in a high-stakes context by comparing it to human experts and measuring variance and error magnitude.

View →

stat.OTcs.AIEmpiricalRecentJun 9, 2026

Flaws in the LLM Automation Narrative

George Perrett, Javae Elliott, Jennifer Hill, Marc Scott

This paper evaluates the performance of a Large Language Model (LLM) in a high-stakes context by comparing it to human experts and measuring variance and error magnitude.

View →

cs.CYcs.AIRecentMay 31, 2026

Beyond Access: Guided LLM Scaffolding for Independent Learning in Undergraduate Statistics

Mohammad Amanlou, Yasaman Amou-Jafari, Mehrad Livian, Fatemeh Boloukazari +2 more

This study compares different levels of LLM access in a statistics course, finding that structured, guided use significantly improves students' reasoning skills and independent learning compared to un…

View →

cs.CRcs.AIcs.CYRecentMay 7, 2026

Detecting Verbatim LLM Copy-Paste in Homework

Aizierjiang Aiersilan

The paper proposes SteganoPrompt, an input-side watermark embedded in the assignment prompt that forces LLMs to generate a detectable signature in their output, thereby exposing verbatim copy-pasting.

View →

cs.IRcs.AIcs.CYRecentMay 27, 2026

Whose Name Comes Up? III: Persona Prompting Effects in LLM-Based Scholar Recommendation

Annabella Sánchez-Guzmán, Lukas Eberhard, Denis Helic, Lisette Espín-Noboa

The paper proposes a comprehensive benchmark to systematically audit how varying persona prompts and model choices affect the technical quality and social representativeness of scholar recommendations…

View →

cs.AIRecentMay 28, 2026

Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

Arturo Valdivia, Paolo Burelli

This paper proposes a multi-agent framework using LLMs to improve collaborative story generation, demonstrating that an iterative Writer-Editor process significantly enhances narrative quality for you…

View →

cs.AIcs.IRRecentMay 27, 2026

From Learning Resources to Competencies: LLM-Based Tagging with Evidence and Graph Constraints

Ngoc Luyen Le, Marie-Hélène Abel, Bertrand Laforge

The paper introduces an LLM-based pipeline that tags learning resources with structured competencies, achieving strong performance while providing traceable evidence and leveraging graph constraints.

View →

cs.CRcs.AIRecentJun 2, 2026

"Important You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

Hang Li, Fedor Filippov, Yuling Lin, Pengfei He +5 more

This paper investigates the vulnerability of LLM-based automatic grading systems to prompt injection (PI) attacks, demonstrating that current systems are highly susceptible to manipulation that can le…

View →

cs.CLRecentMay 28, 2026

When English Rewrites Local Knowledge: Global Narrative Dominance in Large Language Models

Md Arid Hasan, Ruwad Naswan, Farhan Samir, Sharifa Sultana +1 more

The paper demonstrates that using English prompts causes large language models to prioritize globally dominant narratives over local cultural knowledge, even when local evidence is provided.

View →

cs.CLcs.AIRecentJun 1, 2026

Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents

Aitor Arronte Alvarez, Naiyi Xie Fincham

This study evaluates LLMs in conversational tutoring to identify high-confidence social biases, finding that state-of-the-art models are often overconfident in their incorrect assessments of stereotyp…

View →

cs.AIcs.MARecentMay 28, 2026

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Yulei Ye, Wenhao Li, Zhong Wen, Yunshu Huang +22 more

The paper introduces AgentSchool, an advanced LLM-powered multi-agent simulator that models learning as state transitions to provide a robust, ethically viable testbed for educational research and ped…

View →

cs.AIRecentMay 28, 2026

Temporal Stability and Few-Shot Prompting in Math Task Assessment

Danielle S. Fox, Brenda L. Robles, Elizabeth DiPietro Brovey, Christian D. Schunn

This study investigated the stability and prompt-responsiveness of AI tools in classifying the cognitive demand of math tasks, finding that few-shot prompting was a more reliable performance booster t…

View →

cs.AIcs.LGRecentMay 28, 2026

When Does Persona Prompting Actually Help? A Retrieval and Metric Analysis of Expert Role Injection in LLMs

Shuai Xiao, Su Liu, Weikai Zhou, Jialun Wu +3 more

Persona prompting does not universally improve LLM performance; instead, it systematically trades increased expertise depth for reduced clarity, making multi-metric evaluation essential.

View →

cs.AIRecentMay 29, 2026

LLM-FACETS: A Privacy-Preserving Framework for Evaluating LLM Transparency and Accountability

Tom Lucas, Alessio Buscemi, Alfredo Capozucca, German Castignani +1 more

LLM-FACETS introduces an open-source, privacy-preserving framework designed to enable non-technical domain experts and compliance officers to audit and evaluate the transparency and accountability of…

View →

cs.AIcs.CLRecentMay 28, 2026

Demystifying Data Organization for Enhanced LLM Training

Yalun Dai, Yangyu Huang, Tongshen Yang, Yonghan Wang +7 more

This paper proposes four guidelines and two novel data ordering methods (STR and SAW) to systematically optimize data organization, significantly enhancing the stability and performance of LLM trainin…

View →

cs.AIcs.CYRecentMay 28, 2026

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Julius Gabelmann, Felix Jahn, Kevin Baum, Sophie van Rossum +3 more

This paper proposes a modular, agentic AI chatbot architecture to assist students with exercise solving, aiming to ensure responsible and pedagogically sound AI use in education.

View →

cs.CLRecentMay 30, 2026

Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities

Wajdi Zaghouani

The paper develops a theoretically grounded framework for evaluating multilingual LLMs in Social Sciences and Humanities, moving beyond traditional NLP benchmarks to assess interpretive validity and c…

View →

Effects of Varying LLM Access on Essay Writing Behavior

IDEAFix: Evaluation Framework for Creative Defixation Prompting in LLMs

Flaws in the LLM Automation Narrative

Flaws in the LLM Automation Narrative

Beyond Access: Guided LLM Scaffolding for Independent Learning in Undergraduate Statistics

Detecting Verbatim LLM Copy-Paste in Homework

Whose Name Comes Up? III: Persona Prompting Effects in LLM-Based Scholar Recommendation

Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

From Learning Resources to Competencies: LLM-Based Tagging with Evidence and Graph Constraints

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

When English Rewrites Local Knowledge: Global Narrative Dominance in Large Language Models

Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Temporal Stability and Few-Shot Prompting in Math Task Assessment

When Does Persona Prompting Actually Help? A Retrieval and Metric Analysis of Expert Role Injection in LLMs

LLM-FACETS: A Privacy-Preserving Framework for Evaluating LLM Transparency and Accountability

Demystifying Data Organization for Enhanced LLM Training

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities

Effects of Varying LLM Access on Essay Writing Behavior

IDEAFix: Evaluation Framework for Creative Defixation Prompting in LLMs

Flaws in the LLM Automation Narrative

Flaws in the LLM Automation Narrative

Beyond Access: Guided LLM Scaffolding for Independent Learning in Undergraduate Statistics

Detecting Verbatim LLM Copy-Paste in Homework

Whose Name Comes Up? III: Persona Prompting Effects in LLM-Based Scholar Recommendation

Improving Collaborative Storytelling with a Multi-Agent Framework Based on Large Language Models

From Learning Resources to Competencies: LLM-Based Tagging with Evidence and Graph Constraints

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

When English Rewrites Local Knowledge: Global Narrative Dominance in Large Language Models

Identifying High-Confidence Social Biases in LLMs for Trustworthy Conversational Tutoring Agents

AgentSchool: An LLM-Powered Multi-Agent Simulation for Education

Temporal Stability and Few-Shot Prompting in Math Task Assessment

When Does Persona Prompting Actually Help? A Retrieval and Metric Analysis of Expert Role Injection in LLMs

LLM-FACETS: A Privacy-Preserving Framework for Evaluating LLM Transparency and Accountability

Demystifying Data Organization for Enhanced LLM Training

Modularizing Educational LLM-Agency for Fostering Responsible Learning Assistance

Toward Responsible and Epistemically Grounded Multilingual LLMs for Computational Social Science and Humanities

"Important You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

"Important You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems