Shuo Zhang

7 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

AI×4Vision×2Crypto×2NLP×1Info Retrieval×1

Frequent co-authors

Kuan Li2×

Huacan Wang2×

Fangzhou Yu2×

Yi Gu2×

Weipeng Ming2×

Lei Xue2×

Research Timeline

2026

TwoHamsters: Benchmarking Multi-Concept Compositional Unsafety in Text-to-Image Models

This paper introduces TwoHamsters, a new benchmark that rigorously tests Multi-Concept Compositional Unsafety (MCCU) in text-to-image models, demonstrating that current state-of-the-art models and safety defenses are highly vulnerable to subtle, compositionally unsafe prompts.

DPDSyn: Improving Differentially Private Dataset Synthesis for Model Training by Downstream Task Guidance

DPDSyn improves differentially private dataset synthesis by training a differentially private AI model on the original private data, which is then used to generate synthetic datasets that maintain high utility for downstream tasks.

HomeFlow: A Data Flywheel for Smart Home Agent Training with Verifiable Simulation

The paper introduces HomeFlow, a verifiable data flywheel that procedurally generates high-quality, multi-turn training data for smart home agents, achieving state-of-the-art performance on smart home tasks.

Why Not Hyperparameter-Friendly Optimisation? A Monotonic Adaptive Norm Rescaling Approach For Long-Tailed Recognition

The paper proposes Self-Adaptive Monotonic Normalization (SAMN), a hyperparameter-friendly method that improves long-tailed recognition by enforcing monotonicity on per-class weight norms without requiring parameter regularization.

SMH-Bench: Benchmarking LLM Agents for Environment-Grounded Reasoning and Action in Smart Homes

The paper introduces SMH-Bench, a comprehensive benchmark built on a simulator to rigorously test LLM agents' ability to perform complex, environment-grounded reasoning and actions in realistic smart-home scenarios.

When and How to Ask: Dynamic Preference Elicitation Strategies for Conversational Recommendation

This paper investigates the effectiveness of stage-dependent preference elicitation strategies in conversational recommendation systems and introduces COPE, a novel architecture for strategy modeling.

Beyond Score Prediction: LLM-Based Essay Scoring and Feedback Generation via Reinforcement Learning with Rubric Rewards

This paper proposes RLAES, a unified language model framework using reinforcement learning for essay scoring and feedback generation, with methods including Rubric-based Feedback Evaluation (RFE), Adaptive Gated Feedback Optimization (AGFO), and Adjacent Contrastive Reasoning (ACR).

Highlighted terms show continued research focus across papers

Papers

cs.CLcs.AIEmpiricalRecentJul 21, 2026

Beyond Score Prediction: LLM-Based Essay Scoring and Feedback Generation via Reinforcement Learning with Rubric Rewards

Xuefeng Jin, Jiashuo Zhang, Teng Cao, Bin Yang

View →

cs.IREmpirical