Unggi Lee
2 indexed papers
Research Timeline
BuddyBench introduces a novel, privacy-constrained multi-task benchmark that integrates longitudinal learning trajectories, standardized clinical assessments, and randomized trial data to advance pediatric social-communication personalization.
TeachObs is a comprehensive, human-validated benchmark for multimodal teaching observation. Its evaluation of frontier LLMs finds that no single model consistently outperforms the others and that expert judgment remains crucial for accurate analysis.
Papers
TeachObs: A Human-Validated Benchmark for Multimodal Teaching Observation and Model Evaluation
Yeil Jeong, Youngjin Yoo, Seobin Sohn, Hyejin Han +3 more