Tingting Zhang
2 indexed papers
Research Timeline
The paper introduces PetroBench, a comprehensive benchmark for evaluating large language models across various domains of petroleum engineering, and finds that models perform better on subjective tasks than on objective factual-knowledge tasks.
The paper introduces FAM-Bench, a novel multimodal benchmark designed to test advanced, condition-aware reasoning for food-as-medicine applications.
Papers
FAM-Bench: A Multimodal Benchmark for Condition-Aware Food-as-Medicine Reasoning