Ram Potham

2 indexed papers

Recent (6 mo)

With code

Influential cites

Benchmarked

Publications per year

Top categories

Crypto×2AI×2Software Eng.×1NLP×1

Frequent co-authors

Tyler Tracy1×

Nick Kuhn1×

Research Timeline

2026

An Independent Safety Evaluation of Kimi K2.5

The paper conducts a preliminary safety evaluation of the open-weight LLM Kimi K2.5, finding that while it is highly capable, it exhibits concerning dual-use risks, particularly regarding CBRNE misuse and disinformation, and recommends mandatory safety testing for future open-weight models.

LinuxArena: A Control Setting for AI Agents in Live Production Software Environments

The paper introduces LinuxArena, a large-scale, diverse control setting for testing AI agents in live production environments, demonstrating its utility for evaluating both attack and defense mechanisms.

Highlighted terms show continued research focus across papers

Papers

cs.CRcs.AIcs.SERecentApr 16, 2026

LinuxArena: A Control Setting for AI Agents in Live Production Software Environments

Tyler Tracy, Ram Potham, Nick Kuhn, Myles Heller +30 more

View →

cs.CRcs.AIcs.CLRecentApr 3, 2026