Garv Shah
1 indexed paper
Recent (6 mo)
1With code
0Influential cites
0Benchmarked
0Publications per year
126
Top categories
ML×1AI×1Crypto×1
Frequent co-authors
Research Timeline
2026
Self-Mined Hardness for Safety Fine-Tuning
The paper proposes a novel safety fine-tuning method that uses the target model's own rollouts to identify and train on the hardest prompts, significantly reducing jailbreak success rates while maintaining usability.
Highlighted terms show continued research focus across papers