Zhe Yang
2 indexed papers
Research Timeline
This paper investigates how Byte-Pair Encoding (BPE) tokenization causes Code LLMs to disproportionately memorize certain types of secrets, a phenomenon termed 'gibberish bias'.
The paper introduces Deep Spurious Regression (DSR) to address spurious correlations in continuous prediction tasks, proposing a method that exploits attribute similarity in both feature and label spaces for robust generalization.
Papers
Shortcut to Nowhere: Demystifying Deep Spurious Regression