From "Weak" Signals to Strong Models: Preference Delta Aggregation with LoRA Merging