Configurable Reward Model for Balanced Safety Alignment | ArxivCSExplorer