Phi: Preference Hijacking in Multi-modal Large Language Models at Inference Time
Paper • 2509.12521
Image classification
QuantLRM: Quantization of Large Reasoning Models via Fine-Tuning Signals
Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph