Iterative Critique-Refine Framework for Enhancing LLM Personalization Paper • 2510.24469 • Published Oct 28, 2025
MLLM as a UI Judge: Benchmarking Multimodal LLMs for Predicting Human Perception of User Interfaces Paper • 2510.08783 • Published Oct 9, 2025 • 4
Learning to Route LLMs from Bandit Feedback: One Policy, Many Trade-offs Paper • 2510.07429 • Published Oct 8, 2025 • 3
Optimizing Data Delivery: Insights from User Preferences on Visuals, Tables, and Text Paper • 2411.07451 • Published Nov 12, 2024
MODS: Moderating a Mixture of Document Speakers to Summarize Debatable Queries in Document Collections Paper • 2502.00322 • Published Feb 1, 2025
A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations Paper • 2505.14106 • Published May 20, 2025
The Photographer Eye: Teaching Multimodal Large Language Models to See and Critique like Photographers Paper • 2509.18582 • Published Sep 23, 2025 • 3
mSCoRe: a $M$ultilingual and Scalable Benchmark for $S$kill-based $Co$mmonsense $Re$asoning Paper • 2508.10137 • Published Aug 13, 2025 • 2
Lizard: An Efficient Linearization Framework for Large Language Models Paper • 2507.09025 • Published Jul 11, 2025 • 18
A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality Paper • 2507.07202 • Published Jul 9, 2025 • 24
FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback Paper • 2307.10867 • Published Jul 20, 2023
Forecasting Time Series with LLMs via Patch-Based Prompting and Decomposition Paper • 2506.12953 • Published Jun 15, 2025 • 2
MS4UI: A Dataset for Multi-modal Summarization of User Interface Instructional Videos Paper • 2506.12623 • Published Jun 14, 2025 • 2
LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles Paper • 2506.06561 • Published Jun 6, 2025 • 2
Follow the Flow: Fine-grained Flowchart Attribution with Neurosymbolic Agents Paper • 2506.01344 • Published Jun 2, 2025 • 6