FaceLLM Collection A multimodal large language model trained specifically for facial image understanding. Project page: https://www.idiap.ch/paper/facellm • 3 items • Updated Jul 23, 2025 • 4
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25, 2025 • 218
Hugging Face Changelog Filter by MCP compatibility available in HF Spaces May 21, 2025 • 79
AIMO Progress Prize Collection Models and datasets used in the winning solution to the AIMO 1st Progress Prize • 7 items • Updated Jul 19, 2024 • 14
Article PaliGemma – Google's Cutting-Edge Open Vision Language Model May 14, 2024 • 287
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 207