Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents
Enterprise-grade AI models
ChartNet: A Million-Scale, High-Quality Multimodal Dataset for Robust Chart Understanding
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion
Extract and convert document content from images
Convert document images to HTML with Docling
Transcribe or translate spoken audio to text in your browser
Granite 4.0 1B Speech recognition and translation demo
RAG example using Granite [vision, embedding, instruct]