Tom Butler
butlert
AI & ML interests
None yet
Organizations
Models
-
Open-Orca/Mistral-7B-OpenOrca
Text Generation • Updated • 2.83k • 686 -
butlert/Llama-2-7b-chat-hf-sharded-bf16-fine-tuned-adapters
Updated -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 16.7M • 853 -
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54
llava models
-
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54 -
liuhaotian/llava-v1.6-mistral-7b
Image-Text-to-Text • 8B • Updated • 10.7k • 241 -
liuhaotian/llava-v1.5-7b
Image-Text-to-Text • Updated • 185k • 536 -
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text • 2B • Updated • 82 • 19
Optimized Vision Language Models
-
Efficient-Large-Model/VILA-2.7b
Text Generation • 3B • Updated • 243 • 15 -
NousResearch/Obsidian-3B-V0.5
Text Generation • Updated • 110 • 178 -
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text • 2B • Updated • 82 • 19 -
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54
Papers
-
Extending Context Window of Large Language Models via Semantic Compression
Paper • 2312.09571 • Published • 16 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Paper • 2312.02949 • Published • 14 -
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Paper • 2402.14289 • Published • 20
Image Classification
Datasets
reasoning
Papers
-
Extending Context Window of Large Language Models via Semantic Compression
Paper • 2312.09571 • Published • 16 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 51 -
LLaVA-Grounding: Grounded Visual Chat with Large Multimodal Models
Paper • 2312.02949 • Published • 14 -
TinyLLaVA: A Framework of Small-scale Large Multimodal Models
Paper • 2402.14289 • Published • 20
Models
-
Open-Orca/Mistral-7B-OpenOrca
Text Generation • Updated • 2.83k • 686 -
butlert/Llama-2-7b-chat-hf-sharded-bf16-fine-tuned-adapters
Updated -
openai/clip-vit-base-patch32
Zero-Shot Image Classification • Updated • 16.7M • 853 -
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54
Image Classification
llava models
-
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54 -
liuhaotian/llava-v1.6-mistral-7b
Image-Text-to-Text • 8B • Updated • 10.7k • 241 -
liuhaotian/llava-v1.5-7b
Image-Text-to-Text • Updated • 185k • 536 -
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text • 2B • Updated • 82 • 19
Datasets
Optimized Vision Language Models
-
Efficient-Large-Model/VILA-2.7b
Text Generation • 3B • Updated • 243 • 15 -
NousResearch/Obsidian-3B-V0.5
Text Generation • Updated • 110 • 178 -
bczhou/TinyLLaVA-1.5B
Image-Text-to-Text • 2B • Updated • 82 • 19 -
liuhaotian/LLaVA-Lightning-MPT-7B-preview
Text Generation • Updated • 28 • 54