These are the flagship POTION models. Load and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers.
Minish
non-profit
AI & ML interests: small models
Organization Card
Hello, we're Minish!
About us
We're a two-person (pringled and stephantul) open-source lab, with a focus on Natural Language Processing.
We believe that if you make models fast enough, you unlock new possibilities.
Using our models and packages, you can:
- Embed the entire English Wikipedia in 5 minutes
- Classify tens of thousands of documents per second on a CPU
- Approximately deduplicate extremely large datasets in minutes
- Build the fastest RAG application in the world
- Easily evaluate which ANN algorithm works best for your data
Our projects:
- model2vec: tiny static embedding models with state-of-the-art performance.
- potion: the best small models in the world. 100-500x faster than a sentence-transformer, and almost as good.
- vicinity: consistent interfaces to many approximate nearest neighbor algorithms.
- semhash: lightning-fast, highly accurate semantic deduplication and filtering for your text datasets.
- model2vec-rs: a Rust port of model2vec.
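As a minimal sketch of how the potion models above can be used from Python: the snippet below loads one with model2vec's `StaticModel.from_pretrained` and `encode`, then compares embeddings by cosine similarity. The helper names (`embed_texts`, `cosine_similarity`, `most_similar`) and the choice of `minishlab/potion-base-8M` as the default are illustrative assumptions, not part of any Minish package, and the snippet assumes `pip install model2vec` plus network access for the first download.

```python
import math
from typing import List, Sequence


def embed_texts(texts: List[str], model_name: str = "minishlab/potion-base-8M"):
    """Embed texts with a POTION model via model2vec (hypothetical helper)."""
    # Imported lazily so the pure-Python helpers below work without model2vec installed.
    from model2vec import StaticModel

    # Downloads the model from the Hugging Face Hub on first use.
    model = StaticModel.from_pretrained(model_name)
    # Returns one embedding row per input text.
    return model.encode(texts)


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity between two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query_vec: Sequence[float], doc_vecs: Sequence[Sequence[float]]) -> int:
    """Index of the document vector closest to the query by cosine similarity."""
    scores = [cosine_similarity(query_vec, doc) for doc in doc_vecs]
    return max(range(len(scores)), key=scores.__getitem__)
```

For example, `vecs = embed_texts(["fast embeddings", "static embedding models", "a recipe for soup"])` followed by `most_similar(vecs[0], vecs[1:])` gives the position of the text closest to the first one. For larger collections, an approximate nearest neighbor index would likely replace the brute-force loop in `most_similar`.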
You can also find us on: GitHub, LinkedIn, Discord
models 14
minishlab/potion-code-16M
16M • Updated • 21.8k • 20
minishlab/potion-multilingual-128M
0.1B • Updated • 78.2k • 58
minishlab/potion-base-32M
32.3M • Updated • 66k • 26
minishlab/potion-base-8M
7.56M • Updated • 588k • 77
minishlab/potion-base-4M
3.78M • Updated • 464k • 9
minishlab/potion-base-2M
1.89M • Updated • 18k • 17
minishlab/potion-retrieval-32M
32.3M • Updated • 152k • 28
minishlab/M2V_base_output
7.56M • Updated • 8.14k • 10
minishlab/potion-8m-edu-classifier
Updated • 7 • 2
minishlab/potion-science-8M
Updated • 9 • 2
datasets 8
minishlab/tokenlearn-cornstack-docs-coderankembed-v2
Viewer • Updated • 600k • 64 • 1
minishlab/tokenlearn-cornstack-queries-coderankembed-v2
Viewer • Updated • 600k • 62 • 1
minishlab/tokenlearn-cornstack-queries-coderankembed
Viewer • Updated • 300k • 149 • 2
minishlab/tokenlearn-cornstack-docs-coderankembed
Viewer • Updated • 300k • 104 • 2
minishlab/tokenlearn-c4-multilingual-bge-m3
Viewer • Updated • 12M • 355 • 2
minishlab/tokenlearn-c4-en-bge-base-en-v1.5
Viewer • Updated • 10M • 662 • 2
minishlab/my-vicinity-repo
Viewer • Updated • 5 • 22 • 2
minishlab/tokenlearn_C4
Updated • 10 • 2