Embedding Models
Create vector representations of text for semantic search, clustering, recommendations, and RAG applications.Available Models
BGE-M3 (Recommended)
BAAI’s state-of-the-art multilingual embedding model supporting 100+ languages. FREE via HuggingFace.| Specification | Value |
|---|---|
| Provider | BAAI (via HuggingFace) |
| Dimensions | 1024 |
| Max Tokens | 8192 |
| Price | FREE |
| Similarity Metric | Cosine |
- Multilingual semantic search
- Production RAG systems
- Cross-lingual retrieval
- Cost-free deployment
Free Model: BGE-M3 is our recommended model for all embedding use cases. It offers superior multilingual support and quality at zero cost.
E5-large-v2
Microsoft’s flagship text embedding model with state-of-the-art performance.| Specification | Value |
|---|---|
| Provider | Microsoft |
| Dimensions | 1024 |
| Max Tokens | 512 |
| Price | $0.01 / million tokens |
| Similarity Metric | Cosine |
- Semantic search
- Document retrieval
- Question answering
- High-accuracy requirements
BGE-base-en
BAAI’s balanced embedding model with excellent English performance.| Specification | Value |
|---|---|
| Provider | BAAI |
| Dimensions | 768 |
| Max Tokens | 512 |
| Price | $0.01 / million tokens |
| Similarity Metric | Cosine |
- Cost-effective search
- English-only applications
- RAG systems
- Production deployments
Jina Embeddings v2
Jina AI’s long-context embedding model for entire documents.| Specification | Value |
|---|---|
| Provider | Jina AI |
| Dimensions | 768 |
| Max Tokens | 8192 |
| Price | $0.02 / million tokens |
| Similarity Metric | Cosine |
- Long documents
- Full-page embeddings
- Reduced chunking needs
- Document comparison
Nomic Embed Text
Nomic AI’s efficient embedding model with long context support.| Specification | Value |
|---|---|
| Provider | Nomic AI |
| Dimensions | 768 |
| Max Tokens | 8192 |
| Price | $0.01 / million tokens |
| Similarity Metric | Cosine |
- Long-context on budget
- Open-source preference
- General-purpose search
- Academic applications
GTE-large
Alibaba’s general text embeddings model with high dimensionality.| Specification | Value |
|---|---|
| Provider | Alibaba |
| Dimensions | 1024 |
| Max Tokens | 512 |
| Price | $0.01 / million tokens |
| Similarity Metric | Cosine |
- High-dimensional search
- Multilingual content
- Cross-lingual retrieval
- Asian language content
Model Comparison
| Model | Dimensions | Max Tokens | Quality | Price |
|---|---|---|---|---|
bge-m3 | 1024 | 8192 | ★★★★★ | FREE |
e5-large-v2 | 1024 | 512 | ★★★★★ | $0.01/M |
bge-base-en | 768 | 512 | ★★★★☆ | $0.01/M |
jina-embeddings-v2 | 768 | 8192 | ★★★★☆ | $0.02/M |
nomic-embed-text | 768 | 8192 | ★★★☆☆ | $0.01/M |
gte-large | 1024 | 512 | ★★★★☆ | $0.01/M |
Benchmark Results
MTEB (Massive Text Embedding Benchmark)
| Model | Average Score | Retrieval | STS | Price |
|---|---|---|---|---|
bge-m3 | 66.1 | 58.2 | 86.4 | FREE |
e5-large-v2 | 64.2 | 56.8 | 85.6 | $0.01/M |
bge-base-en | 63.4 | 55.2 | 84.1 | $0.01/M |
gte-large | 63.1 | 54.9 | 83.7 | $0.01/M |
jina-embeddings-v2 | 62.8 | 54.3 | 82.9 | $0.02/M |
nomic-embed-text | 61.5 | 53.1 | 81.4 | $0.01/M |
Use Cases
Semantic Search
Semantic Search
Find documents by meaning, not just keywords:
RAG (Retrieval-Augmented Generation)
RAG (Retrieval-Augmented Generation)
Retrieve relevant context for LLM responses:
Clustering
Clustering
Group similar content together:
Deduplication
Deduplication
Find and remove duplicate content:
Best Practices
Batch Requests
Embed multiple texts in one request for better throughput
Cache Embeddings
Store embeddings to avoid recomputing for the same text
Normalize Vectors
Most models output normalized vectors; verify for your use case
Match Query/Doc Models
Use the same model for queries and documents
Vector Databases
Store and search embeddings efficiently:| Database | Type | Features |
|---|---|---|
| Pinecone | Managed | Fast, scalable, serverless |
| Weaviate | Self-hosted | Open-source, hybrid search |
| Qdrant | Self-hosted | Rust-based, efficient |
| Milvus | Self-hosted | Distributed, GPU support |
| pgvector | Extension | PostgreSQL integration |