Introduction to Assisters API
Assisters API provides OpenAI-compatible endpoints for accessing open-source AI models. Whether you’re building chatbots, search systems, or content moderation tools, our API offers a simple, cost-effective solution.What is Assisters API?
Assisters API is a unified interface for accessing multiple open-source AI models through a single API. We handle the infrastructure, scaling, and optimization so you can focus on building great products.OpenAI Compatible
Use the same code you write for OpenAI - just change the base URL
18+ Models
Access Llama, Mistral, Qwen, and more through one API
Pay Per Token
Only pay for what you use, starting at $0.08 per million tokens
99.9% Uptime
Enterprise-grade reliability with SLA guarantees
Key Features
Chat Completions
Generate conversational responses with streaming support. Perfect for chatbots, assistants, and interactive applications.Text Embeddings
Create vector representations for semantic search, clustering, and similarity matching.Content Moderation
Detect harmful, inappropriate, or policy-violating content before it reaches your users.Document Reranking
Improve search results by reranking documents based on relevance to a query.Available Models
We offer models across four categories:| Category | Models | Starting Price |
|---|---|---|
| Chat | Llama 3.1, Mistral, Qwen, Phi-3 | $0.08/M tokens |
| Embeddings | E5, BGE, Jina, Nomic | $0.01/M tokens |
| Moderation | Llama Guard, ShieldGemma | $0.15/M tokens |
| Reranking | BGE Reranker, Jina Reranker | $0.05/M tokens |
View All Models
Browse our complete model catalog with detailed specifications
Use Cases
Chatbots & Virtual Assistants
Chatbots & Virtual Assistants
Build conversational AI that understands context and provides helpful responses. Our chat models support multi-turn conversations with streaming for real-time interaction.
Semantic Search
Semantic Search
Convert text into embeddings for similarity search. Find relevant documents, answers, or products based on meaning rather than keywords.
Content Moderation
Content Moderation
Automatically detect harmful content, spam, or policy violations before they reach your platform. Protect your users and brand.
RAG Applications
RAG Applications
Combine embeddings with chat completions to build retrieval-augmented generation systems that answer questions from your knowledge base.
Getting Started
Ready to build? Follow these steps:Create an Account
Sign up at assisters.dev to get your free API key with 100K tokens.
Make Your First Call
Follow our quickstart guide to send your first request.