API Overview
The Assisters API is fully compatible with the OpenAI API specification, making it easy to migrate existing applications or use familiar SDKs.Base URL
All API requests should be made to:Authentication
Authenticate using your API key in theAuthorization header:
Get Your API Key
Create an API key in your dashboard
Request Format
All requests should:- Use HTTPS
- Include
Content-Type: application/jsonheader - Send JSON-encoded request bodies
Response Format
All responses are JSON-encoded and follow the OpenAI response format:Available Endpoints
| Endpoint | Method | Description |
|---|---|---|
/v1/chat/completions | POST | Generate chat completions |
/v1/embeddings | POST | Create text embeddings |
/v1/moderate | POST | Content moderation |
/v1/rerank | POST | Document reranking |
/v1/models | GET | List available models |
Response Headers
Every response includes useful headers:| Header | Description |
|---|---|
X-Request-ID | Unique request identifier for debugging |
X-Processing-Time-Ms | Request processing time in milliseconds |
X-RateLimit-Limit-RPM | Your requests per minute limit |
X-RateLimit-Remaining-RPM | Remaining requests this minute |
X-RateLimit-Limit-TPM | Your tokens per minute limit |
X-RateLimit-Remaining-TPM | Remaining tokens this minute |
Rate Limits
Rate limits depend on your subscription tier:| Tier | RPM | TPM |
|---|---|---|
| Free | 10 | 100,000 |
| Developer | 100 | 1,000,000 |
| Startup | 500 | 5,000,000 |
| Enterprise | Custom | Custom |
429 Too Many Requests response with a Retry-After header.
Streaming
For chat completions, you can enable streaming for real-time responses:Idempotency
For POST requests, you can include anIdempotency-Key header to safely retry failed requests:
SDKs
Use the official OpenAI SDK with our base URL:Versioning
The API is currently at versionv1. We follow semantic versioning and will communicate any breaking changes well in advance.
View Changelog
See the latest API updates and changes