
List Models

Retrieve information about all available models, including their capabilities, pricing, and specifications.

Endpoint

GET https://api.assisters.dev/v1/models

Request

No request body is required, and authentication is optional for this endpoint.

curl --request GET \
  --url https://api.assisters.dev/v1/models
from openai import OpenAI

client = OpenAI(
    api_key="ask_your_api_key",
    base_url="https://api.assisters.dev/v1"
)

models = client.models.list()

for model in models.data:
    print(f"{model.id}: {model.owned_by}")

Response

{
  "object": "list",
  "data": [
    {
      "id": "llama-3.1-8b",
      "object": "model",
      "created": 1704067200,
      "owned_by": "meta",
      "category": "chat",
      "version": "3.1",
      "capabilities": {
        "chat_completion": true,
        "streaming": true,
        "function_calling": false
      },
      "pricing": {
        "input_price_per_million": 0.10,
        "output_price_per_million": 0.10,
        "currency": "usd"
      },
      "specifications": {
        "context_window": 128000,
        "max_output_tokens": 8192,
        "training_cutoff": "2024-03"
      },
      "status": "active"
    },
    {
      "id": "e5-large-v2",
      "object": "model",
      "created": 1704067200,
      "owned_by": "microsoft",
      "category": "embedding",
      "version": "2.0",
      "capabilities": {
        "embeddings": true
      },
      "pricing": {
        "price_per_million_tokens": 0.01,
        "currency": "usd"
      },
      "specifications": {
        "max_tokens": 512,
        "output_dimensions": 1024
      },
      "status": "active"
    }
  ]
}

Response Fields

object (string)
Always "list".

data (array)
Array of model objects. Each entry carries id, object, created, owned_by, category, capabilities, pricing, specifications, and status, as shown in the example response above.
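If you want to catch schema drift early, you can sanity-check each entry against the field names documented above. A minimal sketch; the dict below mirrors the example response, and this is an illustrative check, not an official schema:

```python
# Field names from the documented model object.
REQUIRED_FIELDS = {
    "id", "object", "created", "owned_by", "category",
    "capabilities", "pricing", "specifications", "status",
}

def validate_models_response(resp):
    """Raise ValueError if the response is missing documented fields."""
    if resp.get("object") != "list":
        raise ValueError("expected object == 'list'")
    for model in resp.get("data", []):
        missing = REQUIRED_FIELDS - model.keys()
        if missing:
            raise ValueError(f"{model.get('id')}: missing {sorted(missing)}")
```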

Get Single Model

Retrieve information about a specific model:
GET https://api.assisters.dev/v1/models/{model_id}
model = client.models.retrieve("llama-3.1-8b")
print(f"Context window: {model.specifications.context_window}")

Available Models by Category

Chat Models

Model          Provider      Context   Price (per M tokens)
llama-3.1-8b   Meta          128K      $0.10
llama-3.1-70b  Meta          128K      $0.90
mistral-7b     Mistral AI    32K       $0.10
qwen2-7b       Alibaba       32K       $0.10
gemma-2-9b     Google        8K        $0.15
phi-3-mini     Microsoft     4K        $0.08
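Because the pricing fields are expressed per million tokens, a request's cost follows directly from the token counts. A small sketch using the llama-3.1-8b prices from the table above; the token counts are illustrative:

```python
def estimate_cost(input_tokens, output_tokens,
                  input_price_per_million, output_price_per_million):
    """Estimate request cost in USD from per-million-token prices."""
    return (input_tokens * input_price_per_million
            + output_tokens * output_price_per_million) / 1_000_000

# llama-3.1-8b charges $0.10/M for both input and output tokens
cost = estimate_cost(4_000, 1_000, 0.10, 0.10)
print(f"${cost:.4f}")  # prints $0.0005
```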

Embedding Models

Model               Provider    Dimensions   Price (per M tokens)
e5-large-v2         Microsoft   1024         $0.01
bge-base-en         BAAI        768          $0.01
jina-embeddings-v2  Jina AI     768          $0.02
nomic-embed-text    Nomic AI    768          $0.01
gte-large           Alibaba     1024         $0.01

Moderation Models

Model          Provider   Price (per M tokens)
llama-guard-3  Meta       $0.20
shieldgemma    Google     $0.15

Reranking Models

Model            Provider   Price (per M tokens)
bge-reranker-v2  BAAI       $0.05
jina-reranker    Jina AI    $0.08

Detailed Model Comparison

See full specifications and benchmarks for all models

Filtering Models

Filter models by category in your code:
models = client.models.list()

# Get only chat models
chat_models = [m for m in models.data if m.category == "chat"]

# Get only embedding models
embedding_models = [m for m in models.data if m.category == "embedding"]

# Get active models only
active_models = [m for m in models.data if m.status == "active"]
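The same filters combine with the pricing fields, for example to pick the cheapest active chat model. A sketch over sample dicts mirroring the response shape (the client returns objects with attribute access; prices here are from the chat-model table above):

```python
# Sample entries mirroring the list response.
models = [
    {"id": "llama-3.1-8b", "category": "chat", "status": "active",
     "pricing": {"input_price_per_million": 0.10}},
    {"id": "llama-3.1-70b", "category": "chat", "status": "active",
     "pricing": {"input_price_per_million": 0.90}},
    {"id": "phi-3-mini", "category": "chat", "status": "active",
     "pricing": {"input_price_per_million": 0.08}},
]

cheapest = min(
    (m for m in models if m["category"] == "chat" and m["status"] == "active"),
    key=lambda m: m["pricing"]["input_price_per_million"],
)
print(cheapest["id"])  # phi-3-mini
```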

Model Status

Status       Description
active       Fully available and recommended for use
beta         Available but may change without notice
deprecated   Still available but will be removed; migrate to alternatives
We announce deprecations at least 3 months in advance. Check the changelog for updates.
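One way to act on the status field is to surface deprecations at startup rather than discovering them at removal time. A minimal sketch; ensure_usable is a hypothetical helper, not part of the API:

```python
import warnings

def ensure_usable(model):
    """Warn on deprecated models; return True only for active ones."""
    if model["status"] == "deprecated":
        warnings.warn(f"{model['id']} is deprecated; migrate before removal")
    return model["status"] == "active"
```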

Caching

The models endpoint has a 5-minute cache. For real-time availability, check the status page.
Cache-Control: public, max-age=300
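Since the server already caches for five minutes, polling the endpoint more often than that gains nothing; a client-side cache with the same TTL avoids the redundant requests. A sketch, where fetch stands in for whatever callable performs the real request (e.g. lambda: client.models.list()):

```python
import time

CACHE_TTL = 300  # seconds, matching Cache-Control: max-age=300

_cache = {"models": None, "fetched_at": 0.0}

def get_models_cached(fetch):
    """Return the model list, re-fetching at most once per CACHE_TTL."""
    now = time.monotonic()
    if _cache["models"] is None or now - _cache["fetched_at"] >= CACHE_TTL:
        _cache["models"] = fetch()
        _cache["fetched_at"] = now
    return _cache["models"]
```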