Models

Osiris provides access to a variety of AI models across multiple providers. Each model has a billing multiplier that determines the effective token cost.

Available Models

Model ID                  Provider  Multiplier  Max Tokens
gpt-4o                    osiris    1.0x        16,384
gpt-4o-mini               osiris    0.3x        16,384
claude-sonnet-4-20250514  osiris    1.2x        8,192
claude-opus-4.6           osiris    3.0x        4,096
gemini-2.5-pro            osiris    1.0x        8,192
gemini-2.5-flash          osiris    0.5x        8,192
deepseek-v3               osiris    0.5x        8,192
o3                        osiris    2.0x        16,384
o4-mini                   osiris    0.8x        16,384

Billing Formula

Token charges are calculated using the following formula:

tokens_charged = (input_tokens + output_tokens) × multiplier

Higher multipliers reflect the higher upstream cost of premium models. For cost-sensitive workloads, use lower-multiplier models such as gpt-4o-mini (0.3x) or deepseek-v3 (0.5x).
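
The formula above can be sketched in Python as follows. This is a minimal illustration, not Osiris code: the function name tokens_charged and its parameters are hypothetical, and because this page does not say how fractional results are rounded, the sketch returns the exact product without rounding.

```python
from decimal import Decimal


def tokens_charged(input_tokens: int, output_tokens: int, multiplier: str) -> Decimal:
    """Apply the billing formula: (input_tokens + output_tokens) x multiplier.

    Hypothetical helper for illustration only. The multiplier is passed as a
    string (for example "0.3") so Decimal can represent it exactly.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Decimal avoids binary floating-point drift for multipliers such as 0.3.
    return (input_tokens + output_tokens) * Decimal(multiplier)


if __name__ == "__main__":
    # A request with 500 input and 200 output tokens on a 1.2x model.
    print(tokens_charged(500, 200, "1.2"))
```

Passing the multiplier as a string is a deliberate choice in this sketch: constructing a Decimal from a float such as 0.3 would carry over the float's rounding error.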

Example Calculations

Here's how billing works for different models with the same request (500 input + 200 output tokens):

# Example: gpt-4o (multiplier 1.0x)
Input tokens:  500
Output tokens: 200
Tokens charged = (500 + 200) × 1.0 = 700

# Example: claude-opus-4.6 (multiplier 3.0x)
Input tokens:  500
Output tokens: 200
Tokens charged = (500 + 200) × 3.0 = 2,100

# Example: gpt-4o-mini (multiplier 0.3x)
Input tokens:  500
Output tokens: 200
Tokens charged = (500 + 200) × 0.3 = 210
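
To compare every model for one request, the same calculation can be run over the whole table. The sketch below is an assumption-laden illustration: the multipliers are copied by hand from the Available Models table, and MULTIPLIERS and charges_by_model are hypothetical names, not part of any Osiris API.

```python
from decimal import Decimal

# Multipliers copied by hand from the Available Models table on this page.
MULTIPLIERS = {
    "gpt-4o": Decimal("1.0"),
    "gpt-4o-mini": Decimal("0.3"),
    "claude-sonnet-4-20250514": Decimal("1.2"),
    "claude-opus-4.6": Decimal("3.0"),
    "gemini-2.5-pro": Decimal("1.0"),
    "gemini-2.5-flash": Decimal("0.5"),
    "deepseek-v3": Decimal("0.5"),
    "o3": Decimal("2.0"),
    "o4-mini": Decimal("0.8"),
}


def charges_by_model(input_tokens: int, output_tokens: int) -> dict:
    """Return tokens charged per model, ordered from cheapest to most expensive."""
    total = input_tokens + output_tokens
    charges = {model: total * mult for model, mult in MULTIPLIERS.items()}
    # sorted() is stable, so models with equal charges keep their table order.
    return dict(sorted(charges.items(), key=lambda item: item[1]))


if __name__ == "__main__":
    # Same request as the examples above: 500 input + 200 output tokens.
    for model, charged in charges_by_model(500, 200).items():
        print(f"{model:<26} {charged}")
```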