Models
Available Models
Section titled “Available Models”8080 is currently in a private beta and is offering only a limited number of models in our API:
| Model Name | Description | Max Tokens | Rate Limits (RPM / TPM) |
|---|---|---|---|
8080/taalas/llama3-8b-instruct | Smallest model, fast | 25k | 250 RPM / 100k TPM |
8080/llm_server/gpt-oss-20b | Small model, balanced | 64k | 250 RPM / 100k TPM |
For more details on each model, please refer to the pricing documentation.