Models
List of models available in 8080.
Available Models
Section titled “Available Models”8080 is currently in private beta and is offering only a limited number of models in our API:
| Model Name | Description | Max Tokens | Rate Limits (RPM / TPM) |
|---|---|---|---|
8080/taalas/llama3.1-8b-instruct | Smallest model, fast | 25k | 250 RPM / 100k TPM |
8080/llm_server/gpt-oss-20b | Small model, balanced | 64k | 250 RPM / 100k TPM |
For pricing and quotas for each model, see 8080 pricing.