Models

8080 is currently in private beta and offers only a limited number of models through its API:

| Model Name | Description | Max Tokens | Rate Limits (RPM / TPM) |
| --- | --- | --- | --- |
| 8080/taalas/llama3-8b-instruct | Smallest model, fast | 25k | 250 RPM / 100k TPM |
| 8080/llm_server/gpt-oss-20b | Small model, balanced | 64k | 250 RPM / 100k TPM |

For more details on each model, please refer to the pricing documentation.
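
The limits in the table can be kept in a small client-side lookup so that oversized requests are caught before they are sent. The following is a minimal sketch, not part of the 8080 API: the names `ModelLimits`, `MODEL_LIMITS`, `min_seconds_between_requests`, and `fits_limits` are hypothetical, "k" is assumed to mean 1,000, and "Max Tokens" is assumed to be a per-request cap (the page does not say whether it covers input, output, or both).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelLimits:
    max_tokens: int  # per-request token cap (assumed meaning of "Max Tokens")
    rpm: int         # requests per minute
    tpm: int         # tokens per minute


# Values copied from the models table; "k" is assumed to mean 1,000.
MODEL_LIMITS = {
    "8080/taalas/llama3-8b-instruct": ModelLimits(max_tokens=25_000, rpm=250, tpm=100_000),
    "8080/llm_server/gpt-oss-20b": ModelLimits(max_tokens=64_000, rpm=250, tpm=100_000),
}


def min_seconds_between_requests(model: str) -> float:
    """Spacing between evenly paced requests that stays within the RPM limit."""
    return 60.0 / MODEL_LIMITS[model].rpm


def fits_limits(model: str, tokens: int) -> bool:
    """True if a single request of `tokens` tokens fits both the
    per-request cap and the per-minute token budget."""
    limits = MODEL_LIMITS[model]
    return tokens <= limits.max_tokens and tokens <= limits.tpm
```

For example, `fits_limits("8080/taalas/llama3-8b-instruct", 30_000)` is false under these assumptions, while the same request fits `8080/llm_server/gpt-oss-20b`. The server remains the authority on limits; a check like this only avoids requests that would be rejected anyway.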