Models

Models available through the 8080 API.

8080 is currently in private beta and offers only a limited number of models through its API:

| Model Name | Description | Max Tokens | Rate Limits (RPM / TPM) |
| --- | --- | --- | --- |
| 8080/taalas/llama3.1-8b-instruct | Smallest model, fast | 25k | 250 RPM / 100k TPM |
| 8080/llm_server/gpt-oss-20b | Small model, balanced | 64k | 250 RPM / 100k TPM |
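
The limits listed for each model can be encoded client-side to check a request before sending it. The following is a minimal sketch, not part of any 8080 SDK: the `ModelLimits` class and the helper functions are hypothetical names, "k" is assumed to mean 1,000, and "Max Tokens" is treated as a single per-request cap, since the table does not say whether it covers input, output, or both.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelLimits:
    """Per-model limits (hypothetical helper type, not an 8080 API)."""
    max_tokens: int  # per-request token cap
    rpm: int         # requests per minute
    tpm: int         # tokens per minute


# Values copied from the model table; "k" is assumed to mean 1,000.
MODEL_LIMITS = {
    "8080/taalas/llama3.1-8b-instruct": ModelLimits(25_000, 250, 100_000),
    "8080/llm_server/gpt-oss-20b": ModelLimits(64_000, 250, 100_000),
}


def within_max_tokens(model: str, token_count: int) -> bool:
    """Return True if a request of token_count tokens fits the model's cap."""
    return token_count <= MODEL_LIMITS[model].max_tokens


def min_seconds_between_requests(model: str) -> float:
    """Spacing between requests that keeps a steady client under the RPM limit."""
    return 60.0 / MODEL_LIMITS[model].rpm


def sustained_tokens_per_request(model: str) -> int:
    """Average tokens per request if the client sends at the full RPM limit."""
    limits = MODEL_LIMITS[model]
    return limits.tpm // limits.rpm


if __name__ == "__main__":
    for name in MODEL_LIMITS:
        print(name,
              min_seconds_between_requests(name),
              sustained_tokens_per_request(name))
```

With the listed limits, a client sending at the full 250 RPM can average only 400 tokens per request before reaching 100k TPM, so for longer prompts the TPM limit, not the RPM limit, is the binding constraint.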

For pricing and quotas for each model, see 8080 pricing.