Models

List of models available in 8080.

Available Models

8080 is currently in private beta and offers a limited number of models through the API:

| Model Name | Description | Max Tokens | Rate Limits (RPM / TPM) |
| --- | --- | --- | --- |
| `8080/taalas/llama3.1-8b-instruct` | Smallest model, fast | 25k | 250 RPM / 100k TPM |
| `8080/llm_server/gpt-oss-20b` | Small model, balanced | 64k | 250 RPM / 100k TPM |

For pricing and quotas for each model, see 8080 pricing.