Skip to main content
The inference playground lets you send test prompts to an available text generation endpoint.

Start a chat

1

Open AI inference

In your project, open AI Inference.
2

Create an API key

Create an inference API key if the project does not already have one.
3

Open Playground

Choose the Playground tab.
4

Pick a model and key

Choose an inference endpoint and an API key.
5

Send a message

Enter a prompt and submit the request.

Advanced settings

Advanced settings can include:
  • Temperature.
  • Top-p.
  • Maximum tokens.
  • Streaming mode.
  • System prompt.
Use lower temperature for deterministic answers and higher temperature for more varied responses.

Files

If file upload is available for the selected endpoint, attach files only when the model supports that input type.
Embedding endpoints are not chat models. Use text generation endpoints in the playground.

Troubleshooting

ErrorWhat to check
400 responseConfirm the selected model supports the request shape.
Missing API keyCreate an inference API key first.
Empty responseTry a shorter prompt or a different model.
UnauthorizedMake sure the key belongs to the current project.