Skip to main content
LayerRail AI inference gives a project a small set of model endpoints, API keys, and a playground for testing requests before you wire them into an application.

What it is for

Use AI inference when you want to:
  • Test text generation models.
  • Generate embeddings.
  • Build simple AI features without operating model servers.
  • Keep inference keys scoped to a LayerRail project.
  • Prototype prompts in the console.

Model catalog

The inference catalog shows available endpoints for the project. Each model card includes:
FieldMeaning
Model nameThe model you pass in requests.
ProviderThe system backing the endpoint.
CapabilityText generation or embeddings.
Context lengthThe supported input window.
PriceInput and output token pricing where configured.
URLThe endpoint URL to call from your application.

API keys

Create an inference API key before sending requests.
curl https://console.layerrail.com/ai/v1/chat/completions \
  -H "Authorization: Bearer $INFERENCE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "@cf/meta/llama-3.1-8b-instruct",
    "messages": [
      { "role": "user", "content": "Write a short deployment checklist." }
    ]
  }'
Inference API keys are project secrets. Do not expose them in frontend code.

Next steps

Playground

API authentication