The model becomes available within a few minutes. Adaptive supports most transformer-based models including Llama, Qwen, Gemma, Mistral, and DeepSeek. See Integrations for proprietary models.
response = adaptive.chat.create( model="llama-3.1-8b-instruct", messages=[ {"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Hello!"} ], labels={"project": "my-app"},)print(response.choices[0].message.content)# Get completion_id for feedbackcompletion_id = response.choices[0].completion_id
Requests are logged automatically. Use labels to organize and filter interactions. See Interactions for details.If you omit model, requests route to the use case’s default model.
# Streamingstream = adaptive.chat.create( messages=[{"role": "user", "content": "Hello!"}], stream=True,)for chunk in stream: if chunk.choices: print(chunk.choices[0].delta.content, end="", flush=True)
OpenAI compatibility
Use the OpenAI Python library with your Adaptive deployment: