
Streams a model-generated response for a chat turn while persisting both the user message and assistant reply.

POST /studio/chat_stream
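
A minimal client sketch in Go for consuming the SSE stream. The field names in the JSON body, the host, and the port are illustrative assumptions; the actual request schema is not shown on this page.

```go
package main

import (
	"bufio"
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"strings"
)

func main() {
	// Hypothetical request body; the real schema is not documented here.
	body, _ := json.Marshal(map[string]any{
		"message": "Hello!",
		"options": map[string]any{"Provider": "stub"},
	})

	resp, err := http.Post("http://localhost:8080/studio/chat_stream",
		"application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Read the SSE stream line by line; "data:" lines carry each chunk.
	scanner := bufio.NewScanner(resp.Body)
	for scanner.Scan() {
		line := scanner.Text()
		if strings.HasPrefix(line, "data:") {
			fmt.Println(strings.TrimSpace(strings.TrimPrefix(line, "data:")))
		}
	}
}
```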

The provider implementation is selected dynamically based on options.Provider.
Token usage and estimated cost are computed prior to generation.
Response headers are set to disable buffering and compression so SSE events are delivered smoothly.
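
A server-side sketch of the behavior described above, using Go's net/http. The provider registry, stub provider, and exact header set are assumptions for illustration, not the service's actual implementation.

```go
package main

import (
	"encoding/json"
	"fmt"
	"net/http"
)

// StreamFunc emits tokens for a prompt; a real implementation would call the
// underlying model provider. The stub below is illustrative only.
type StreamFunc func(prompt string, emit func(token string))

var providers = map[string]StreamFunc{
	// Hypothetical provider name; the real set of providers is not listed on this page.
	"stub": func(prompt string, emit func(string)) {
		for _, word := range []string{"Hello", "from", "the", "stub", "provider"} {
			emit(word)
		}
	},
}

func chatStreamHandler(w http.ResponseWriter, r *http.Request) {
	var req struct {
		Message string `json:"message"`
		Options struct {
			Provider string `json:"Provider"`
		} `json:"options"`
	}
	if err := json.NewDecoder(r.Body).Decode(&req); err != nil {
		http.Error(w, err.Error(), http.StatusBadRequest)
		return
	}

	// Select the provider implementation dynamically by options.Provider.
	stream, ok := providers[req.Options.Provider]
	if !ok {
		http.Error(w, "unknown provider", http.StatusBadRequest)
		return
	}

	// Headers that disable buffering/compression so SSE events flush immediately.
	w.Header().Set("Content-Type", "text/event-stream")
	w.Header().Set("Cache-Control", "no-cache")
	w.Header().Set("Connection", "keep-alive")
	w.Header().Set("X-Accel-Buffering", "no") // tell nginx-style proxies not to buffer

	flusher, ok := w.(http.Flusher)
	if !ok {
		http.Error(w, "streaming unsupported", http.StatusInternalServerError)
		return
	}

	// Emit each token as an SSE "data:" event and flush it right away.
	stream(req.Message, func(token string) {
		fmt.Fprintf(w, "data: %s\n\n", token)
		flusher.Flush()
	})
}

func main() {
	http.HandleFunc("/studio/chat_stream", chatStreamHandler)
	http.ListenAndServe(":8080", nil)
}
```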

Request

Responses

OK
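
The response body is a server-sent events stream rather than a single JSON document. The event payload schema is not documented here; purely as an illustration of the SSE wire format, each chunk arrives as a data: line followed by a blank line:

```
data: <first chunk>

data: <next chunk>
```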