Generate from image (Buffered)

POST /image_generate

The /image_generate endpoint sends an image to a multimodal LLM and returns the text response all at once. To receive the response as a stream, token by token, use the /image_generate_stream endpoint instead.

This endpoint takes a multipart input with two required fields:

  1. 'json_data': JSON data in the same format used by the /generate and /generate_stream endpoints.
  2. 'image_data': a stream of bytes representing an image file.

Support for multipart requests is built into most common HTTP clients.
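For illustration, here is a minimal sketch of such a request that builds the multipart body by hand using only the Python standard library. The field names 'json_data' and 'image_data' and the path /image_generate come from the description above. The server address, the file name, the per-part content types, and the helper names build_multipart and image_generate are assumptions made for this example. The response body is returned as raw text, since its schema is not described here.

```python
import json
import urllib.request
import uuid

# Assumption: placeholder server address, not taken from the documentation.
BASE_URL = "http://localhost:3000"


def build_multipart(json_payload, image_bytes, image_name="image.png"):
    """Encode the two required fields as a multipart/form-data body.

    Returns (body_bytes, content_type_header_value).
    """
    boundary = uuid.uuid4().hex
    # 'json_data' part: the same JSON used for /generate and /generate_stream.
    # Assumption: the part is labelled application/json.
    json_part = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="json_data"\r\n'
        "Content-Type: application/json\r\n\r\n"
        f"{json.dumps(json_payload)}\r\n"
    ).encode("utf-8")
    # 'image_data' part: the raw bytes of the image file.
    image_header = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="image_data"; filename="{image_name}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    closing = f"\r\n--{boundary}--\r\n".encode("utf-8")
    body = json_part + image_header + image_bytes + closing
    return body, f"multipart/form-data; boundary={boundary}"


def image_generate(json_payload, image_bytes):
    """POST the multipart body to /image_generate and return the raw response text."""
    body, content_type = build_multipart(json_payload, image_bytes)
    request = urllib.request.Request(
        BASE_URL + "/image_generate",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

A call might look like image_generate({"text": "Describe this image"}, open("photo.png", "rb").read()), where the file name is hypothetical. An HTTP client with built-in multipart support would replace build_multipart entirely.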

To send a batch of requests against the same image, set the text field of the JSON payload to an array of strings instead of a single string. Only one image can be supplied per request; to run generation requests against different images, send them in quick succession and rely on automatic batching.
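The two batching patterns can be sketched as follows. This is an illustrative outline only: the helper names batch_payload and generate_for_images are hypothetical, and send stands for any callable that posts one 'json_data' payload and one image to /image_generate. Issuing the per-image requests from a thread pool is one way to send them in quick succession; how the server groups them is up to its automatic batching.

```python
from concurrent.futures import ThreadPoolExecutor


def batch_payload(prompts):
    """Same image, several prompts: the 'text' field becomes an array of strings."""
    return {"text": list(prompts)}


def generate_for_images(images, prompt, send, max_workers=8):
    """Different images: issue one request per image, concurrently.

    `send` is a hypothetical callable (json_payload, image_bytes) -> response.
    Results are returned in the same order as `images`.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(send, {"text": prompt}, image) for image in images]
        return [future.result() for future in futures]
```

Using a thread pool keeps each request an ordinary blocking call while still letting several requests reach the server at nearly the same time.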

Request

Responses

Takes in a multipart payload containing the JSON data and the image, and returns the response all at once.