📄️ Chat Template
The `reader_id` can be found as the model's key in the `live_readers` object, returned by the status endpoint.
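For example, a minimal sketch of reading the reader IDs from the status endpoint (the base URL and the `/status` path are assumptions here; adjust them to your deployment):

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port; adjust to your deployment

# The status endpoint returns a `live_readers` object whose keys are reader IDs.
status = requests.get(f"{BASE_URL}/status").json()  # `/status` path assumed
reader_ids = list(status["live_readers"].keys())
print("Available reader IDs:", reader_ids)
```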
📄️ Classify
To send a batch of requests all at once, the text field can be either a string or an array of strings.
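A sketch of a batched classify call, assuming a `/classify` path and a local base URL (both assumptions); the `text` field accepting a string or an array of strings comes from the description above:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

# Single request: `text` is a plain string.
single = requests.post(f"{BASE_URL}/classify", json={"text": "Is this spam?"})

# Batched request: `text` is an array of strings, classified in one call.
batch = requests.post(
    f"{BASE_URL}/classify",
    json={"text": ["Is this spam?", "Great product, would buy again."]},
)
print(single.json(), batch.json())
```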
📄️ Detokenize
The detokenization endpoint takes in a list of tokens and returns the decoded text: the text reconstructed from the tokens the model sees.
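A minimal sketch of detokenizing a list of token IDs, assuming a `/detokenize` path and a `tokens` field name (both assumptions):

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

# Send a list of token IDs and get back the decoded text.
payload = {"tokens": [101, 7592, 2088, 102]}  # illustrative token IDs only
response = requests.post(f"{BASE_URL}/detokenize", json=payload)  # path assumed
print(response.json())
```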
📄️ Embed
To send a batch of requests all at once, the text field can be either a string or an array of strings.
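A sketch of an embed call sending a batch in one request; the `/embed` path and base URL are assumptions:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

# `text` may be a single string or an array of strings; an array is embedded
# as a batch, yielding one embedding per input string.
payload = {"text": ["first document", "second document"]}
response = requests.post(f"{BASE_URL}/embed", json=payload)  # `/embed` path assumed
print(response.json())
```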
📄️ Generate from image (Streamed)
The `/image_generate_stream` endpoint is used to communicate with the LLM. Use this endpoint when you want to send an image to a multimodal LLM and stream the response back as it is generated.
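A hedged sketch of streaming a response from `/image_generate_stream`; the multipart field names and base URL are assumptions:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

with open("photo.jpg", "rb") as image_file:
    response = requests.post(
        f"{BASE_URL}/image_generate_stream",
        files={"image": image_file},             # field name assumed
        data={"text": "Describe this image."},   # field name assumed
        stream=True,
    )
    # Consume the response incrementally as the model generates it.
    for chunk in response.iter_lines():
        if chunk:
            print(chunk.decode())
```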
📄️ Generate from image (Buffered)
The `/image_generate` endpoint is used to communicate with the LLM. Use this endpoint when you want to send an image to a multimodal LLM and receive the full response in a single payload.
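A hedged sketch of the buffered variant against `/image_generate`, with the same assumed field names and base URL:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

with open("photo.jpg", "rb") as image_file:
    response = requests.post(
        f"{BASE_URL}/image_generate",
        files={"image": image_file},             # field name assumed
        data={"text": "Describe this image."},   # field name assumed
    )
# The buffered endpoint returns the whole generation in one response.
print(response.text)
```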
📄️ Generate (Streamed)
The `/generate_stream` endpoint is used to communicate with the LLM. Use this endpoint when you want to stream the response back token by token as it is generated.
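A sketch of consuming `/generate_stream` incrementally; the `text` field name and base URL are assumptions:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

response = requests.post(
    f"{BASE_URL}/generate_stream",
    json={"text": "Write a haiku about the sea."},  # `text` field assumed
    stream=True,
)
# Print chunks as they arrive instead of waiting for the full completion.
for chunk in response.iter_lines():
    if chunk:
        print(chunk.decode(), flush=True)
```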
📄️ Generate (Buffered)
The `/generate` endpoint is used to communicate with the LLM. Use this endpoint when you want to receive the full response in a single payload once generation has finished.
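A sketch of a buffered call to `/generate`, with the same assumed field name and base URL:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

response = requests.post(
    f"{BASE_URL}/generate",
    json={"text": "Write a haiku about the sea."},  # `text` field assumed
)
# The buffered endpoint blocks until generation finishes, then returns everything at once.
print(response.text)
```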
📄️ Check health of all readers
The `/healthz` endpoint is used to check if the server is running, and whether it's ready to receive requests. A 200 response indicates the server is healthy.
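A minimal health check against `/healthz` (only the base URL is an assumption):

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

response = requests.get(f"{BASE_URL}/healthz")
# A 200 status code means the server is up and ready to receive requests.
if response.status_code == 200:
    print("Server is healthy.")
else:
    print(f"Server not ready (status {response.status_code}).")
```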
📄️ Status
Returns all information about the app and its current status.
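A sketch of fetching and inspecting the status payload; the `/status` path and base URL are assumptions:

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

status = requests.get(f"{BASE_URL}/status").json()  # `/status` path assumed
# Inspect the full status payload, including the `live_readers` object.
for key, value in status.items():
    print(key, ":", value)
```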
📄️ Tokenize
The tokenization endpoint takes in text and returns its tokenized form: the way the text is seen by the model.
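A minimal sketch of tokenizing a piece of text, assuming a `/tokenize` path and a `text` field name (both assumptions):

```python
import requests

BASE_URL = "http://localhost:3000"  # assumed host/port

# Send text and get back the tokens the model actually sees.
payload = {"text": "Hello, world!"}  # `text` field assumed
response = requests.post(f"{BASE_URL}/tokenize", json=payload)  # path assumed
print(response.json())
```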