Skip to main content
Version: Next

Create reader

POST 

/reader

Create reader

Takes in a ReaderConfig payload and loads up new python reader

Request​

Body

required

    access_token stringnullable
    backend stringnullable
    batch_duration_millis int64nullable
    consumer_group stringrequired
    cuda_graph_cache_capacity int32nullable
    cuda_visible_devices stringnullable
    device stringrequired
    disable_cuda_graph int32nullable
    disable_paged_attention int32nullable
    disable_static int32nullable
    internal_gateway_ip stringnullable
    log_level stringnullable
    max_batch_size int64nullable
    max_sequence_length int32nullable
    model_name stringrequired
    nvlink_unavailable int32nullable
    page_cache_size stringnullable
    quant_type stringnullable
    tensor_parallel int32nullable

Responses​

Takes in a JSON payload and loads up new model and/or backend

Schema

    _reader_config

    object

    required

    access_token stringnullable
    backend stringnullable
    batch_duration_millis int64nullable
    consumer_group stringrequired
    cuda_graph_cache_capacity int32nullable
    cuda_visible_devices stringnullable
    device stringrequired
    disable_cuda_graph int32nullable
    disable_paged_attention int32nullable
    disable_static int32nullable
    internal_gateway_ip stringnullable
    log_level stringnullable
    max_batch_size int64nullable
    max_sequence_length int32nullable
    model_name stringrequired
    nvlink_unavailable int32nullable
    page_cache_size stringnullable
    quant_type stringnullable
    tensor_parallel int32nullable
    _reader_id stringrequired
Loading...