0.7.1

November 30, 2023

Added a ready flag to the management API's GET /reader_groups endpoint, indicating whether a model has finished loading.
The Redis maximum memory and Takeoff's single-prompt size limit are now configurable through the environment variables TAKEOFF_REDIS_MAX_MEMORY and TAKEOFF_MAX_PROMPT_STRING_BYTES, which default to 1GB and 30KB respectively.
Frontend UIs no longer allow generation requests to be sent to embedding models.
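
One way a client might use the new ready flag is to poll GET /reader_groups until loading has finished. The sketch below is illustrative only: the URL (host and port) is a placeholder, the helper names are hypothetical, and because the response schema is not described in these notes, the code simply searches the parsed JSON for any key named "ready" rather than assuming a particular layout.

```python
import json
import time
import urllib.request

# Assumption: placeholder address for the management API; adjust to your deployment.
MANAGEMENT_URL = "http://localhost:3001/reader_groups"


def find_ready_flags(payload):
    """Collect every value stored under a "ready" key, at any nesting depth."""
    flags = []
    if isinstance(payload, dict):
        for key, value in payload.items():
            if key == "ready":
                flags.append(bool(value))
            else:
                flags.extend(find_ready_flags(value))
    elif isinstance(payload, list):
        for item in payload:
            flags.extend(find_ready_flags(item))
    return flags


def all_ready(payload):
    """True only if at least one ready flag is present and all of them are true."""
    flags = find_ready_flags(payload)
    return bool(flags) and all(flags)


def wait_until_ready(url=MANAGEMENT_URL, timeout_s=300.0, poll_s=2.0):
    """Poll the endpoint until every ready flag is true or the timeout expires."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(url, timeout=10) as resp:
                if all_ready(json.load(resp)):
                    return True
        except (OSError, ValueError):
            pass  # server unreachable or response not valid JSON yet; retry
        time.sleep(poll_s)
    return False
```

Calling wait_until_ready() before sending the first request avoids hitting a model that is still loading. Treating a response with no ready flags as "not ready" is a deliberately conservative choice in this sketch.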