0.7.1
- Added a ready flag to the management API `GET /reader_groups` endpoint, indicating whether the model has finished loading.
- Redis max memory and the Takeoff single-prompt limit are now configurable through the environment variables `TAKEOFF_REDIS_MAX_MEMORY` and `TAKEOFF_MAX_PROMPT_STRING_BYTES`. Their defaults are 1GB and 30KB respectively.
- Generation requests can no longer be sent to embedding models through the frontend UIs.
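
The ready flag on `GET /reader_groups` can be polled before sending traffic. The sketch below is illustrative only: these notes do not describe the response schema, so it assumes the flag appears under a JSON key named `ready` somewhere in the response, and `base_url` is a hypothetical management API address you must supply.

```python
import json
import time
import urllib.request


def collect_ready_flags(payload):
    """Recursively gather every value stored under a key named "ready".

    Assumption: the key name "ready" is a guess at the response schema.
    """
    flags = []
    if isinstance(payload, dict):
        for key, value in payload.items():
            if key == "ready":
                flags.append(bool(value))
            else:
                flags.extend(collect_ready_flags(value))
    elif isinstance(payload, list):
        for item in payload:
            flags.extend(collect_ready_flags(item))
    return flags


def all_ready(payload):
    """True only if at least one ready flag exists and all of them are true."""
    flags = collect_ready_flags(payload)
    return bool(flags) and all(flags)


def wait_until_ready(base_url, timeout_s=300.0, interval_s=2.0):
    """Poll GET {base_url}/reader_groups until every model reports ready."""
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        try:
            with urllib.request.urlopen(f"{base_url}/reader_groups") as resp:
                if all_ready(json.load(resp)):
                    return True
        except (OSError, ValueError):
            pass  # server not reachable yet, or body was not valid JSON
        time.sleep(interval_s)
    return False
```

A caller would invoke `wait_until_ready(...)` with the address of its own management API and only start sending requests once it returns `True`.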