0.7.1

  • Added a ready flag to the management API GET /reader_groups endpoint that indicates whether the model has finished loading.
  • The Redis maximum memory and the Takeoff single-prompt size limit are now configurable through the environment variables TAKEOFF_REDIS_MAX_MEMORY and TAKEOFF_MAX_PROMPT_STRING_BYTES. They default to 1GB and 30KB respectively.
  • Frontend UIs no longer allow generation requests to be sent to embedding models.
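
As an illustration of the new prompt size limit, the sketch below shows a client-side pre-check that rejects oversized prompts before they are sent. It is not part of Takeoff. The helper names `max_prompt_bytes` and `prompt_fits` are hypothetical, and it assumes that TAKEOFF_MAX_PROMPT_STRING_BYTES holds a plain integer byte count, that the limit applies to the UTF-8 encoded prompt, and that the 30KB default means 30 * 1000 bytes. None of these details are confirmed by the notes above.

```python
import os

# Assumption: the 30KB default is interpreted as 30 * 1000 bytes.
ASSUMED_DEFAULT_MAX_PROMPT_BYTES = 30 * 1000


def max_prompt_bytes(env=None):
    """Return the prompt limit in bytes (hypothetical helper).

    Reads TAKEOFF_MAX_PROMPT_STRING_BYTES from `env` (os.environ by default)
    and falls back to the assumed default when it is unset or blank.
    Assumes the value is a plain integer number of bytes.
    """
    env = os.environ if env is None else env
    raw = env.get("TAKEOFF_MAX_PROMPT_STRING_BYTES")
    if raw is None or not raw.strip():
        return ASSUMED_DEFAULT_MAX_PROMPT_BYTES
    return int(raw)


def prompt_fits(prompt, env=None):
    """Return True if the UTF-8 encoding of `prompt` is within the limit."""
    return len(prompt.encode("utf-8")) <= max_prompt_bytes(env)
```

Measuring the encoded length rather than `len(prompt)` matters for non-ASCII text, where one character can occupy several bytes. The variable is normally set on the Takeoff server, so a client would need the same value supplied to it for this check to match.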