Skip to main content

0.13.1

  • Fixed the configuration issue with the entrypoint for Mistral embedding models.
  • Fixed the issue with continuous batching that was causing performance degradation.
  • Added tokenization endpoint in takeoff.