Skip to main content

0.7.0

  • New model memory calculator to inference frontend! You can calculate to see if your models will fit on your hardware with desired sequence length and batch size.
  • Hash history for inference and management apps to fix getting a 404 when refreshing a sub-page of app.