Metrics
We use Prometheus as our chosen integration for aggregation of metrics from your LLM applications. This is an open source time series database and monitoring solution.
Installing
Prerequisites
- Kube Prometheus Stack - we have our own opinionated configuration: Doubleword Observability Chart. Which can be installed with:
helm repo add doublewordai https://doublewordai.github.io/helm-charts
helm install observability doublewordai/observability
Alert Integrations
We currently support Slack and Incident.io for alerting, you can add your webhooks via Kubernetes secrets using the following commands:
kubectl create secret generic slack-webhook --from-literal=url=<your-slack-webhook-url> --namespace monitoring
kubectl create secret generic incident-io-webhook --from-literal=url=<your-incident-io-webhook-url> --from-literal=token=<your-incident-io-token> --namespace monitoring
Configuration
To setup Prometheus to scrape metrics from your LLM applications add these values to your Inference Stack:
serviceMonitor:
enabled: true