Cluster administrators can view the following metrics for Knative Serving components.
You can use the following metrics to understand how applications respond when traffic passes through the activator.
Metric name | Description | Type | Tags | Unit |
---|---|---|---|---|
|
The number of concurrent requests that are routed to the activator, or average concurrency over a reporting period. |
Gauge |
|
Integer (no units) |
|
The number of requests that are routed to activator. These are requests that have been fulfilled from the activator handler. |
Counter |
|
Integer (no units) |
|
The response time in milliseconds for a fulfilled, routed request. |
Histogram |
|
Milliseconds |
The autoscaler component exposes a number of metrics related to autoscaler behavior for each revision. For example, at any given time, you can monitor the targeted number of pods the autoscaler tries to allocate for a service, the average number of requests per second during the stable window, or whether the autoscaler is in panic mode if you are using the Knative pod autoscaler (KPA).
Metric name | Description | Type | Tags | Unit |
---|---|---|---|---|
|
The number of pods the autoscaler tries to allocate for a service. |
Gauge |
|
Integer (no units) |
|
The excess burst capacity served over the stable window. |
Gauge |
|
Integer (no units) |
|
The average number of requests for each observed pod over the stable window. |
Gauge |
|
Integer (no units) |
|
The average number of requests for each observed pod over the panic window. |
Gauge |
|
Integer (no units) |
|
The number of concurrent requests that the autoscaler tries to send to each pod. |
Gauge |
|
Integer (no units) |
|
The average number of requests-per-second for each observed pod over the stable window. |
Gauge |
|
Integer (no units) |
|
The average number of requests-per-second for each observed pod over the panic window. |
Gauge |
|
Integer (no units) |
|
The number of requests-per-second that the autoscaler targets for each pod. |
Gauge |
|
Integer (no units) |
|
This value is |
Gauge |
|
Integer (no units) |
|
The number of pods that the autoscaler has requested from the Kubernetes cluster. |
Gauge |
|
Integer (no units) |
|
The number of pods that are allocated and currently have a ready state. |
Gauge |
|
Integer (no units) |
|
The number of pods that have a not ready state. |
Gauge |
|
Integer (no units) |
|
The number of pods that are currently pending. |
Gauge |
|
Integer (no units) |
|
The number of pods that are currently terminating. |
Gauge |
|
Integer (no units) |
Each Knative Serving control plane process emits a number of Go runtime memory statistics (MemStats).
The |
Metric name | Description | Type | Tags | Unit |
---|---|---|---|---|
|
The number of bytes of allocated heap objects. This metric is the same as |
Gauge |
|
Integer (no units) |
|
The cumulative bytes allocated for heap objects. |
Gauge |
|
Integer (no units) |
|
The total bytes of memory obtained from the operating system. |
Gauge |
|
Integer (no units) |
|
The number of pointer lookups performed by the runtime. |
Gauge |
|
Integer (no units) |
|
The cumulative count of heap objects allocated. |
Gauge |
|
Integer (no units) |
|
The cumulative count of heap objects that have been freed. |
Gauge |
|
Integer (no units) |
|
The number of bytes of allocated heap objects. |
Gauge |
|
Integer (no units) |
|
The number of bytes of heap memory obtained from the operating system. |
Gauge |
|
Integer (no units) |
|
The number of bytes in idle, unused spans. |
Gauge |
|
Integer (no units) |
|
The number of bytes in spans that are currently in use. |
Gauge |
|
Integer (no units) |
|
The number of bytes of physical memory returned to the operating system. |
Gauge |
|
Integer (no units) |
|
The number of allocated heap objects. |
Gauge |
|
Integer (no units) |
|
The number of bytes in stack spans that are currently in use. |
Gauge |
|
Integer (no units) |
|
The number of bytes of stack memory obtained from the operating system. |
Gauge |
|
Integer (no units) |
|
The number of bytes of allocated |
Gauge |
|
Integer (no units) |
|
The number of bytes of memory obtained from the operating system for |
Gauge |
|
Integer (no units) |
|
The number of bytes of allocated |
Gauge |
|
Integer (no units) |
|
The number of bytes of memory obtained from the operating system for |
Gauge |
|
Integer (no units) |
|
The number of bytes of memory in profiling bucket hash tables. |
Gauge |
|
Integer (no units) |
|
The number of bytes of memory in garbage collection metadata. |
Gauge |
|
Integer (no units) |
|
The number of bytes of memory in miscellaneous, off-heap runtime allocations. |
Gauge |
|
Integer (no units) |
|
The target heap size of the next garbage collection cycle. |
Gauge |
|
Integer (no units) |
|
The time that the last garbage collection was completed in Epoch or Unix time. |
Gauge |
|
Nanoseconds |
|
The cumulative time in garbage collection stop-the-world pauses since the program started. |
Gauge |
|
Nanoseconds |
|
The number of completed garbage collection cycles. |
Gauge |
|
Integer (no units) |
|
The number of garbage collection cycles that were forced due to an application calling the garbage collection function. |
Gauge |
|
Integer (no units) |
|
The fraction of the available CPU time of the program that has been used by the garbage collector since the program started. |
Gauge |
|
Integer (no units) |