Configuring gradual roll-out of traffic to Revisions
This might make the request queue too long, either at the QP or Activator, and cause the requests to expire or be rejected by the QP.
Knative provides a parameter, which can be used to gradually shift traffic to the latest Revision, preventing requests from being queued or rejected. Affected Configuration targets are rolled out to 1% of traffic first, and then in equal incremental steps for the rest of the assigned traffic.
Note
rollout-duration
is time-based, and does not interact with the autoscaling subsystem.
You can configure the rollout-duration
parameter per Knative Service or Route by using an annotation.
Tip
For information about global, ConfigMap configurations for roll-out durations, see the Administration guide.
During a rollout, the system updates the Route and Knative Service status conditions. Both the traffic
and conditions
status parameters are affected.
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
...
spec:
...
traffic:
- percent: 55
configurationName: config # Pinned to latest ready Revision
Initially 1% of the traffic is rolled out to the Revisions:
Then the rest of the traffic is rolled out in increments of 18%:
apiVersion: serving.knative.dev/v1
kind: Service
metadata:
...
spec:
...
traffic:
- percent: 36
revisionName: config-00008
- percent: 19
revisionName: config-00009
- percent: 45
The rollout continues until the target traffic configuration is reached:
During the rollout, the Route and Knative Service status conditions are as follows:
kind: Service
metadata:
...
spec:
...
status:
conditions:
...
- lastTransitionTime: "..."
message: A gradual rollout of the latest revision(s) is in progress.
reason: RolloutInProgress
status: Unknown