Security Best Practices

    Istio will encrypt traffic using Mutual TLS whenever possible. However, proxies are configured in by default, meaning they will accept both mutual TLS and plaintext traffic.

    While this is required for incremental adoption or allowing traffic from clients without an Istio sidecar, it also weakens the security stance. It is recommended to migrate to strict mode when possible, to enforce that mutual TLS is used.

    Mutual TLS alone is not always enough to fully secure traffic, however, as it provides only authentication, not authorization. This means that anyone with a valid certificate can still access a service.

    To fully lock down traffic, it is recommended to configure . These allow creating fine-grained policies to allow or deny traffic. For example, you can allow only requests from the app namespace to access the hello-world service.

    Authorization policies

    Istio plays a critical part in Istio security. It takes effort to configure the correct authorization policies to best protect your clusters. It is important to understand the implications of these configurations as Istio cannot determine the proper authorization for all users. Please follow this section in its entirety.

    Use default-deny patterns

    We recommend you define your Istio authorization policies following the default-deny pattern to enhance your cluster’s security posture. The default-deny authorization pattern means your system denies all requests by default, and you define the conditions in which the requests are allowed. In case you miss some conditions, traffic will be unexpectedly denied, instead of traffic being unexpectedly allowed. The latter typically being a security incident while the former may result in a poor user experience, a service outage or will not match your SLO/SLA.

    For example, in the authorization for HTTP traffic task, the authorization policy named allow-nothing makes sure all traffic is denied by default. From there, other authorization policies allow traffic based on specific conditions.

    Use ALLOW-with-positive-matching and DENY-with-negative-match patterns

    Use the ALLOW-with-positive-matching or DENY-with-negative-matching patterns whenever possible. These authorization policy patterns are safer because the worst result in the case of policy mismatch is an unexpected 403 rejection instead of an authorization policy bypass.

    The ALLOW-with-positive-matching pattern is to use the ALLOW action only with positive matching fields (e.g. paths, values) and do not use any of the negative matching fields (e.g. notPaths, notValues).

    The DENY-with-negative-matching pattern is to use the DENY action only with negative matching fields (e.g. notPaths, notValues) and do not use any of the positive matching fields (e.g. paths, values).

    For example, the authorization policy below uses the ALLOW-with-positive-matching pattern to allow requests to path /public:

    The above policy explicitly lists the allowed path (/public). This means the request path must be exactly the same as /public to allow the request. Any other requests will be rejected by default eliminating the risk of unknown normalization behavior causing policy bypass.

    The following is an example using the DENY-with-negative-matching pattern to achieve the same result:

    1. apiVersion: security.istio.io/v1beta1
    2. kind: AuthorizationPolicy
    3. metadata:
    4. name: foo
    5. spec:
    6. action: DENY
    7. rules:
    8. - to:
    9. - operation:
    10. notPaths: ["/public"]

    Understand path normalization in authorization policy

    The enforcement point for authorization policies is the Envoy proxy instead of the usual resource access point in the backend application. A policy mismatch happens when the Envoy proxy and the backend application interpret the request differently.

    A mismatch can lead to either unexpected rejection or a policy bypass. The latter is usually a security incident that needs to be fixed immediately, and it’s also why we need path normalization in the authorization policy.

    For example, consider an authorization policy to reject requests with path /data/secret. A request with path /data//secret will not be rejected because it does not match the path defined in the authorization policy due to the extra forward slash / in the path.

    The request goes through and later the backend application returns the same response that it returns for the path /data/secret because the backend application normalizes the path /data//secret to /data/secret as it considers the double forward slashes // equivalent to a single forward slash /.

    In this example, the policy enforcement point (Envoy proxy) had a different understanding of the path than the resource access point (backend application). The different understanding caused the mismatch and subsequently the bypass of the authorization policy.

    This becomes a complicated problem because of the following factors:

    • Lack of a clear standard for the normalization.

    • Backends and frameworks in different layers have their own special normalization.

    • Applications can even have arbitrary normalizations for their own use cases.

    Istio authorization policy implements built-in support of various basic normalization options to help you to better address the problem:

    • Refer to to understand which normalization options you may want to use.

    • Refer to Customize your system on path normalization to understand the detail of each normalization option.

    • Refer to for alternative solutions in case you need any unsupported normalization options.

    Guideline on configuring the path normalization option

    Case 1: You do not need normalization at all

    Before diving into the details of configuring normalization, you should first make sure that normalizations are needed.

    You do not need normalization if you don’t use authorization policies or if your authorization policies don’t use any path fields.

    You may not need normalization if all your authorization policies follow the safer authorization pattern which, in the worst case, results in unexpected rejection instead of policy bypass.

    Case 2: You need normalization but not sure which normalization option to use

    You need normalization but you have no idea of which option to use. The safest choice is the strictest normalization option that provides the maximum level of normalization in the authorization policy.

    This is often the case due to the fact that complicated multi-layered systems make it practically impossible to figure out what normalization is actually happening to a request beyond the enforcement point.

    You could use a less strict normalization option if it already satisfies your requirements and you are sure of its implications.

    For either option, make sure you write both positive and negative tests specifically for your requirements to verify the normalization is working as expected. The tests are useful in catching potential bypass issues caused by a misunderstanding or incomplete knowledge of the normalization happening to your request.

    Refer to Customize your system on path normalization for more details on configuring the normalization option.

    Case 3: You need an unsupported normalization option

    If you need a specific normalization option that is not supported by Istio yet, please follow Mitigation for unsupported normalization for customized normalization support or create a feature request for the Istio community.

    Customize your system on path normalization

    Istio authorization policies can be based on the URL paths in the HTTP request. Path normalization (a.k.a., URI normalization) modifies and standardizes the incoming requests’ paths, so that the normalized paths can be processed in a standard way. Syntactically different paths may be equivalent after path normalization.

    The configuration is specified via the field in the the mesh config.

    To emphasize, the normalization algorithms are conducted in the following order:

    1. The and other normalization implemented by the normalize_path option in Envoy.
    2. Merge slashes

    While these normalization options represent recommendations from HTTP standards and common industry practices, applications may interpret a URL in any way it chooses to. When using denial policies, ensure that you understand how your application behaves.

    For a complete list of supported normalizations, please refer to .

    Examples of configuration

    Ensuring Envoy normalizes request paths to match your backend services’ expectation is critical to the security of your system. The following examples can be used as reference for you to configure your system. The normalized URL paths, or the original URL paths if NONE is selected, will be:

    1. Used to check against the authorization policies
    2. Forwarded to the backend application
    Your application…Choose…
    Relies on the proxy to do normalizationBASE, MERGE_SLASHES or DECODE_AND_MERGE_SLASHES
    Normalizes request paths based on and does not merge slashesBASE
    Normalizes request paths based on RFC 3986, merges slashes but does not decode slashesMERGE_SLASHES
    Normalizes request paths based on RFC 3986, decodes slashes and merges slashesDECODE_AND_MERGE_SLASHES
    Processes request paths in a way that is incompatible with RFC 3986NONE

    How to configure

    You can use istioctl to update the mesh config:

    1. $ istioctl upgrade --set meshConfig.pathNormalization.normalization=DECODE_AND_MERGE_SLASHES

    or by altering your operator overrides file

    1. $ cat <<EOF > iop.yaml
    2. apiVersion: install.istio.io/v1alpha1
    3. kind: IstioOperator
    4. spec:
    5. meshConfig:
    6. pathNormalization:
    7. normalization: DECODE_AND_MERGE_SLASHES
    8. EOF
    9. $ istioctl install -f iop.yaml

    Alternatively, if you want to directly edit the mesh config, you can add the to the mesh config, which is the istio-<REVISION_ID> configmap in the istio-system namespace. For example, if you choose the option, you modify the mesh config as the following:

    1. apiVersion: v1
    2. data:
    3. mesh: |-
    4. ...
    5. pathNormalization:
    6. normalization: DECODE_AND_MERGE_SLASHES
    7. ...

    This section describes various mitigations for unsupported normalization. These could be useful when you need a specific normalization that is not supported by Istio.

    Please make sure you understand the mitigation thoroughly and use it carefully as some mitigations rely on things that are out the scope of Istio and also not supported by Istio.

    Custom normalization logic

    You can apply custom normalization logic using the WASM or Lua filter. It is recommended to use the WASM filter because it’s officially supported and also used by Istio. You could use the Lua filter for a quick proof-of-concept DEMO but we do not recommend using the Lua filter in production because it is not supported by Istio.

    Example custom normalization (case normalization)

    In some environments, it may be useful to have paths in authorization policies compared in a case insensitive manner. For example, treating https://myurl/get and https://myurl/GeT as equivalent.

    In those cases, the EnvoyFilter shown below can be used to insert a Lua filter to normalize the path to lower case. This filter will change both the path used for comparison and the path presented to the application.

    Writing Host Match Policies

    Istio generates hostnames for both the hostname itself and all matching ports. For instance, a virtual service or Gateway for a host of example.com generates a config matching example.com and example.com:*. However, exact match authorization policies only match the exact string given for the hosts or notHosts fields.

    matching hosts should be written using prefix matches instead of exact matches. For example, for an AuthorizationPolicy matching the Envoy configuration generated for a hostname of example.com, you would use hosts: ["example.com", "example.com:*"] as shown in the below AuthorizationPolicy.

    1. apiVersion: security.istio.io/v1beta1
    2. kind: AuthorizationPolicy
    3. metadata:
    4. name: ingress-host
    5. namespace: istio-system
    6. spec:
    7. selector:
    8. matchLabels:
    9. app: istio-ingressgateway
    10. action: DENY
    11. rules:
    12. - to:
    13. - operation:
    14. hosts: ["example.com", "example.com:*"]

    Additionally, the host and notHosts fields should generally only be used on gateway for external traffic entering the mesh and not on sidecars for traffic within the mesh. This is because the sidecar on server side (where the authorization policy is enforced) does not use the Host header when redirecting the request to the application. This makes the host and notHost meaningless on sidecar because a client could reach out to the application using explicit IP address and arbitrary Host header instead of the service name.

    If you really need to enforce access control based on the Host header on sidecars for any reason, follow with the default-deny patterns which would reject the request if the client uses an arbitrary Host header.

    Specialized Web Application Firewall (WAF)

    Many specialized Web Application Firewall (WAF) products provide additional normalization options. They can be deployed in front of the Istio ingress gateway to normalize requests entering the mesh. The authorization policy will then be enforced on the normalized requests. Please refer to your specific WAF product for configuring the normalization options.

    Feature request to Istio

    If you believe Istio should officially support a specific normalization, you can follow the page to send a feature request about the specific normalization to the Istio Product Security Work Group for initial evaluation.

    Please do not open any issues in public without first contacting the Istio Product Security Work Group because the issue might be considered a security vulnerability that needs to be fixed in private.

    If the Istio Product Security Work Group evaluates the feature request as not a security vulnerability, an issue will be opened in public for further discussions of the feature request.

    Known limitations

    This section lists known limitations of the authorization policy.

    Server-first TCP protocols are not supported

    Server-first TCP protocols mean the server application will send the first bytes right after accepting the TCP connection before receiving any data from the client.

    Currently, the authorization policy only supports enforcing access control on inbound traffic and not the outbound traffic.

    It also does not support server-first TCP protocols because the first bytes are sent by the server application even before it received any data from the client. In this case, the initial first bytes sent by the server are returned to the client directly without going through the access control check of the authorization policy.

    You should not use the authorization policy if the first bytes sent by the server-first TCP protocols include any sensitive data that need to be protected by proper authorization.

    You could still use the authorization policy in this case if the first bytes does not include any sensitive data, for example, the first bytes are used for negotiating the connection with data that are publicly accessible to any clients. The authorization policy will work as usual for the following requests sent by the client after the first bytes.

    Understand traffic capture limitations

    The Istio sidecar works by capturing both inbound traffic and outbound traffic and directing them through the sidecar proxy.

    However, not all traffic is captured:

    • Redirection only handles TCP based traffic. Any UDP or ICMP packets will not be captured or modified.
    • Inbound capture is disabled on many as well as port 22. This list can be expanded by options like traffic.sidecar.istio.io/excludeInboundPorts.
    • Outbound capture may similarly be reduced through settings like traffic.sidecar.istio.io/excludeOutboundPorts or other means.

    In general, there is minimal security boundary between an application and its sidecar proxy. Configuration of the sidecar is allowed on a per-pod basis, and both run in the same network/process namespace. As such, the application may have the ability to remove redirection rules and remove, alter, terminate, or replace the sidecar proxy. This allows a pod to intentionally bypass its sidecar for outbound traffic or intentionally allow inbound traffic to bypass its sidecar.

    As a result, it is not secure to rely on all traffic being captured unconditionally by Istio. Instead, the security boundary is that a client may not bypass another pod’s sidecar.

    For example, if I run the reviews application on port 9080, I can assume that all traffic from the productpage application will be captured by the sidecar proxy, where Istio authentication and authorization policies may apply.

    Defense in depth with NetworkPolicy

    To further secure traffic, Istio policies can be layered with Kubernetes . This enables a strong defense in depth) strategy that can be used to further strengthen the security of your mesh.

    For example, you may choose to only allow traffic to port 9080 of our reviews application. In the event of a compromised pod or security vulnerability in the cluster, this may limit or stop an attackers progress.

    Depending on the actual implementation, changes to network policy may not affect existing connections in the Istio proxies. You may need to restart the Istio proxies after applying the policy so that existing connections will be closed and new connections will be subject to the new policy.

    Securing egress traffic

    A common misconception is that options like outboundTrafficPolicy: REGISTRY_ONLY acts as a security policy preventing all access to undeclared services. However, this is not a strong security boundary as mentioned above, and should be considered best-effort.

    Configure TLS verification in Destination Rule when using TLS origination

    Istio offers the ability to originate TLS from a sidecar proxy or gateway. This enables applications that send plaintext HTTP traffic to be transparently “upgraded” to HTTPS.

    Care must be taken when configuring the DestinationRule’s tls setting to specify the caCertificates, subjectAltNames, and sni fields. The caCertificate can be automatically set from the system’s certificate store’s CA certificate by enabling the environment variable VERIFY_CERTIFICATE_AT_CLIENT=true on Istiod. If the Operating System CA certificate being automatically used is only desired for select host(s), the environment variable VERIFY_CERTIFICATE_AT_CLIENT=false on Istiod, caCertificates can be set to system in the desired DestinationRule(s). Specifying the caCertificates in a DestinationRule will take priority and the OS CA Cert will not be used. By default, egress traffic does not send SNI during the TLS handshake. SNI must be set in the DestinationRule to ensure the host properly handle the request.

    In order to verify the server’s certificate it is important that both caCertificates and subjectAltNames be set.

    Verification of the certificate presented by the server against a CA is not sufficient, as the Subject Alternative Names must also be validated.

    If VERIFY_CERTIFICATE_AT_CLIENT is set, but subjectAltNames is not set then you are not verifying all credentials.

    If no CA certificate is being used, subjectAltNames will not be used regardless of it being set or not.

    For example:

    1. apiVersion: networking.istio.io/v1beta1
    2. kind: DestinationRule
    3. metadata:
    4. name: google-tls
    5. spec:
    6. host: google.com
    7. trafficPolicy:
    8. tls:
    9. mode: SIMPLE
    10. caCertificates: /etc/ssl/certs/ca-certificates.crt
    11. subjectAltNames:
    12. - "google.com"
    13. sni: "google.com"

    When running an Istio gateway, there are a few resources involved:

    • Gateways, which controls the ports and TLS settings for the gateway.
    • VirtualServices, which control the routing logic. These are associated with Gateways by direct reference in the gateways field and a mutual agreement on the hosts field in the Gateway and VirtualService.

    It is recommended to restrict creation of Gateway resources to trusted cluster administrators. This can be achieved by Kubernetes RBAC policies or tools like .

    Avoid overly broad hosts configurations

    When possible, avoid overly broad hosts settings in .

    For example, this configuration will allow any VirtualService to bind to the Gateway, potentially exposing unexpected domains:

    1. servers:
    2. - port:
    3. number: 80
    4. name: http
    5. hosts:
    6. - "*"

    This should be locked down to allow only specific domains or specific namespaces:

    1. servers:
    2. - port:
    3. number: 80
    4. name: http
    5. protocol: HTTP
    6. hosts:
    7. - "foo.example.com" # Allow only VirtualServices that are for foo.example.com
    8. - "default/bar.example.com" # Allow only VirtualServices in the default namespace that are for bar.example.com
    9. - "route-namespace/*" # Allow only VirtualServices in the route-namespace namespace for any host

    Isolate sensitive services

    It may be desired to enforce stricter physical isolation for sensitive services. For example, you may want to run a dedicated gateway instance for a sensitive payments.example.com, while utilizing a single shared gateway instance for less sensitive domains like blog.example.com and store.example.com. This can offer a stronger defense-in-depth and help meet certain regulatory compliance guidelines.

    Explicitly disable all the sensitive http host under relaxed SNI host matching

    It is reasonable to use multiple Gateways to define mutual TLS and simple TLS on different hosts. For example, use mutual TLS for SNI host admin.example.com and simple TLS for SNI host *.example.com.

    If the above is necessary, it’s highly recommended to explicitly disable the http host admin.example.com in the VirtualService that attaches to *.example.com. The reason is that currently the underlying envoy proxy does not require the http 1 header Host or the http 2 pseudo header :authority following the SNI constraints, an attacker can reuse the guest-SNI TLS connection to access admin VirtualService. The http response code 421 is designed for this Host SNI mismatch and can be used to fulfill the disable.

    1. apiVersion: networking.istio.io/v1alpha3
    2. kind: VirtualService
    3. metadata:
    4. name: disable-sensitive
    5. spec:
    6. hosts:
    7. - "admin.example.com"
    8. gateways:
    9. - guestgateway
    10. http:
    11. - match:
    12. - uri:
    13. prefix: /
    14. fault:
    15. abort:
    16. percentage:
    17. value: 100
    18. httpStatus: 421
    19. route:
    20. - destination:
    21. port:
    22. number: 8000
    23. host: dest.default.cluster.local

    Protocol detection

    Istio will automatically determine the protocol of traffic it sees. To avoid accidental or intentional miss detection, which may result in unexpected traffic behavior, it is recommended to where possible.

    CNI

    In order to transparently capture all traffic, Istio relies on iptables rules configured by the istio-init initContainer. This adds a for the NET_ADMIN and NET_RAW capabilities to be available to the pod.

    To reduce privileges granted to pods, Istio offers a which removes this requirement.

    Use hardened docker images

    Istio’s default docker images, including those run by the control plane, gateway, and sidecar proxies, are based on ubuntu. This provides various tools such as bash and curl, which trades off convenience for an increase attack surface.

    Istio also offers a smaller image based on that reduces the dependencies in the image.

    Distroless images are currently an alpha feature.

    In order to ensure your cluster has the latest security patches for known vulnerabilities, it is important to stay on the latest patch release of Istio and ensure that you are on a that is still receiving security patches.

    Detect invalid configurations

    While Istio provides validation of resources when they are created, these checks cannot catch all issues preventing configuration being distributed in the mesh. This could result in applying a policy that is unexpectedly ignored, leading to unexpected results.

    • Run istioctl analyze before or after applying configuration to ensure it is valid.
    • Monitor the control plane for rejected configurations. These are exposed by the pilot_total_xds_rejects metric, in addition to logs.
    • Test your configuration to ensure it gives the expected results. For a security policy, it is useful to run positive and negative tests to ensure you do not accidentally restrict too much or too few traffic.

    Avoid alpha and experimental features

    All Istio features and APIs are assigned a feature status, defining its stability, deprecation policy, and security policy.

    Because alpha and experimental features do not have as strong security guarantees, it is recommended to avoid them whenever possible. Security issues found in these features may not be fixed immediately or otherwise not follow our standard process.

    To determine the feature status of features in use in your cluster, consult the Istio features list.

    Lock down ports

    Istio configures a variety of ports that may be locked down to improve security.

    Istiod exposes a few unauthenticated plaintext ports for convenience by default. If desired, these can be closed:

    • Port 8080 exposes the debug interface, which offers read access to a variety of details about the clusters state. This can be disabled by set the environment variable ENABLE_DEBUG_ON_HTTP=false on Istiod. Warning: many istioctl commands depend on this interface and will not function if it is disabled.
    • Port 15010 exposes the XDS service over plaintext. This can be disabled by adding the --grpcAddr="" flag to the Istiod Deployment. Note: highly sensitive services, such as the certificate signing and distribution services, are never served over plaintext.

    Data Plane

    The proxy exposes a variety of ports. Exposed externally are port 15090 (telemetry) and port 15021 (health check). Ports 15020 and 15000 provide debugging endpoints. These are exposed over localhost only. As a result, the applications running in the same pod as the proxy have access; there is no trust boundary between the sidecar and application.

    To authenticate with the Istio control plane, the Istio proxy will use a Service Account token. Kubernetes supports two forms of these tokens:

    • Third party tokens, which have a scoped audience and expiration.
    • First party tokens, which have no expiration and are mounted into all pods.

    Because the properties of the first party token are less secure, Istio will default to using third party tokens. However, this feature is not enabled on all Kubernetes platforms.

    If you are using istioctl to install, support will be automatically detected. This can be done manually as well, and configured by passing --set values.global.jwtPolicy=third-party-jwt or --set values.global.jwtPolicy=first-party-jwt.

    To determine if your cluster supports third party tokens, look for the TokenRequest API. If this returns no response, then the feature is not supported:

    1. $ kubectl get --raw /api/v1 | jq '.resources[] | select(.name | index("serviceaccounts/token"))'
    2. {
    3. "name": "serviceaccounts/token",
    4. "singularName": "",
    5. "namespaced": true,
    6. "group": "authentication.k8s.io",
    7. "version": "v1",
    8. "kind": "TokenRequest",
    9. "verbs": [
    10. "create"
    11. ]
    12. }

    While most cloud providers support this feature now, many local development tools and custom installations may not prior to Kubernetes 1.20. To enable this feature, please refer to the Kubernetes documentation.

    Configure a limit on downstream connections

    By default, Istio (and Envoy) have no limit on the number of downstream connections. This can be exploited by a malicious actor (see security bulletin 2020-007). To work around you this, you must configure an appropriate connection limit for your environment.

    1. Create a config map by downloading . Update global_downstream_max_connections in the config map according to the number of concurrent connections needed by individual gateway instances in your deployment. Once the limit is reached, Envoy will start rejecting tcp connections.

      1. $ kubectl --namespace istio-system patch deployment istio-ingressgateway --patch "$(cat gateway-patch.yaml)"