HTTP Add-on Returns 503 Errors During Cold Start

Symptom

When your application is scaled to zero and a new request arrives, the client receives a 502 or 503 error instead of a successful response.

Cause

The HTTP Add-on Interceptor queues requests while the target Deployment has zero replicas. Once it detects at least one ready endpoint, it immediately forwards the queued request without retrying on failure. In an Istio mesh, this creates a race condition during cold start:

The Interceptor receives the request and queues it.
KEDA detects pending requests and scales the Deployment from 0 to 1.
The Pod starts: container pull, init containers, readiness probes, Istio sidecar injection.
The readiness probe passes, but the Istio sidecar or the application is not yet fully ready to handle traffic.
The Interceptor sees a ready endpoint and forwards the queued request.
The upstream returns 503 or refuses the connection.

Additionally, in some edge cases the Istio Ingress Gateway can return a 502 or 503 from the Interceptor if it is temporarily overloaded during scale-up.

Solution

Configure an EnvoyFilter on the Istio Ingress Gateway to transparently retry failed requests during the cold-start window:

yaml

apiVersion: networking.istio.io/v1alpha3
kind: EnvoyFilter
metadata:
  name: my-app-keda-retry
  namespace: istio-system
spec:
  workloadSelector:
    labels:
      istio: ingressgateway
  configPatches:
  - applyTo: HTTP_ROUTE
    match:
      context: GATEWAY
      routeConfiguration:
        vhost:
          name: "my-app.example.com:443"
          route:
            name: ""
    patch:
      operation: MERGE
      value:
        route:
          retry_policy:
            retry_on: "5xx,connect-failure,reset,refused-stream"
            num_retries: 100
            per_try_timeout: 3s

NOTE

The vhost.name must match the actual virtual host name in Envoy's configuration. To find the correct name, run istioctl proxy-config routes <ingressgateway-pod> -o json. The format is typically hostname:port (for example, my-app.example.com:443), not https://hostname.

With this retry policy in place, failed attempts (502, 503, connection refused) are transparently retried. The client receives a successful response once the Pod is ready — up to 100 × 3s = 5 minutes of retries.

Also set KEDA_HTTP_REQUEST_TIMEOUT to 0 (the default) on the Interceptor Deployment. This makes the Interceptor wait indefinitely for the target to scale up, letting the EnvoyFilter retry policy handle all client-facing timeouts.

Istio Service Mesh

Tutorials

Technical Reference

Troubleshooting

Istio Gateways

Configure an mTLS Gateway

Exposing and Securing Workloads

JWT Validation

External Authorization

noAuth Configuration

Custom Resources

APIRule Migration

Technical Reference

Troubleshooting Guides

Working with Multiple Subaccounts

Resources

Tutorials

Troubleshooting

Technical Reference

Runtime Agent

Tutorials

Resources

Tutorials

Register a Service

VPC Peering

Resources

Tutorials

Troubleshooting for the Eventing Module

Resources

Troubleshooting Guides

Troubleshooting

Tutorials

Resources

Technical Reference

Troubleshooting Guides

Telemetry Pipeline API

Collecting Logs

Collecting Traces

Collecting Metrics

Filtering and Processing Data

Transform and Filter with OTTL

Integrate with your OTLP Backend

Architecture

Integration Guides

Resources

Tutorials

Resources

Technical Reference

Tutorials

Resources

Troubleshooting Guides

Technical Reference

Tutorials

Commands

HTTP Add-on Returns 503 Errors During Cold Start ​

Symptom ​

Cause ​

Solution ​

HTTP Add-on Returns 503 Errors During Cold Start

Symptom

Cause

Solution