Impact Start Time (UTC): 2018-Oct-2 00:16
Impact End Time (UTC): 2018-Oct-2 00:58
A capacity-related incident affecting the Apigee NAT gateway tier in the US-East region caused a service-impacting issue for API runtime traffic across multiple customers. No other regions were impacted, and all other services were working as expected during this time. During the outage, CPU resources on the NAT gateway units were exhausted by an unusually large increase in API traffic. The NAT servers remained functional and continued to process some traffic, but under this load they registered high packet loss, which caused severe request processing latency and/or an increase in HTTP 503 error rates for the duration of the outage. The source of the traffic increase was isolated, and corrective actions were taken to mitigate the impact. A full RCA has been completed, and follow-up items have been prioritized to help reduce the risk of recurrence.
If you were impacted by this incident and would like a copy of the more detailed Incident Report, please open a P3 support case with Apigee Support.