Connectivity issue for US PGW

Incident Report for Aeris Connectivity Service Status

Postmortem

RCA - TRAC-3353 - IOT Connectivity - Critical - IoT Connectivity Core Network Service Disruption

Aeris understands and regrets the significant service impact experienced by our partners and customers. Please be assured that Aeris is fully committed to making the necessary improvements to prevent a recurrence. We are conducting a comprehensive review of our IoT Connectivity offering, service platforms, and supporting infrastructure, from engineering through operations, to ensure all risks are addressed. A clear action plan will be shared and implemented.

We value our partnership with you and remain dedicated to restoring and maintaining full confidence in our service delivery.

Event Start Date/Time (UTC): 22 June 2026, 13:40

Event End Date/Time (UTC): 23 June 2026, 20:17

Date/Time Reported (UTC): 22 June 2026, 13:40

Severity: Critical (Severity 1)

Services Affected: IOT Connectivity Services

Case Number: TRAC-3353 || ZD#177393

Duration: Up to 31 hours and 13 minutes depending on the service

Description of Failure:

Aeris experienced a multi-stage service disruption affecting components of the IoT Core Network, including Data, and SMS services. The issue was first identified on June 22, 2026, at approximately 13:40 UTC, when Aeris monitoring systems detected alarms on the core network.

Impact varied by product and occurred as sustained service degradation with intermittent recovery, rather than a single continuous outage. Services were fully restored on June 23, 2026, at 20:17 UTC. Post‑restoration monitoring confirmed that key service performance indicators returned to baseline levels.

During the incident, data services experienced intermittent session establishment failures and increased latency, while SMS services experienced delayed delivery and intermittent message failures. Impact severity fluctuated over time and was most pronounced during peak traffic intervals.

The initial trigger for the incident was a fiber cut in the primary transport circuit on June 19th at approximately 20:20 UTC. Following the fiber cut, traffic was rerouted automatically to a backup path, with no immediate service impact observed. However, traffic volumes increased during peak business hours on June 22nd and exposed internal capacity limitations on this backup path. This resulted in packet loss, increased latency, and downstream service degradation across data and SMS services.

The Aeris engineering teams and leadership were engaged continuously around the clock throughout the incident, coordinating across network, platform, and vendor teams. Aeris deployed engineers and data‑center technicians onsite and worked directly with the circuit provider and multiple technology vendors to diagnose the issue and implement mitigation and recovery actions, ultimately restoring full service.

The incident was prolonged during the investigation and restoration phase due, in part, to a software defect in a vendor network component which prevented throughput and congestion alerts from being reported. As a result, neither the Vendor nor Aeris teams were able to promptly observe or identify congestion on the backup path. This lack of visibility delayed fault isolation and corrective actions, thereby increasing the overall restoration time.

Ultimately, the root cause is that Aeris did not adequately update the capacity available on this backup path to meet growing demand in this data center.

Service Impact Summary:

Global GSM / Fusion Global:

  • Data:

    • June 22, 2026, ~14:35 UTC - June 23, 2026, ~11:30 UTC:
    • 20 hours, 55 minutes.
    • Approximately 25% impact observed centered on APNs terminating in the US.
  • Data:

    • June 23, 2026, ~11:30 UTC - June 23, 2026, ~20:17 UTC:
    • 8 hours, 47 minutes.
    • Approximately 75% impact observed centered on APNs terminating in the EU.
  • SMS:

    • Intermittent SMS failures from June 22, 2026, ~14:35 UTC - June 23, 2026, ~5:20 UTC.

Dual-Mode A-LH Service:

  • Data:

    • June 22, 2026, ~14:35 UTC - June 23, 2026, ~11:30 UTC:
    • 20 hours, 55 minutes.
    • Experienced significant service degradation. During this period, devices often registered successfully but were unable to pass user‑plane data or experienced intermittent connectivity.
  • SMS (MO‑SMS only):

    • June 22, 2026, ~14:35 UTC - June 23, 2026 ~ 21:28 UTC:
    • 30 hours and 53 minutes.
    • Experienced a limited impact affecting MO‑SMS only. MT‑SMS and delivery receipts were not impacted.

Fusion NA:

  • Data:

    • June 22, 2026, ~14:35 UTC - June 23, 2026 ~11:30 UTC:
    • 20 hours, 55 minutes.
    • Experienced significant service degradation.
  • SMS (MO, MT, and delivery receipts):

    • June 22, 2026, ~16:00 UTC - June 23, 2026, ~23:13 UTC. SMS services were completely unavailable during this time.
    • 31 hours and 13 minutes.
    • SMS services were completely unavailable during this period, impacting MO‑SMS, MT‑SMS, and delivery receipts.

Impairment Cause:

The failover in traffic routing to the backup network path caused congestion on core network nodes during peak traffic, which led to the service impairment.

Impairment Resolution:

The service issue was resolved through a series of corrective actions. Steps were taken to stabilize the network, including adjusting hardware settings, adding capacity where needed, and improving network configurations. Service stability was restored by June 23, 2026, at 20:17 UTC for the last affected service.

Immediately after our vendor fixed the fiber cut, the primary physical link was restored during a maintenance activity (177725 / IOTCHG‑10256) on June 25, 2026.

Corrective Action Items:

Action 1:

Backup Path Capacity Audit:

Conduct a comprehensive review to ensure backup capacity meets current demand

Action 1 Completion Date: In Progress / July 10th 2026

Action 2:

Implement Corrective Actions from Capacity Audit

Action 2 Completion Date: Target August 15th for all critical actions

Action 3:

Monitoring and Diagnostics:

Conduct audit of monitoring along the backup path and identify improvements to ensure necessary logs and metrics are available

Action 3 Completion Date: In Progress / July 17th 2026

Action 4:

Establish regular testing of the backup path

Action 4 Completion Date: Planned delivery - Aug 30, 2026

Posted Jun 30, 2026 - 15:30 UTC

Resolved

After being placed under extended monitoring, the incident has now been successfully resolved. Aeris engineers have confirmed that full system stability has been restored and redundancy has been fully normalized. Please see the details below for additional information

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 25, 2026 - 17:52 UTC

Monitoring

UPDATE 28: This is to provide an update: the Aeris team has taken corrective action and has been observing service recovery since 20:17 UTC. Aeris will continue to monitor the situation and provide further updates as more information becomes available.

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 20:43 UTC

Update

UPDATE 27: This is an update regarding the earlier reported incident. Aeris Engineering continues to closely monitor service conditions while actively working to resolve an issue impacting Fusion Global and Global GSM SIM data session traffic. Further updates will be provided as new information becomes available

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 19:28 UTC

Update

UPDATE 26: This message provides a status update on the previously reported incident. Aeris Engineering is actively engaged in monitoring service health and addressing an issue affecting Fusion Global and Global GSM SIM data session traffic. Additional updates will be shared as more details become available.

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 18:27 UTC

Update

UPDATE 25: This is to provide an update on the incident reported earlier. Aeris Engineering continues to monitor and validate all services. Aeris Engineering has observed an issue impacting Fusion Global and Global GSM SIM data sessions. The Aeris team continues to investigate, and we will provide further updates as more information becomes available.

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 17:41 UTC

Identified

UPDATE 24: This is to provide an update on the incident reported earlier. Aeris Engineering continues to monitor and validate all services. We will provide further updates as more information becomes available.

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 16:37 UTC

Update

UPDATE [23]: All 6 VMs are up, with AAA traffic passing normally. Entering a 3-hours monitoring period to verify complete service restoration to previous baselines. Health checks are ongoing.

Description: [Connectivity issue for US PGW]

Incident case number: [177393]
Posted Jun 23, 2026 - 15:20 UTC

Update

UPDATE 22: Aeris engineers are currently addressing an authentication and routing issue specifically affecting data sessions on a specific partner. Our team has implemented a partial fix by performing a targeted restart of our authentication servers, which has successfully allowed some affected devices to establish sessions and resume data traffic. To fully restore connectivity, we are now proceeding with a sequential, rolling restart of the remaining authentication servers to clear any stale sessions.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 14:01 UTC

Update

UPDATE 21: Aeris team continue to troubleshoot the remaining issues, Currently we can observe gradual recovery for the session count and traffic across multiple partners. Further updates will be provided once they become available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 13:07 UTC

Update

UPDATE 21: Aeris team performed the reboot at 11:25 AM UTC having partially restored services, we are still looking into the remaining issues, new updates will be provided as soon as they become available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 12:17 UTC

Update

UPDATE 20: This is to inform you that Aeris will shortly begin the process of controlled restart of the core routing component. More updates will be provided as soon as they become available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 11:19 UTC

Update

UPDATE 19: The Aeris team continues to evaluate the network environment and is preparing for a necessary controlled restart of a core routing component to potentially resolve this issue. We will share further updates as soon as they become available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 11:08 UTC

Update

UPDATE 18: In our continued efforts to fully stabilize the network, Aeris engineers have determined that a controlled restart of a core routing component is necessary to resolve the intermittent packet drops observed under high traffic loads. To prevent further service degradation, we will consider to have this controlled restart within the next hour. Further updates will be provided once the assessment is completed.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 10:13 UTC

Monitoring

UPDATE 17: Aeris engineers continue to make progress, and we are seeing a positive recovery trend for affected services. Core network routing is currently operational; however, our teams are observing some intermittent packet drops as traffic levels increase. We are actively running load simulations and fine-tuning network parameters to ensure complete stability under normal traffic loads.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 09:11 UTC

Update

UPDATE 16: Aeris engineers have successfully stabilized the core network routing infrastructure. While this has restored baseline physical connectivity, we are currently investigating a secondary issue related to device authentication and session setup. Recent internal testing shows positive results with sessions now establishing successfully. We are actively monitoring traffic flow and working with partners to confirm full service recovery. No ETR is available at this time. Further updates will be provided once they become available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 08:28 UTC

Update

UPDATE 15: Aeris engineers are actively working to identify the root cause of the issue. Based on current observations, no packet loss or packet drop errors have been detected. The team is also evaluating potential networking workarounds to help minimize service disruption while a permanent resolution is being developed. At this time, there is no estimated time for resolution (ETR) available. We will continue to provide updates as more information becomes available.

Please see below for the impacted services:

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 07:35 UTC

Update

UPDATE 14: Aeris engineers are actively working on identifying the root cause and evaluating possible networking workarounds to minimize service disruption while a permanent resolution is developed. No ETR is available at this time. Please see below for the services that are impacted.

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 06:46 UTC

Update

UPDATE 13: Aeris engineers are actively working on identifying the root cause and evaluating possible networking workarounds to minimize service disruption while a permanent resolution is developed. No ETR is available at this time. Please see below for the services that are impacted.

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 05:30 UTC

Update

UPDATE 12: Aeris engineers are actively working on identifying the root cause and evaluating possible networking workarounds to minimize service disruption while a permanent resolution is developed. No ETR is available at this time. Please see below for the services that are impacted.

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 04:26 UTC

Update

UPDATE 11: Aeris engineers are actively working on identifying the root cause and evaluating possible networking workarounds to minimize service disruption while a permanent resolution is developed. No ETR is available at this time. Please see below for the services that are impacted.

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 03:30 UTC

Update

UPDATE 10: Aeris engineers are actively working on identifying the root cause and evaluating possible networking workarounds to minimize service disruption while a permanent resolution is developed. No ETR is available at this time. Please see below for the services that are impacted.

Description: Connectivity issue for US PGW

Incident case number: 177393
Posted Jun 23, 2026 - 02:26 UTC

Update

UPDATE 9: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted. Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Description: Connectivity issue for US PGW

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 01:23 UTC

Update

UPDATE 8: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 23, 2026 - 00:26 UTC

Update

UPDATE 7: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 23:24 UTC

Update

UPDATE 6: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 22:21 UTC

Identified

UPDATE 5: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 21:36 UTC

Update

UPDATE 4: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Aeris has escalated this incident to our highest internal severity level. Initial remote troubleshooting, including controlled equipment restarts, did not successfully restore the physical connectivity state within our US core infrastructure. On-site data center technicians are currently en route. Upon arrival, they will physically inspect the equipment and provide further details regarding the state of it.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 19:55 UTC

Update

UPDATE 3: Aeris engineers continue to troubleshoot the issue. Additional escalation to senior management team was already engaged. No ETR is available at this time. Please see below for the services that are impacted.

Description: Our engineering team is actively working with our hardware vendor and on-site data center technicians to isolate the physical connectivity issue within our US core infrastructure and work towards a resolution.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 19:07 UTC

Update

UPDATE 2: Aeris engineers continue to troubleshoot the issue. Additional escalation measures have been taken. No ETR is available at this time. Please see below for the services that are impacted.

Description: Aeris engineers have identified a hardware and routing issue within the US core infrastructure causing the current service disruption. Consequently, the incident severity has been upgraded to Critical. Our teams are actively engaged with our hardware vendors to perform diagnostics and replace the affected physical components if needed to restore services. No ETR is available at this time. Please see below for the services that are impacted.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 18:17 UTC

Investigating

UPDATE 1: Aeris engineers continue to troubleshoot the issue. Additional escalation measures have been taken. No ETR is available at this time. Please see below for the services that are impacted.

Description: Aeris is currently troubleshooting an issue regarding the US PGW. More details will be provided once they become available.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 17:10 UTC

Identified

UPDATE 1: Aeris engineers continue to troubleshoot the issue. Additional escalation measures have been taken. No ETR is available at this time. Please see below for the services that are impacted.

Description: Aeris is currently troubleshooting an issue regarding the US PGW. More details will be provided once they become available.

Incident case number: 177393

If you have any additional questions, please contact support@aeris.net and reference the case number above.
Posted Jun 22, 2026 - 16:58 UTC

Investigating

Customer Reported at 14:33 UTC

Description: Connectivity issue for US PGW
Connectivity issue for US PGW. The Aeris technical team is actively troubleshooting to restore services. Further updates will be provided as soon as it become available.

Incident case number: 177393
Posted Jun 22, 2026 - 15:21 UTC
This incident affected: Dual-Mode A-LH Service (Data, SMS), Fusion NA IoT Service (Data, SMS), Fusion Global IoT Service (Data, SMS), Global GSM Services (US Market) (Data, SMS), and Global GSM Services (UK, European and African Markets) (Data, SMS).