Status | Blacksmith - Network degradation for runners in our US region – Incident details

Network degradation for runners in our US region

Resolved
Degraded performance
Started 2 days agoLasted about 13 hours

Affected

Blacksmith Managed Runners

Degraded performance from 1:17 AM to 3:01 AM, Operational from 3:01 AM to 2:41 PM

Github → API Requests

Updates
  • Resolved
    Resolved

    This incident has been resolved, network health is back to normal in our US fleet.

  • Monitoring
    Monitoring

    We have been informed that the upstream networking issue has now been resolved and service should be returning to normal. We are working with our vendor to better understand how to avoid this in the future.

  • Update
    Update

    Our vendor is actively working on a fix for this network degradation. In the mean time we are removing all the affected servers from our fleet to help bring service back to normal.

  • Identified
    Identified

    We are specifically seeing packets being dropped when downloading from api.github.com. We continue to look into the source of the degradation.

  • Investigating
    Investigating

    We are currently investigating reports from an upstream vendor of a degraded network stack in our US region. This could manifest as hanging GitHub Actions runners or slow/stuck steps in the workflow.