Status | Blacksmith - Notice history

Blacksmith Managed Runners - Operational

100% - uptime
Aug 2025 · 99.98%Sep · 99.89%Oct · 100.0%
Aug 2025
Sep 2025
Oct 2025

Incremental Docker Builders - Operational

100% - uptime
Aug 2025 · 99.99%Sep · 99.89%Oct · 99.99%
Aug 2025
Sep 2025
Oct 2025

API - Operational

100% - uptime
Aug 2025 · 100.0%Sep · 99.89%Oct · 100.0%
Aug 2025
Sep 2025
Oct 2025

Website - Operational

100% - uptime
Aug 2025 · 100.0%Sep · 99.98%Oct · 99.79%
Aug 2025
Sep 2025
Oct 2025

Github → Actions - Operational

Github → API Requests - Operational

Github → Webhooks - Operational

Notice history

Oct 2025

Increased queue times in the US region
  • Resolved
    Resolved

    This incident has been resolved, queue times are back to normal. The longer queue times were a combination of an outage with our upstream DB provider and a need for us to scale up our fleet to absorb higher than normal traffic. We are working with our DB provider to prevent such an outage in the future, and have scaled up our fleet to prevent such saturation in the future.

    We sincerely apologize for the prolonged queueing that our customers saw today. We take such a prolonged outage very seriously and will work on hardening our systems based on our discoveries from today. If you still see any queued jobs we would recommend canceling and re-triggering jobs to see normal adoption.

  • Update
    Update

    Queue times are stabilizing and we're continuing to monitor.

  • Update
    Update

    We are monitoring increased queue times in relation to an incident with our database provider and are working towards a fix

  • Update
    Update

    We are continuing to monitor increased queue times.

  • Update
    Update

    We are continuing to monitor increasing queue times. Our team is working on mitigations to bring times back down to normal.

  • Monitoring
    Monitoring

    Increased queue times continue but we are seeing improvement and are monitoring the situation.

  • Update
    Update

    We are continuing to investigate increased queue times in the US and are working to resolve this.

  • Update
    Update

    We are continuing to investigate increased queue times in the US

  • Investigating
    Investigating

    We are being notified of extended queue times in the US region, and are looking into this.

Sep 2025

GitHub Actions is having an outage
  • Resolved
    Resolved

    This incident has been resolved, queue times are back to normal.

  • Monitoring
    Monitoring

    GitHub has resolved the incident, and job dispatch has returned to normal. Our infrastructure is fully operational and new jobs are being picked up without delay. Earlier, you may have noticed some queued jobs being canceled as part of our mitigation efforts to prevent long backlogs -- this ensured new jobs could start processing quickly. Retrying canceled jobs should see them run normally once again. We’ll continue to keep an eye on metrics, but no further customer impact is expected. We apologize for the inconvenience.

  • Identified
    Identified

    We are about to issue a cancelation for some subset of queued jobs across orgs. This is in an effort to bring queue times back to normal by trimming down the backlog. Customers can retry the canceled jobs and expect normal operation thereafter.

  • Update
    Update
    Even though GitHub has reported their incident as resolved, but we’re still seeing delayed job starts for a fraction of jobs. Things are steadily improving as the backlog drains, but you may continue to see slower job starts until recovery is complete.
  • Update
    Update

    We're seeing a massive backlog of queued jobs due to the outage, which is now slowly draining, so you may see jobs being slower to get picked up for the next few minutes.

GitHub Actions checkout are taking longer than usual
  • Resolved
    Resolved

    This incident has been resolved. We will be providing a postmortem to affected customers by tomorrow.

  • Monitoring
    Monitoring
    We implemented a fix and are currently monitoring the result.
  • Identified
    Identified
    We are continuing to work on a fix for this incident.
  • Monitoring
    Monitoring

    Our upstream ISP identified some issues between them and their network vendor. They have made changes to mitigate these blips while working on a longer term resolution. We will share more details as we receive them. All network operations in Blacksmith runners have returned to normal. We are actively monitoring the situation.

  • Update
    Update

    We are being alerted about intermittent degradation in our network stack in one of our regions. This may be sporadically manifesting as slower docker pulls, interactions with certain language mirrors, git checkouts. We are investigating the source of this issue.

  • Investigating
    Investigating
    We're hearing this from several customers today that they are seeing elevated `actions/checkout` times. We are noticing this in our local development as well. This is very likely an upstream degradation with the GItHub control plane but we are investigating what we can do to mitigate these.

Aug 2025

Aug 2025 to Oct 2025

Next