GitHub Incident

(githubstatus.com)

87 points | by aggrrrh 2 hours ago

11 comments

  • bakje 1 hour ago
    Perhaps the gemini-cli bot arguing with itself is taking its toll

    https://github.com/google-gemini/gemini-cli/issues/16750

    • MattIPv4 1 hour ago
      https://github.com/google-gemini/gemini-cli/issues/16723 is even worse, GitHub shows `5195 remaining items` in the collapsed timeline.
    • pdimitar 1 hour ago
      I could not resist to put my sarcastic comment about RAM price increases serving a good cause in there.
      • dgxyz 59 minutes ago
        Having just had to buy 4TB of RAM, I appreciate this.
        • MisterTea 49 minutes ago
          That's like 100,000 USD. I keep thinking about making a rap video wearing a 10 TB gold chain surrounded by big booty girls with their naughty bits covered in m.2 SSD's while dissing the AI industry. Though I cant afford the RAM :-/
          • dgxyz 35 minutes ago
            Yep that much. 64Gb DDR5 ECC sticks (128Gb don't exist at the moment apparently). They declined the PO 6 months ago. That'll teach 'em.

            I was pissed that there weren't any sticks heading to the recycling out of the nodes otherwise I would make myself that chain :)

          • zxcvasd 44 minutes ago
            like most rap videos do with cars/jets/mansions, just rent the ram sticks for a few hours!
          • TheJoeMan 33 minutes ago
            It’s sad that I can’t interpret if you mean to actually shoot your rap video on film, or have an AI generate it lol. Either way you’re going to need RAM.
    • lol768 1 hour ago
      Jeez, what a mess. Some of those issues have over 5000 events on them.

      I really hope that didn't send emails out to people.

    • omoikane 25 minutes ago
      Maybe the bots should implement rule of ko.

      https://en.wikipedia.org/wiki/Rules_of_Go#Ko

    • embedding-shape 1 hour ago
      Haha, reminds me off bringing down office mail servers by accidentally creating loops of emails back in the day... What is old is new again, but this time with probabilities :)
    • johnisgood 1 hour ago
      Wonderful, lmao.
  • nullfish 1 hour ago
    I suspect the migration to Azure is continuing to go well
    • ascendantlogic 1 hour ago
      This feels more like Copilot-as-platform-engineer to me
      • DeepYogurt 12 minutes ago
        Github's been running on vibe code for a while now and it's starting to show
    • rvz 1 hour ago
      Yes indeed. 6 years of non-stop outages across the platform every month.

      Even self-hosting would have been more stable than sitting on GitHub as predicted more than half a decade ago. [0]

      Now there is no 'CEO of GitHub' to contact this time (Satya does not care).

      [0] https://news.ycombinator.com/item?id=22867803

    • someguyiguess 1 hour ago
      I did not come to hacker news expecting comedy gold but you have done it my friend!
  • jbverschoor 1 hour ago
    Good thing git is a distributed system
    • dgxyz 55 minutes ago
      Virtually no one knows how to do anything with it outside of github.
      • nine_k 47 minutes ago
        Your favorite search engine or LLM will show you in a second, it's really easy.

        The problem is that it's not enough. The fact that Github uses Git specifically is a technical detail; it could use mercurial equally easily, as Bitbucket used to. Github Actions, OWNERS files, PRs and review tools, issue tracker, wiki are all not Git features.

        • dgxyz 33 minutes ago
          Not a chance. I think you need to spend some time in low ball corporate IT. It's just monkeys throwing faeces at the wall. We only just levered them off subversion...

          (I use Fossil 100% offline for personal projects for ref)

      • TZubiri 27 minutes ago
        You might be surprised, but that's not true at all.

        I once read someone commenting "Nobody writes code by hand without looking syntax up".

        Man, you are just outing yourself as a complete beginner, the field is way deeper than you imagine and it's not even close.

        • dgxyz 12 minutes ago
          Not really. I've been around a while. Git for about 15 years. Subversion before that. Perforce before that. rcs before that (back down to sun3 machines). Mostly Fossil now for personal things.

          What I am saying is that people learn as much as they need to. They generally don't need to know any more git than is required to interact with github. If anything problematic comes up, they go in with a wrecking ball because they don't truly understand what they are doing. And git has a lot of wrecking balls available.

          If you threw them at raw git and asked them to collaborate with someone they'd be up shit creek. They have no idea how SSH or email works for example.

      • Joe_Cool 48 minutes ago
        That's a them problem.
      • tonymet 37 minutes ago
        i still find insightful ways to use git every day. amazing tool. it's a shame for those who only see it as "how to sync my repo with my coworkers"
    • TZubiri 29 minutes ago
      True, workers can still commit to their local git.

      I've been looking into having a separate git server that we can commit to and add plain ole git hooks to, and just having it be synced with github as a clone.

    • nine_k 50 minutes ago
      Git is!

      PRs and code review are not. CI/CD is not.

      I mean, there are solutions, but none of them seems to have a large enough mindshare and efficiency. (Even though Github's code review tools are pretty spartan.)

      • globular-toast 19 minutes ago
        > PRs and code review are not. CI/CD is not.

        They can be. A PR can be made and code review conducted by submitting a patch to a mailing list. That's how the kernel and, I think, git itself is developed.

        CI/CD is really a methodology. It just means integrating/deploying stuff as soon as its ready. So you just need maintainers to be able to run the test suite and deploy, which seems like a really basic thing.

  • corvad 1 hour ago
    Github's recent reliability has honestly been abysmal. Not surprised.
    • ferguess_k 1 hour ago
      Unless some major customers are moving away, I don't think they are going to seriously care about it.
      • pxc 7 minutes ago
        [delayed]
      • corvad 1 hour ago
        I suspect some companies may already be considering it. Especially with the wealth of alternatives today.
        • supriyo-biswas 40 minutes ago
          In my experience companies are moving into GitHub for Copilot and GHA.
          • appplication 30 minutes ago
            GHA maybe, but copilot is just another mid tier player in a congested space.
        • g947o 45 minutes ago
          Companies are already using on-premise GitHub server, if they are using GitHub in the first place. There are many other self hosted solutions which are quite common in enterprise environment.
        • nine_k 52 minutes ago
          What kind of alternatives do you see as viable for large(ish) commercial users?
          • toephu2 43 minutes ago
            GitHub on-prem. Officially called GitHub Enterprise Server. You can have GitHub, but hosted on your own servers.
            • NewJazz 39 minutes ago
              So you still pay them, you do the hosting work, and you get a product with worse features than gitlab?
              • zxcvasd 27 minutes ago
                but you can be smug when theres a github incident, and thats hard to put a price on
  • howToTestFE 41 minutes ago
    If GH has an issue, it seems to always be around 4pm or 5pm GMT. I'm starting to think that i should avoid any planned production releases around this time.
  • tapoxi 1 hour ago
    helm repo add gitlab https://charts.gitlab.io/ && helm upgrade --install gitlab gitlab/gitlab

    I did this in 2019, it avoided so many headaches. CI is better too since there's a nice clean mapping of build -> pod for everything and I can just exec in if something's borked.

    • odie5533 1 hour ago
      Things would have to get really bad before I considered managing my own repositories. Trading someone else's headaches for my own.
      • tapoxi 1 hour ago
        It's not as bad as you think, I run the helm upgrade when patches come out, the backing store is S3 or managed SQL, it runs a nightly k8s cron called gitlab-backup which tarballs the whole thing into an s3 bucket with a single command restore should disaster strike. (This is part of the product, not a thing I wrote.)

        I probably only babysit it for 30 minutes per year, including all the upgrades.

      • nine_k 39 minutes ago
        It depends how high you value your headaches, and how high, your org's downtime. Github not working accrues over the hourly rate of every developer affected, which is likely $70-$100 a hour. 10 hours of outage in a year affecting a team of 10 would cost north of $70k, enough to hire a part-time SRE dedicated just to tend to your Gitlab installation.
        • zxcvasd 23 minutes ago
          >10 hours of outage in a year affecting a team of 10 would cost north of $70k

          10 hours x 10 developers x $70 per hour = $7000, not $70000.

        • TechDebtDevin 36 minutes ago
          [dead]
      • 0xbadcafebee 1 hour ago
        ^ this. the last thing i want is to add to my workload. take my money and make my life easier, even if it means that for one hour every couple months i can't do anything
        • NewJazz 40 minutes ago
          Have you ever actually hosted gitlab?
  • postexitus 1 hour ago
    I believe it is an Azure outage or some type of MS service - everything on Azure is down.
    • zxcvasd 1 hour ago
      having no issues on azure here, seeing no azure incidents on the status page or any of my admin panels
      • deathanatos 42 minutes ago
        > seeing no azure incidents on the status page

        … in all seriousness, that is hardly proof that Azure isn't having an outage.

        • zxcvasd 39 minutes ago
          if i thought it alone was proof enough, i wouldnt have also included the bit about how i was actively using azure.

          its one signal, among others. and in any case, i wasn't trying to prove the parent commenter wrong. i was offering my own signal to the crowd.

      • verst 1 hour ago
        I second this. Not experiencing any Azure issues at this time.
    • ctxc 1 hour ago
      My az services seem to be up.
  • nottimbo 1 hour ago
    Microsoft, it's time to hire some SREs.
    • arm32 1 hour ago
      We did hire some, boss! Soshie, Vizzy and Dexter. They're AI, but they're supposed to be way better than a human SRE. At least that's what the Sintra salesguy told us.
      • rvz 1 hour ago
        So that's what the Tay, and Zoe AI bots were doing all this time after they were cancelled and banned off of Twitter.

        Working on the GitHub Azure migration and for years it's gone so well so far.

    • lenerdenator 1 hour ago
      Why hire anyone to fix a problem when you can make an AI agent to "fix" it, tell investors about it to pump the price, and not fix anything knowing that you have a monopoly?
    • VirusNewbie 57 minutes ago
      Microsoft doesn't pay well enough to attract good SRE talent.
    • aruggirello 1 hour ago
      Clippy to the rescue! :-)
    • ferguess_k 1 hour ago
      Yes we did hire SREs, unfortunately they are in another continent and they only know how to pull others into the chat. We also have some AI too, do you want to try them? They are pretty good SREs, one of them wrote 100K lines of code in a week while another one reviews every line along the way. It was fantastic! Fantastic!! FANTASTIC!!!

      OK I have no idea about MSFT SREs, just to be /s.

  • toephu2 42 minutes ago
    This is why companies should host their own source code on-prem.
  • MadameMinty 1 hour ago
    Angry unicorns seem to be over.
  • phtrivier 1 hour ago
    Fixed in about 30m to an hour.

    Definitely annoying, but I'll try the hot take that, contrary to popular belief, GH is not critical infrastructure - or so I hope.

    Please tell me no part of the Ukrainian air defense system depends on a gh action hook.

    • eddd-ddde 1 hour ago
      You've heard of infrastructure as code, now presenting air strikes as code!

      Need a new secret offensive operation? Create a new JSON file with the coordinates, make a merge request and get Commander approval, merge it, and our new proprietary GitHub action runner will deploy a drone in seconds!

      • philipallstar 52 minutes ago
        This is far too simple. The correct way is to generate an NFT that's a screenshot from Google Maps of where you'd like to hit, and a blockchain-watching AI will spot it, figure out where you probably mean and send the coordinates to the fire control system.
    • vaylian 48 minutes ago
      It's not critical, but there's still a lot of reliance on it.

      It's also the only reason why I still need IPv4.

    • NewJazz 41 minutes ago
      The status page says things are still not fixed.
    • ares623 54 minutes ago
      When millions of man-hours are lost waiting for your service to be back up, I think that deserves a bit of resiliency.