Monologue: LLM Code Is Already Breaking Big Tech

12 min

•Mar 20, 20264 months ago

Summary

Ed Zittron argues that LLM-generated code is creating a crisis at major tech companies like Meta and Amazon, where non-technical workers and rushed engineers are deploying untested AI-written code to production, causing security breaches and massive outages. He warns that this practice is unsustainable, creates unmaintainable technical debt, and will lead to systemic failures across the software industry.

Insights

Non-technical workers using LLMs to write production code without understanding it creates a compounding technical debt crisis that will take years to repair
Management pressure to ship code faster using LLMs incentivizes engineers to skim-read or skip code review entirely, bypassing safety mechanisms
LLMs hallucinate features and solutions based on training data, making them unreliable for novel problems and creating hidden bugs that are difficult to debug
The conflation of 'fast' with 'good' by executives unfamiliar with software engineering is driving reckless LLM adoption despite documented failures at scale
Friction in the development process (time to understand, review, maintain) is being removed without replacing it with quality controls, creating systemic risk

Trends

LLM-driven code quality degradation at hyperscale tech companies causing production incidentsManagement mandates and surveillance of LLM usage creating pressure on engineers to use tools irresponsiblySecurity incidents and outages directly attributed to AI-assisted code deployment without proper change managementShift from software engineering to code generation without comprehension or intentionalityIncreasing technical debt from high-volume, low-quality AI-generated code accumulationLoss of institutional knowledge as companies rely on LLMs to understand their own codebasesVenture capital narrative driving adoption of LLMs as 'software engineer replacement' despite evidence of failureOpen source projects being flooded with low-quality LLM-generated pull requests from overconfident contributors

Topics

LLM Code Generation RisksTechnical Debt and Code MaintenanceAI-Assisted Software DevelopmentCode Review and Quality AssuranceProduction Incident ManagementSecurity Vulnerabilities from AI CodeNon-Technical Workers Writing CodeManagement Pressure and VelocitySoftware Architecture UnderstandingHallucinations in Language ModelsOpen Source Quality DegradationChange Management FailuresInstitutional Knowledge LossSoftware Engineering EthicsAI Governance in Tech Companies

Companies

Meta

Security incident where in-house AI agent posted unauthorized technical advice to internal forum, exposing sensitive ...

Amazon

Multiple major outages caused by AI tool Q, resulting in 120,000 lost orders and 6.3 million lost orders across marke...

Microsoft

Referenced as hyperscaler allowing non-technical workers to deploy LLM-generated code to consumer-facing products

Spotify

CEO stated top developers are no longer writing code, relying instead on LLM code generation

OpenAI

Referenced as creator of LLM technology being used for code generation at major tech companies

GitHub

Open source projects receiving influx of low-quality LLM-generated pull requests from overconfident contributors

People

Ed Zittron

Delivers monologue warning about systemic risks of LLM-generated code in production at major tech companies

Mo Batar

Recently discussed how LLMs are designed to galvanize and glaze users into accepting overwritten code

Quotes

"LLMs do not understand anything, nor do they think, which means any solutions they build or theoretical bug reports that they may make are immediately questionable."

Ed Zittron

"The idea of having any number of non-technical people ship code is fucking insane, an indicative of an overwhelming ignorance on the part of management."

Ed Zittron

"Generative code is a digital ecological disaster one that will take years to repair thanks to company remits to write as much code as fast as possible."

Ed Zittron

"LLMs convince you that what you're writing is good and stable and does the thing you want it to, and if you're skim reading the outputs or unable to read them at all, it's easy for you to assume that you actually did so."

Ed Zittron

"The push from above to use these models because they can write code fast and a human is a disastrous conflation of fast and good."

Ed Zittron

Full Transcript

This is an iHeart podcast. Guaranteed human. Hey there, this is Josh from Stuff You Should Know with a message that could change your life. The Stuff You Should Know ThinkSpring podcast playlist is available now. Whether Spring has sprung in your neck of the woods yet or not, the Stuff You Should Know ThinkSpring playlist will make you want to get your overalls on, get outside, and get your hands in the dirt. You can get the Stuff You Should Know ThinkSpring playlist on the iHeart radio app, Apple podcasts, or wherever you get your podcasts. With performances by Alex Warren, Kehlani, Laini Wilson, Ludacris, Ray, TLC, Salt and Pepper, and Invoke. Plus Taylor Swift makes her first award show appearance this year. Also Gold Medal Olympian, Alyssa Liu, Neo, Nick Colesure Singer, Nikki Glaser, Sombra, Weiser, and more. Watch live on Fox, Thursday, March 26th, at 8, 7, Central. And listen on iHeart radio stations across America and the free iHeart app. Hello and welcome to this week's Better Offline monologue. I'm your host Ed Zittron. I heard something really worrying the other day about a major hyperscaler. According to the source, said hyperscaler was allowing and even encouraging non-technical workers to deploy code to consumer facing products, specifically those who cannot read or write code, vibe coding their own projects using generative AI, with their code at some point theoretically reviewed by an actual software engineer before it gets pushed to production. The cold comfort of that review is that it assumes that software engineers, or at least the ones reviewing that code, are actually adept at code review, or even if they are, that they have sufficient time to look over the overly verbose code that LLM spew. In some cases, I've heard, management is actively encouraging and even mandating these non-technical workers to use LLMs to make these features, creating a mutation of tech debt where somebody who cannot code uses a machine that doesn't think to create code with no intention, that nobody really understands, and does so at such a velocity that it burdens the actual technical workers with constantly having to monitor and fix it. LLMs do not understand anything, nor do they think, which means any solutions they build or theoretical bug reports that they may make are immediately questionable. Their hallucinations are such that even features you believe are part of your code, after all, you can't read it, might not be there, or might be poorly designed, or might have some sort of unforeseen problem that neither you nor the LLM are aware of, because its training data is based on code that already exists, versus any ability to solve novel problems. Also, the idea of having any number of non-technical people ship code is fucking insane, an indicative of an overwhelming ignorance on the part of management. Even a few years of having overwhelming amounts of code written by LLMs, even by engineers who know what it says and have some intention in the prompts they do, is going to create a situation where most of the code is written without any intention, making it much harder to debug, because nobody really knows why it was written that way, because LLMs don't think, I know I'm repeating myself already, but this situation is chilling me. Even outside of the vibe coding, there's a larger problem of developers writing code with LLMs that they barely review, in some cases because they don't feel they need to, and just skim read it, in many many more because their bosses are demanding their ship features faster than is responsible or safe. Remember, many many tech companies are mandating LLM use, harassing their workers, checking how much they use LLMs, I've heard this from multiple companies, and really it's not just for them, but it's especially hard on software engineers. Adding a layer of code written by people who quite literally do not understand what it says or does, guarantees future situations where major services simply break, and the more of this nonsensical code that's allowed to be stood up on these services, the harder it'll be to fix. Code isn't just something you write once and leave forever, it needs to be maintained by other people, sometimes years in the future, especially when people keep being laid off, which becomes much harder when there's lots of code to go through written by an LLM piloted by somebody who doesn't know what they're actually doing, and relies on the LLM which doesn't know anything to tell them what's going on. Again, I realize I'm repeating myself, but LLMs do not have thoughts or feelings or knowledge, they are generating based on the parameters of their training data, which means that all of their code is, at best, an abstraction of somebody else's back and forth with the chatbomb. The code is not written efficiently or with any consideration of who might have to work with it in the future, or indeed what other things might be involved. LLMs only know what they are fed or what they're connected to, they don't know the nuances of code, they don't know the nuances of software engineering or architecture, which means that they only know so much based on their training data and the environment around them, kind of, they don't know the nuances of how service, let's say, Meta or Microsoft's, has been built over decades, and indeed the more of this code that's used to build those services, the less that these companies know about how their actual fucking software works. This is setting up the software industry for disaster after disaster, and it's already started to happen. To quote the information from this week, According to internal meta communications and an incident report seen by the information, a major security alert occurred last week after a Meta software engineer used an in-house agent tool similar to OpenClaw to analyze a technical question that another employee had posted on an internal discussion forum. After doing the analysis, the AI agent posted a response in the discussion forum to the original question, offering advice on the technical issue according to internal communications. The agent did so without approval from the employee. It's so cool that this is happening. It's so cool. It's great. It's actually brilliant. How fucking insane. What, and I'm, I'm kindly going to assume that the person using this knew what they were doing, but the idea that we have, and I think this is what's happening with Open Source 2, we have people with LLMs who are like, yeah, well, the LLM tells me I'm good, so I must be. Let me just run this LLM past your problems. It's why we're getting all these junk pull requests on GitHub on Open Source projects. People that think they're competent because an LLM told them to are fucking up the entire software world. And according to the information, Meta systems storing large amounts of company and user-related data were accessible to engineers who didn't have permission to see them. And this was marked as Sec 1 incident, the second highest level of severity on an internal scale that Meta uses to rank security incidents. And again, that's quoting the information. The incident follows multiple problems caused to Amazon by its Kiro and QLLMs. I quote business insiders Eugene Kim. On March 2nd, customers across Amazon marketplaces saw incorrect delivery times when adding items to their carts. The incident led to nearly 120,000 lost orders and roughly 1.6 million website errors. Amazon's AI Tool Q was one of the primary contributors to trigger the event, according to an internal review. On March 5th, another outage caused a 99% drop in orders across Amazon's North American marketplaces, resulting in 6.3 million lost orders, one of the internal documents stated. One key factor was a production change that was deployed without using a formal documentation on an approval process called model change management. Very cool. I also want to be clear that it appears that these incidents were created by use of these tools by actual software engineers, people that ostensibly know how code and software architecture works. Reliance on large language models, especially at a time when executives are putting more pressure on engineers to deliver more features and ship more code, means that software engineers are being incentivized to be sloppy and to ship slop itself. There is nothing inherently good about automating code, nor is there any inherent value in shipping a lot of it fast. LLMs convince you that what you're writing is good and stable and does the thing you want it to, and if you're skim reading the outputs or unable to read them at all, it's easy for you to assume that because you asked a model that does not have thoughts where it thinks you got something right that you actually did so and that it got it right. To be explicit, allowing an LLM to write all of your code means that you are no longer developing code, nor are you learning how to develop code, nor are you going to become a better software engineer as a result, nor are you solving actual problems, you are just handing work over to something and taking dog shit out. I'm not saying that all coders using LLMs are inherently bankrupt or anything, but I hear these stories about writing all the code and they give me the willies. And I know what I'm saying sounds like an insult or hyperbole, I don't mean it in that way. If you are just a person looking at code, you're only as good as the code the model makes, and as Mo Batar recently discussed, these models are built to galvanize you, glaze you and tell you that you are remarkable as you barely glance at globs of overwritten code that, even if it functions, eventually grows to a hole built with no intentional purpose other than what the model generated from your prompts. I'm sure there are software engineers using these models ethically who read all the code, who have complete industry over it and use it like a glorified autocomplete. I can see the value. I'm also sure that there are some that are just asking it to do so of glancing at the code and shipping it. It's impossible to measure how many of each camp there are, but hearing Spotify's CEO say that its top developers are basically not writing code anymore makes me deeply worried because this shit isn't replacing software engineering at all. It's mindlessly removing friction and putting the burden of good or right on a user that it's intentionally gassing up. And ultimately this entire era is a test of a person's ability to understand and appreciate friction. Friction can be a very good thing. When I don't understand something I make an effort to do so in the moment it clicks, it's magical. In the last three years I've had to teach myself a great deal about finance, accountancy and the greater technology industry and there have been so many moments where I've walked away from the page frustrated, students self doubt that I'd never understand something. I eventually did. It took time. It really took time and really that luxury of time is important and sadly many software engineers face increasingly deranged deadlines set by bosses that don't understand a single fucking thing about their job or the software industry itself, let alone what LLMs are capable of or what responsible software engineering might be. The push from above to use these models because they can and I quote right code fast and a human is a disastrous conflation of fast and good, all because of flimsy myths peddled by venture capitalists in the media about LLMs being able to replace software engineers. It's fucking stupid. It's a disgrace and there are real problems that are going to happen as a result. The problem is that LLMs can write all code theoretically, they can just put the code out that you might have written yourself, doesn't mean the code is good or that somebody can read it and understand its intention or that it works so that it will work in the future or that you can build any kind of sustainable or I don't know like stable in any way organization on top of it or even that having a lot of code is a good thing both in the present and in the future of any company built using this generative code. Adding the variable of code written by people who quite literally do not understand it guarantees something severe and calamitous in the future though I'd argue that was the case without their influence. Increasing the volume of code contributed to a company naturally increases the amount of time needed to read it and the amount of effort needed to maintain it which naturally encourages people to use LLMs to summarize it and then well you have to rely on the LLMs to tell you what good looks like and they don't know a single fucking thing and it also creates a new burden on the technical workers that have to clean up the slop in their day-to-day lives. Generative code is a digital ecological disaster one that will take years to repair thanks to company remits to write as much code as fast as possible and use LLMs as much as possible too. Every single person responsible must be held accountable especially for the calamities to come as lazily managed software companies see the consequences of building their software on sand. I'll see you all next week. Hey there this is Josh from Stuff You Should Know with a message that could change your life. The Stuff You Should Know ThinkSpring Podcast playlist is available now. Whether Spring has sprung in your neck of the woods yet or not the Stuff You Should Know ThinkSpring playlist will make you want to get your overalls on, get outside, and get your hands in the dirt. You can get the Stuff You Should Know ThinkSpring playlist on the iHeart Radio app, Apple podcasts, or wherever you get your podcasts. Let's go! Our iHeart Radio Music Awards are coming back Thursday, March 26th, live on Fox. Watch as we honor the biggest stars from all genres of music that you love listening to all year long on your favorite iHeart radio station and the iHeart radio app. Hosted by Ludacris, Icon Award recipient John Mellencamp, Innovator Award recipient Miley Cyrus, with performances by Alex Warren, Kehlani, Laini Wilson, Ludacris, Ray, TLC, Salt and Pepper, and Invoke. Plus Taylor Swift makes her first award show appearance this year. Also Gold Medal Olympian Alyssa Liu, Neo, Nick Coleslaw, Nicky Glazer, Sombra, Weiser, and more. Watch live on Fox Thursday, March 26th, at 8, 7, Central. And listen on iHeart radio stations across America and the free iHeart app. This is an iHeart podcast. Guaranteed human.