Hacker News Clone

Towaway69Mar 25, 2026, 5:55 PM

What the article doesn't touch on is the vendor lock-in that is currently underway. Many corps are now moving to an AI-based development process that is reliant on the big AI providers.

Once the codebase has become fully agentic, i.e., only agents fundamentally understand it and can modify it, the prices will start rising. After all, these loss making AI companies will eventually need to recoup on their investments.

Sure it will be - perhaps - possible to interchange the underlying AI for the development of the codebase but will they be significantly cheaper? Of course, the invisible hand of the market will solve that problem. Something that OPEC has successfully done for the oil market.

Another issue here is once the codebase is agentic and the price for developers falls sufficiently that it will significant cheaper to hire humans again, will these be able to understand the agentic codebase? Is this a one-way transition?

I'm sure the pro-AIs will explain that technology will only get cheaper and better and that fundamentally it ain't an issue. Just like oil prices and the global economy, fundamentally everything is getting better.

_the_inflatorMar 25, 2026, 8:03 PM

I have similar concerns.

We will miss SaaS dearly. I think history is repeating just with DVD and streaming - we simply bought the same movie twice.

AI more and more feels the same. Half a year ago Claude Opus was Anthropics most expensive model - boy, using Claude Opus 4.6 in the 500k version is like paying 1 dollar per minute now. My once decent budgets get hit not after weeks but days (!) now.

And I am not using agents, subagents which would only multiply the costs - for what?

So what we arrive more and more is the same as always: low, medium, luxury tier. A boring service with different quality and payment structures.

Proof: you cannot compensate with prompt engineering anymore. Month ago you fixed any model discrepancies by being more clever and elaborate with your prompts etc.

Not anymore. There is a hidden factor now that accounts for exactly that. It seems that the reliance on skills and different tiers simply moves us away from prompt engineering which is considered more and more jailbreaking than guidance.

Prompt engineering lately became so mundane, I wonder what vendors were really doing by analyzing the usage data. It seems like that vendors tied certain inquiries with certain outcomes modeled by multistep prompting which was reduced internally to certain trigger sentences to create the illusion of having prompted your result while in fact you haven't.

All you did was asking the same result thousands of user did before and the LLM took an statistical approach to deliver the result.

dgb23Mar 26, 2026, 8:21 AM

Are you saying we increasingly get ML results and not LLM resuluts?

alt227Mar 26, 2026, 2:19 PM

> we simply bought the same movie twice

Maybe you did, but I certainly didnt.

nubgMar 26, 2026, 2:09 PM

> 1 dollar per minute

so 60 usd / hour? a plumber earns more

if this allows you to produce features that bring you money, it's a no-brainer

kderbymaMar 27, 2026, 2:05 AM

Plumbers are working on very standard systems, they have to front costs and secure work. It only works for them because enough people use basic plumbing services to sustain them. But how many have return customers and guaranteed work for long term projects?

mojosamMar 26, 2026, 12:47 PM

> Once the codebase has become fully agentic, i.e., only agents fundamentally understand it

What exactly do we mean this? Because it is obviously common for human coders to tackle learning how an unfamiliar and complex codebase works so that they can modify it (new hires do it all the time). I can think this means one of two things:

* The code and architecture being produced by agents takes approaches that are abnormally complex or inscrutable to human reviewers. Is that what folks working with cutting edge agents are seeing? In which case, such code obviously isn’t beeping reviewed; it can’t be.

* the code and architecture being produced by agents can still be understood by human reviewers, but it isn’t actually being reviewed by anyone — since reviewing pull requests isn’t always fun or easy, and injecting in-depth human review slows everything down a lot — and so no one understands how the code works. (I keep thinking about the AI maximalist who recently said he woke up to 75 pull requests from his agent, like that was a good thing)

And maybe it’s a combination of the two: agent-generated pull requests are incrementally harder to grok, which makes reviewing more painful and take longer, which means more of them go without in-depth reviews.

But if your claim is true, the bottom line is that it means no one is fully reviewing code produced by agents.

nfgrepMar 26, 2026, 2:37 PM

Folks are reviewing the code, but the standard shape of a review is a PR. This diff assumes you have an underlying knowledge of the system, one that is most realistically gained by having written the code. Could you “just remember” every diff you’ve seen? Maybe, but I don’t think it’s realistic; we learn far better from doing than from reading.

furyofantaresMar 26, 2026, 2:59 PM

> What exactly do we mean this? Because it is obviously common for human coders to tackle learning how an unfamiliar and complex codebase works so that they can modify it (new hires do it all the time).

I agree with you, BUT: I find it much harder to get my head around a medium sized vibe coded project than a medium size bespoke coded project. It's not even close.

I don't know what codebases will look like if/when they become "fully agentic". Right now, LLM-agents get worse, not better, as a codebase grows, and as more if it is coded (or worse architected) by LLM.

Humans get better over time in a project and LLMs get worse, and this seems fundamental to the LLM architecture really. The only real way I see for codebases to become fully agentic right now is if they're small enough. That size grows as context sizes that new models can deal with grows.

If that's how this plays out - context windows get large enough that LLM-agents can work fine in perpetuity in medium or large size projects - I wonder if the resulting projects will be extremely difficult for humans to wrap their heads around. That is, if the LLM relies on looking at massive chunks of the codebase all at once, we could get to the point of fully agentic codebases without having to tackle the problem of LLMs being terrible at architecture, because they don't need it.

BaloogaMar 26, 2026, 5:17 PM

And is "model collapse" a thing when LLMs are trained on 100% LLM-generated code? Fun times ahead.

AbstractH24Mar 26, 2026, 8:13 PM

What examples in history can be learned from here?

3formMar 26, 2026, 6:11 PM

For your points:

- Garden path approaches are definitely a thing, but I don't think this is necessarily catastrophic. A lot depends on the language and framework in question, and also the driver of the change.

- I think it's that plus the fact it's easy to just generate ever more code. Solutions scale in every dimension until they hit a limit where it's not feasible to go further. If AI tools will allow you to write a project with a million or 10 million lines of code, you can bet it will eventually happen. Who's ever gonna fix that?

eaglelampMar 25, 2026, 7:42 PM

No one ever asks how much it costs Facebook or Uber to serve requests because it is irrelevant, they set prices to maximize their profit like any good monopolist. Similarly the future cartel of big providers will charge their captive users whatever they can get away with, not the cost of inference.

The current discourse around "AI", swarms of agents producing mountains of inscrutable spaghetti, is a tell that this is the future the big players are looking for. They want to create a captive market of token tokers who have no hope of untangling the mess they made when tokens were cheap without buying even more at full price.

SaucyWrongMar 25, 2026, 6:46 PM

This is a great point, and I routinely use it as an argument for why seasoned professionals should work hard to keep their skills and why new professionals should build them in the first place. I would never be comfortable leasing my ability to perform detailed knowledge work from one of these companies.

Sometimes the argument lands, very often it doesn't. As you said, a common refrain is, "but prices won't go up, cost to serve is the highest it will ever be." Or, "inference is already massively profitable and will become more so in the future--I read so on a news site."

And that remark, for me, is unfortunately a discussion-ender. I just haven't ever had a productive conversation with somebody about this after they make these remarks. Somebody saying these things has placed their bets already and are about to throw the dice.

AbanoubRodolfMar 26, 2026, 7:02 AM

[dead]

mdavid626Mar 26, 2026, 7:39 AM

There is no such thing as agentic codebase. If humans don’t understand it, nothing really does. Agents give zero fuck about anything. If they burn 100 or million tokens to add a feature, they don’t care. It’s the developers responsibility to keep it under control.

drzaiusx11Mar 26, 2026, 12:59 PM

100% this. With these new tools it's tempting to one-shot massive changesets crossing multiple concerns in preexisting, stable codebases.

The key is to keep any changes to code small enough to fit in your own "context window." Exceed that at your own risk. Constantly exceeding your capacity for understanding the changes being made leads to either burnout or indifference to the fires you're inevitably starting.

Be proactive with these tools w.r.t. risk mitigation, not reactive. Don't yolo out unverified shit at scales beyond basic human comprehension limits. Sure, you can now randomly generate entirely (unverified) new software into being, but 95% of the time that's a really, really bad idea. It is just gambling and likely some part of our lizard brains finds it enticing, but in order to prevent the slopification of everything, we need to apply some basic fucking discipline.

As you point out, it's our responsibility as human engineers to manage the risk reward tradeoffs with the output of these new tools. Anecdotally, I can tell you, we're doing a fucking bad job of it rn.

codybMar 26, 2026, 1:19 PM

The big AI projects I've seen at work are...

- A Kafka topic visualization dashboard

and

- A chrome extension the original "developer" can no longer work on cause the bots will wreck something else on every new feature he tries to add or bug he tries to fix

I think we're a ways out from truly complex code bases that only agents understand.

I've seen a bunch of hype video where people spend lord knows how much money in order to have a bunch of these things run around and I guess... use Facebook, and make reports to distribute amongst themselves, and then the human comes in and spends all their time tweaking this system. And then apparently one day it's going to produce _something_ but two years and counting and much like bitcoin, I've yet to see much of this _something_ materialize in the form of actual, working, quality software that I want to use.

My buddy made a thing that tells him how many people are at the gym by scraping their API and pushing it into a small app package... I guess that's kind of nice.

shmobotMar 26, 2026, 10:27 AM

Lately I also wonder about the geopolitical lock-in and balkanization of the internet. US won't have this problem I guess. But with all that's happening in the world right now and the current trends, for the rest of us we need to think hard what AI company we trust with our data or trust to still have access to once we're on the other side of the wall.

iso1631Mar 26, 2026, 11:12 AM

> geopolitical lock-in and balkanization of the internet. US won't have this problem I guess

This reminds me of the apocryphal headline from the dying days of the British Empire:

> Fog in Channel; Continent Cut Off

gengstrandMar 26, 2026, 6:00 PM

If only the AI understands your code, then vendor lock-in and exposure to price hikes will be the least of your problems. I don't think that you will be able to add Claude as the Dev-On-Call to your pagerduty schedule. If you are in an industry that requires due diligence and you get sued for bugs that cause material damage and human suffering, then I don't think the "blame it on Claude" defense is going to land well in court. I cover these topics on https://www.exploravention.com/blogs/soft_arch_agentic_ai/ which is a blog I wrote recently.

sanderjdMar 26, 2026, 2:36 PM

I'm beginning to develop the opinion that the next step in this process will (or at least should) be local and/or self-hosted inference.

The latest qwen models are already very useful, and the smaller ones can be run locally on my laptop. These are obviously not as good as the latest frontier models, and that's extremely noticeable for the development workflow, but maybe in a year or two, they will be competitive with the proprietary models we have today, which are incredibly capable. I also expect compute for inference to continue getting cheaper.

The current lock in for me is the UX of Claude Code / codex cli, but this is a very small moat that will definitely be commoditized soon.

hyttioaoaMar 26, 2026, 10:19 AM

I've been thinking this for a while now as well. If they keep subsidizing for long enough there might be a large gap of humans that changes jobs, didn't get into the field in the first place. Then the only way out is to keep buying those tokens.

dahartMar 26, 2026, 3:02 PM

What do you mean about vendor lock-in? I haven’t yet seen any meaningful barriers to switching between different companies’ coding agents. Are you talking about AI market lock-in and not vendor-specific lock-in?

> these loss making AI companies will eventually need to recoup

This is true, and while AI spend continues to rise, I’m starting to think once the dust settles and the true costs emerge and stable profits are achieved, that it may be expensive enough that it’s a limiting force.

AbstractH24Mar 26, 2026, 8:15 PM

Then you aren’t a true vibe coder using replit

emporasMar 25, 2026, 8:17 PM

Code is so low entropy that smaller and more economical models will be up to the task the same as gigantic models from big providers are today.

No worries there, the huge improvements we see today from GPT and Claude, are at their heart just Reinforcement Learning (CoT, chain of thought and thinking tokens are just one example of many). RL is the cheapest kind of training one can perform, as far as I understand. Please correct me if that's not the case.

In the economy the invisible hand manages to produce everything cheaper and better all the time, but in the digital space the open source invisible hand makes everything completely free.

Towaway69Mar 25, 2026, 8:28 PM

> the open source invisible hand makes everything completely free.

In this case the limitation is the compute. Very few people have the compute required for AI/LLMs locally or for free (comparable to the performance of Claude). So yes, there are plenty of Open Source models that can be used locally but you need to invest in hardware to make that happen and especially if you want the quality that is available from the commercial offerings.

Not to speak of the training of those models. It's all there to make it possible to do this locally however where's the hardware? AWS? Google? There are hidden costs of the Open Source model in this case.

emporasMar 25, 2026, 9:04 PM

>In this case the limitation is the compute.

I agree with most of your points, but computation can be transferred from a place where energy is cheap to a place that is expensive. Energy for cooking cannot be transferred that way.

See for example Amazon-Google datacenters in the Gulf region. We've also got a whole continent, Australia, to put as many solar panels as we desire. Australia got dark for half a day, every day? Put solar panels to the opposite side of the planet.

Energy is a concern, for cooking, transportation etc. Energy for computation is not.

actionfromafarMar 27, 2026, 9:28 AM

Agree!

fantasizrMar 25, 2026, 6:23 PM

this is a good point. Some of the ai companies are trying to hook cs students so they'll only know "dev" as a function of their products. First one's free as they say (the drug dealers).

Towaway69Mar 25, 2026, 7:02 PM

I agree, that is the great danger that CS students aren't even taught the fundamentals of "computer science" any longer. It would be the equivalent of physics students not learning Newtons laws or e-m-c-squared.

Probably there is an issue with how much there is in CS - each programming language basically represents a different fundamental approach to coding machines. Each paradigm has its application, even COBOL ;)

Perhaps CS has not - yet - found its fundamental rules and approaches. Unlike other sciences that have hard rules and well trodden approaches - the speed of light is fixed but not the speed of a bit.

shmelMar 26, 2026, 12:38 PM

I think it will be more similar to the cloud. I remember people predicted that once you move to the cloud, you'll realize how expensive it actually is, but the cost of migration back will be high. While, yes, the cloud is expensive, most people realized that it is kinda worth it.

vovaviliMar 26, 2026, 12:57 PM

Oil market doesn't have an equivalent of open-source LLMs, self-hosting and cloud providers.

pj_mukhMar 26, 2026, 10:33 AM

"Just like oil prices and the global economy, fundamentally everything is getting better." (implied /s)

I remember having to pay a pretty penny to have a 3 minute conversation with my dad working half way across the world. Now I can video call my nephew for 45 minutes without blinking an eye. What happened?

Why will Intelligence be like Oil and not Broadband?

AurornisMar 25, 2026, 7:22 PM

> the prices will start rising. After all, these loss making AI companies will eventually need to recoup on their investments.

I would bet a lot of money that the price of LLM assistance will go down, not up, as the hardware and software advance.

Every genre-defining startup seems to go through this same cycle where the naysayers tell us that it's all going to collapse once the investment money runs out. This was definitely true for technologies without use cases (remember the blockchain-all-the-things era?) but it is not true for businesses that have actual users.

Some early players may go bust by chasing market share without a real business plan, like the infamous Webvan grocery delivery service. But even Webvan was directionally correct, with delivery services now a booming business sector.

Uber is another good example. We heard for years that ridesharing was a fad that would go away as soon as the VC money ran out. Instead, Uber became a profitable company and almost nobody noticed because the naysayers moved on to something else.

AI is different because the hardware is always getting faster and cheaper to operate. Even if LLM progress stalled at Opus 4.6 levels today, it would still be very useful and it would get cheaper with each passing year as hardware improved.

> I'm sure the pro-AIs will explain that technology will only get cheaper and better and that fundamentally it ain't an issue. Just like oil prices

Comparing compute costs to oil prices is apples to oranges. Oil is a finite resource that comes out of the ground and the technology to extract it doesn't improve much over decades. AI compute gets better and cheaper every year because the technology advances rapidly. GPU servers that were as expensive as cars a few years ago are now deprecated and available for cheap because the new technology is vastly faster. The next generation will be faster still.

If you're mentally comparing this to things like oil, you're not on the right track

nunezMar 25, 2026, 9:31 PM

> almost nobody noticed

Rideshare costs are much higher than they have been in years past. Everyone noticed

Towaway69Mar 25, 2026, 8:10 PM

> Oil is a finite resource that comes out of the ground

Yes but the chips, hardware, copper cables, silicon and all the rest of the components that make up a server are finite. Unless these magically appear from outer space, we'll face the same resource constraints as everything else that is pulled out of the ground.

These components are also far more fragile to source, see COVID and the collapse of global supply chains. Also the factories to create these components are expensive to build and fragile to maintain. See the Dutch company that seems to be the sole supply of certain manufacturing skills.[1]

> I would bet a lot of money that the price of LLM assistance will go down, not up, as the hardware and software advance.

My bet would be that it would fuel the profits of AI companies and not make the price of AI come down. Over supply makes price come down but if supply is kept artificially low, then prices stay high.

That's the comparison to OPEC and oil. There is plenty of oil to go around yet the supply is capped and thereby prices kept high. There is no guarantee that savings in hardware or supply will be passed on by AI corps.

Indeed there is no guarantee that there will be serious competition in the market, OPEC is a monopoly so why not have an AI monopoly? At the moment, all major players in AI are based in the same geopolitical sphere, making a monopoly more likely, IMHO.

In the end, it's all speculation what will happen. It just depends on which fairy tail one believes in.

[1]: https://en.wikipedia.org/wiki/ASML_Holding

AurornisMar 25, 2026, 10:23 PM

> Yes but the chips, hardware, copper cables, silicon and all the rest of the components that make up a server are finite. Unless these magically appear from outer space, we'll face the same resource constraints as everything else that is pulled out of the ground.

Raw material cost is not a driver of datacenter GPU costs.

> Over supply makes price come down but if supply is kept artificially low, then prices stay high.

Where are you getting "supply kept artificially low" when we're in the middle of an explosion of datacenter buildouts and AI companies?

We're in a race to the bottom on pricing. I haven't seen a realistic argument for why you think prices are going to go up. You're starting with a conclusion and trying to find reasons it might be true.

Towaway69Mar 26, 2026, 6:10 AM

> Where are you getting "supply kept artificially low"

If a resource is controlled by a small group of coordinated folks (for example, large US controlled corporations who have/are these datacenters), the resource may be limited artificially because access to these resources are controlled by said corporations.

Exploding datacenters and AI companies yes, but true competition probably not. Most AI companies are using the datacenters from said corporations, if those corporations decide that compute costs one cent more, then all AI providers will become more expensive.

What we should learn from OPEC and oil is that not resource amounts that define the price, it is access to the resource that defines the price.

methodicalMar 25, 2026, 7:38 PM

While I fundamentally agree with the basis of compute getting cheaper by the year, I think a missed consideration here is the fact that these models are also requiring exponentially more compute with each iteration to train, in a way that arguably has outscaled the advances in compute.

Whether a generalized and broadly usable model will be able to trained within some N multiple of our current compute availability allowing the price to come down with iterative compute advances is yet to be seen. With the current race to the top in terms of SOTA models and increasingly iteratively smaller improvements on previous generations, I have a feeling the scaling need for compute will outpace the improvements in our hardware architecture, and that's if Moore's law even holds as we start to reach the bounds of physics and not engineering.

However as it stands today, essentially none of these providers are profitable so it's really a question of whether that disconnect will come within their current runway or not and they'll be required to increase their price point to stay alive and/or raise more capital. It's pure conjecture either way.

simonwMar 25, 2026, 4:43 PM

Useful context here is that the author wrote Pi, which is the coding agent framework used by OpenClaw and is one of the most popular open source coding agent frameworks generally.

xivzgrevMar 26, 2026, 2:31 PM

There's currently a billboard up in San Francisco that basically says "use AI to reduce your saas costs".

And I'm thinking - has anyone actually done that for something meaningful?

Replacing salesforce as your crm or replacing Shopify as your e-commerce platform?

I get the hype but AI doesn't remove accountability, it just moves it up. Oh you can do with 1 person what 3 people used to do? Great, that 1 person is now accountable for 3 person's jobs. And people are naturally uncomfortable with that - you need to understand what's going on and be able to investigate / fix. It's different than say, weaving machines replacing jobs because weaving machines were consistent. 1 person could confidently produce what x weavers could before. But AI is not, and that variability in output & quality introduces massive friction.

So as of now, in both software and people, there's a real limit to how much AI can replace because the remaining people still are equally accountable.

hbarkaMar 26, 2026, 3:02 PM

Vendor lock-in is real and it’s scary. You are helpless to the constant price increases and each passing renewal you get deeper and deeper into the lock-in. Here’s to the day when someone clever with AI can disintermediate this situation. You don’t have to vibecode your own CRM but imagine a deterministic harness that lets you lego-block CRM functions like lead management, opportunity tracking, contact list, campaigns. There shouldn’t be a moat anymore.

crisnobleMar 26, 2026, 4:12 PM

Moving the vendor lock-in to the AI provider and exponentially increasing the pain of migration by locking all teams and all services in at once.

hbarkaMar 26, 2026, 6:04 PM

Not really. Mixture of models and mixture of experts have been around. It’s easy to switch a project harness from Codex to Claude to Gemini and to open models. You’re not locked in to a model, you’re more concerned about competitive token cost.

52-6F-62Mar 26, 2026, 2:52 PM

tick tock tick tock...

SoftTalkerMar 25, 2026, 4:50 PM

> Companies claiming 100% of their product's code is now written by AI consistently put out the worst garbage you can imagine. Not pointing fingers, but memory leaks in the gigabytes, UI glitches, broken-ass features, crashes

One thing about the old days of DOS and original MacOS: you couldn't get away with nearly as much of this. The whole computer would crash hard and need to be rebooted, all unsaved work lost. You also could not easily push out an update or patch --- stuff had to work out of the box.

Modern OSes with virtual memory and multitasking and user isolation are a lot more tolerant of shit code, so we are getting more of it.

Not that I want to go back to DOS but Wordperfect 5.1 was pretty damn rock solid as I recall.

MisterTeaMar 25, 2026, 5:17 PM

> Modern OSes with virtual memory and multitasking and user isolation are a lot more tolerant of shit code, so we are getting more of it.

It's not the glut of compute resources, we've already accepted bloat in modern software. The new crutch is treating every device as "always online" paired with mantra of "ship now! push fixes later." Its easier to setup a big complex CI pipeline you push fixes into and it OTA patches the users system. This way you can justify pushing broken unfinished products to beat your competitors doing the same.

skybrianMar 25, 2026, 5:27 PM

I think you're just recalling the few software products that were actually good. There was plenty of crap software that would crash and lose your work in the old days.

HerbManicMar 25, 2026, 8:18 PM

I always found it funny how Word on Window 3.1/95 would have a day dream moment and just completely lock up, usually when you were about to save the document

I still save stuff every few minutes out of habits formed in the 90s.

Old DOS stuff could either be a total nightmare or some of the most brilliant code you had ever seen. Thats just the way having no giard rails goes.

nunezMar 25, 2026, 9:34 PM

Lol right!

Remember when OS uptime was super duper important? Now it's a given that you can basically never restart your computer and be fine.

windowlikerMar 25, 2026, 5:06 PM

Another factor at work is the use of rolling updates to fix things that should better have been caught with rigorous testing before release. Before the days of 'always on' internet it was far too costly to fix something shipped on physical media. Not that everything was always perfect, but on the whole it was pretty well stress-tested before shipping.

The sad truth is that now, because of the ease of pushing your fix to everything while requiring little more from the user than that their machine be more or less permanently connected to a network, even an OS is dealt with as casually as an application or game.

vaultdweller101Mar 26, 2026, 6:56 AM

Is the price of speed bloat? Where does the tolerance for less reliable software come from?

andaiMar 25, 2026, 6:54 PM

It occurred to me on my walk today that a program is not the only output of programming.

The other, arguably far more important output, is the programmer.

The mental model that you, the programmer, build by writing the program.

And -- here's the million dollar question -- can we get away with removing our hands from the equation? You may know that knowledge lives deeper than "thought-level" -- much of it lives in muscle memory. You can't glance at a paragraph of a textbook, say "yeah that makes sense" and expect to do well on the exam. You need to be able to produce it.

(Many of you will remember the experience of having forgotten a phone number, i.e. not being able to speak or write it, but finding that you are able to punch it into the dialpad, because the muscle memory was still there!)

The recent trend is to increase the output called programs, but decrease the output called programmers. That doesn't exactly bode well.

See also: Preventing the Collapse of Civilization / Jonathan Blow (Thekla, Inc)

https://www.youtube.com/watch?v=ZSRHeXYDLko

tau5210Mar 26, 2026, 2:46 AM

> The recent trend is to increase the output called programs, but decrease the output called programmers. That doesn't exactly bode well.

Perhaps on a related note, I've noticed that a lot of the positive talks about AI are about quantity. On the other hand, there is disproportionately very little deep discussion about quality. And I mean not just short term, local quality, but more long term and holistic quality (e.g. managing complexity under evolving requirements in a complex system with multiple connected parts) at real production scale, where there is much less tolerance for failure.

In all the places I've worked in throughout my career, I've felt that there have always been a tension between those who cared more about things like the mental model and holistic quality, and those who seemed to care less or were even oblivious about it. I think one contribution of the current AI hype is that it gave a more concrete shape to this split...

r_leeMar 26, 2026, 7:09 PM

> Perhaps on a related note, I've noticed that a lot of the positive talks about AI are about quantity. On the other hand, there is disproportionately very little deep discussion about quality.

and to me this is so weird, because from what I can tell, quantity hasn't been the winning factor for a very long time now

TeamDmanMar 26, 2026, 1:59 PM

I've found LLMs decrease the friction in enabling more pedantic lints and tooling. It is a quantity problem because enabling all the aggressive warnings in the compiler makes a lot of work, and its a quality outcome because presumably addressing every warning from the compiler makes the code better

drzaiusx11Mar 26, 2026, 1:35 PM

LLM systems at their core being probabilistic text generators makes them easily produce massive works at scale.

In software engineering our job is to build reliable systems that scale to meet the needs of our customers.

With the advent of LLMs for generating software, we're simply ignoring many existing tenets of software engineering by assuming greater and greater risk for the hope of some reward of "moving faster" without setting up the proper guard rails we've always had. If a human sends me a PR that has many changes scattered across several concerns, that's an instant rejection to close that PR and tell them to separate those into multiple PRs so it doesn't burn us out reviewing something beyond human comprehension limits. We should be rejecting these risky changes out of hand, with the possible exception when "starting from scratch", but even then I'd suggest a disciplined approach with multiple validation steps and phases.

The hype is snake oil: saying we can and should one-shot everything into existence without human validation, is pure fantasy. This careless use of GenAI is simply a recipe for disasters at scales we've not seen before.

tau5210Mar 27, 2026, 5:39 AM

Well said, thank you.

MunksgaardMar 25, 2026, 7:40 PM

Peter Naur had that realization back in 1985: https://pages.cs.wisc.edu/~remzi/Naur.pdf

ProllyInfamousMar 26, 2026, 4:09 PM

>>[2019] Preventing the Collapse of Civilization / Jonathan Blow (Thekla, Inc)

During the Q&A, he responds "do we really want software written that humans cannot understand?!" His steadfast doubts against singularity are called into question, at least by his supporting 2019 responses.

Certainly the speaker is correct that modern hardware allows software to be crappily written — I fondly recall the "olden times" recanted about full-access operating systems of yesteryear. Those days are over...

The fact that a modern computer "needs" to be online to install an update is frustrating/concerning (e.g. for MacOS, without a USB installer must be online to update, even with stand-alone updater downloaded). Just use my local hardware (that I own) and install this software (that I have provided).

driftnodeMar 26, 2026, 6:12 AM

The phone number muscle memory example is perfect. There is a whole category of knowledge you only have if your hands did the work.

roveoMar 26, 2026, 11:35 AM

It's called "tacit knowledge" and I think we generally overindex on explicit, formal knowledge and ignore tacit knowledge. You can see that with language learning, we treat languages like something you "learn", but in my experience it's closer to a motor skill like playing tennis.

https://en.wikipedia.org/wiki/Tacit_knowledge

drzaiusx11Mar 26, 2026, 12:36 PM

The article touches on this but I think the key takeaway is that humans need to properly manage the _scope of work_ for their agentic teams in order to have any chance of a successful outcome.

Current gen agents need to be provided with small, actionable units of work that can _easily_ be reviewed by a human. A code deliverable is made easy to review if the scope of change is small and aligned with a specific feature or task, not sprawled across multiple concerns. The changes must be ONLY related to the task at hand. If a PR is generated that does two very different things like fix linting errors in preexisting code AND implement feature X, you're doing it wrong. Or rather, you're simply gambling. I'd rather not leave things up to chance that I may miss something in that new 10000LOC PR. It's better that a 10000LOC never existed at all.

YOLOing out massive, sweeping changes with agents exceed our own (human) "context windows" and as this article points out, we're then left with an inevitable "mess." The untangling of which will take an inordinate amount of time to fix.

codybMar 26, 2026, 1:36 PM

At which point you've gained very little efficiency in most large organizations given that by the time you're actually doing development work at the ticket level 90% of the project timeline (identifying issues, prioritizing, creating requirements, architecture, ticket breakdowns, coordination, etc) has already passed.

If AI can enable engineers to move through the organization more effectively, say by allowing them to work through the service mesh as a whole, that could reduce time. But in order to evaluate code contributions to any space well, as far as I can tell, you still have to put in leg work even if you are an experienced engineer and write some features which exposes you to the libraries, quirks, logging/monitoring, language, etc that make up that specific codebase. (And also to build trust with the people who own that codebase and will be gatekeeping your changes, unless you prefer the Amazon method of having junior engineers YOLO changes onto production codebases without review apparently... holy moly, how did they get to that point in the first place...)

So the gains seem marginal at best in large organizations. I've seen some small organizations move quicker with it, they have less overhead, less complexity, and smaller tasks. Although I've yet to see much besides very small projects/POCs/MVPs from anyone non-technical.

Maybe it'll get to the point where it can handle more complexity, I kind of think we're leveling off on this particular phase of AI, and some headlines seem to confirm that...

- MS starting to make CoPilot a bit less prominent in its products and marketing - Sora shutting down - Lots of murky, weird, circular deals to fund a money pit with no profits - Observations at work

It's really kind of crazy how much our entire society can be hijacked by these hype machines. My company did slow roll AI deployment a bit, but it very much feels like the Wild West, and the amount of money spent! I'm sure it's astronomical. Pretty sure we could have hired contractors to create the Chrome plugin and Kafka topic dashboard we've deployed for far cheaper

drzaiusx11Mar 26, 2026, 1:53 PM

The productivity gains are somewhat real in a sense, but are not really about "moving faster", as the hype would have us believe. GenAI agentic systems instead boost individual developer "efficiency" by allowing a single, reasonably qualified developer, to approximate an entire software team. As those developers, however, we're still required to manage the workload of those teams and ourselves to ensure quality output, just as ever before.

The problem is that it's VERY easy to overload oneself with the output of these new tools. Human comprehension is the bottleneck, as much as it always has been. Anyone that tells you otherwise is shilling for these companies.

drzaiusx11Mar 26, 2026, 2:09 PM

And just to underscore my point about the disconnect between the advertising/hype vs the reality: the real point of this tech and the reason leadership seems so motivated to push it is that they ultimately see this tooling as our replacement, not our enhancement, at least in the long term (although those "many hats" roles will have to persist).

It's just harder to sell trades folk the tools of their demise, so it's couched in terms of a miracle product that'll make us all 10x devs; when the reality is, it'll just be 1/10 of us still around doing the risk mitigation work left within the system that relaced us.

youknownothingMar 25, 2026, 11:20 PM

I understand your pain, we're just a peak hype, I think people will learn to backtrack and use the tool in a more sensible way. It always happens. I remember when MongoDB and other NoSql databases came out, people went as far as to say that "SQL is dead" and refuse to use a normal SQL database for anything. Not even for the most obvious relational application. People would store everything as key-value pairs with no schema and do all the joins in the application layer. Fast forward 10 years and we're back to using SQL for most of our applications. NoSql hasn't disappeared, it has just been reduced to the nice where it's useful.

tau5210Mar 26, 2026, 2:18 AM

Also reminded me of Kafka (Kafka as a database!) and microservices (monoliths are evil, microservices are the future). I'm sure we can dig up similar hypes on various scales throughout the history of this industry...

Perhaps so-called AI is slightly different from hypes like NoSql and microservices in that these reduced to usages that practically apply to only a fraction of the engineering population (albeit, it's still good for anyone to know about them even if we never use them), whereas AI will probably still affect us all even after the dust settles. Just in much less spectacular ways than is being trumpeted currently by some groups. Reminded me of No Silver Bullet: "There is no single development, in either technology or management technique, which by itself promises even one order of magnitude improvement in productivity, in reliability, in simplicity. "

pm90Mar 26, 2026, 12:26 PM

Technology moves fast and is prone to hype. While NoSQL and Kafka were certainly oversold, almost every mid-large scale tech company has at least one nosql system and kafka-like system in use. The proponents weren’t wrong, they oversold the impact.

There is other tech that did completely change how we do things. CI/CD, Containers, Kubernetes, distributed tracing etc. are considered standard now (but weren’t not that long ago).

cedwsMar 26, 2026, 3:27 PM

[dead]

0xbadcafebeeMar 25, 2026, 4:18 PM

> it sure feels like software has become a brittle mess, with 98% uptime becoming the norm instead of the exception, including for big services

As somebody who has been running systems like these for two decades: the software has not changed. What's changed is that before, nobody trusted anything, so a human had to manually do everything. That slowed down the process, which made flaws happen less frequently. But it was all still crap. Just very slow moving crap, with more manual testing and visual validation. Still plenty of failures, but it doesn't feel like it fails a lot of they're spaced far apart on the status page. The "uptime" is time-driven, not bugs-per-lines-of-code driven.

DevOps' purpose is to teach you that you can move quickly without breaking stuff, but it requires a particular way of working, that emphasizes building trust. You can't just ship random stuff 100x faster and assume it will work. This is what the "move fast and break stuff" people learned the hard way years ago.

And breaking stuff isn't inherently bad - if you learn from your mistakes and make the system better afterward. The problem is, that's extra work that people don't want to do. If you don't have an adult in the room forcing people to improve, you get the disasters of the past month. An example: Google SREs give teams error budgets; the SREs are acting as the adult in the room, forcing the team to stop shipping and fix their quality issues.

One way to deal with this in DevOps/Lean/TPS is the Andon cord. Famously a cord introduced at Toyota that allows any assembly worker to stop the production line until a problem is identified and a fix worked on (not just the immediate defect, but the root cause). This is insane to most business people because nobody wants to stop everything to fix one problem, they want to quickly patch it up and keep working, or ignore it and fix it later. But as Ford/GM found out, that just leads to a mountain of backlogged problems that makes everything worse. Toyota discovered that if you take the long, painful time to fix it immediately, that has the opposite effect, creating more and more efficiency, better quality, fewer defects, and faster shipping. The difference is cultural.

This is real DevOps. If you want your AI work to be both high quality and fast, I recommend following its suggestions. Keep in mind, none of this is a technical issue; it's a business process isssue.

pixl97Mar 25, 2026, 4:44 PM

It also seems like massive consolidation has caused issues too. Everyone is on Github. Everyone is on AWS. Everyone is behind cloudflare. Whenever an issue happens here it effects everyone and everyone sees it.

In the past with smaller services those services did break all the time, but the outage was limited to a much smaller area. Also systems were typically less integrated with each other so one service being down rarely took out everything.

0xbadcafebeeMar 25, 2026, 7:43 PM

The power company is massively consolidated, as is the water supply, telephone service. These are monolithic, monopolistic entities. But they are also very reliable (failures are usually isolated by region, or a result of natural disaster).

What leads to more failure is when you don't engineer those consolidated entities to be reliable. Tech companies have none of the legal requirements or incentives to be reliable, the way physical infrastructure companies do. I agree that the tighter integration is an issue, but the root cause is tech companies have no incentive other than profits. If they're making profits, everything's fine.

pixl97Mar 25, 2026, 9:21 PM

I mean recommend professional software engineering licenses here on HN and it goes over like a turd in a punch bowl. Everyone knows where the search for more profit was going, no one wanted to get off the ride though.

hackertyper69Mar 25, 2026, 5:54 PM

It's a systems engineering job. You need to provide context, acceptable failure modes, and test at each level for validation. Identify false coupling, poor interfaces, things that don't match business context during agent planning phase. Then communicate / translate to others so their decisions improve instead of destroying the system by optimizing only for their local situation.

zephenMar 25, 2026, 7:16 PM

> One way to deal with this in DevOps/Lean/TPS is the Andon cord.

Many years ago, I started working for chip companies. It was like a breath of fresh air. Successful chip companies know the costs (both direct money and opportuity) of a failed tapeout, so the metaphorical equivalent of this cord was there.

Find a bug the morning of tapeout? It will be carefully considered and triaged, and maybe delay tapeout. And, as you point out, the cultural aspect is incredibly important, which means that the messenger won't be shot.

_doctor_loveMar 25, 2026, 6:54 PM

Super good take - the Andon cord is needed everywhere.

AndrianVMar 26, 2026, 2:55 PM

[dead]

AnamonMar 27, 2026, 5:09 PM

> Because the simple act of having to write the thing or seeing it being built up step by step introduces friction that allows you to better understand what you want to build [...]

I would go further and remove that second option. If the code is important, LLM support or not, write it yourself.

At least for me, there is a clear qualitative difference in thinking between typing the code and watching it being typed, even if I follow along with every line.

If I type it, my brain is constantly questioning whether what I'm doing is correct. What are the edge cases here? Is this introducing a vulnerability? Am I getting the right data from the right place?

By watching an agent or someone else code, the mindset is different. I'm checking someone else's work under the implicit assumption that they have some idea of what they're doing and I'm just reviewing mostly for superficial stuff. I can force myself to ask those other questions, but it takes conscious effort and isn't sustainable over long sessions.

I play around with agentic coding, but I'm always shocked at how much worse the result is compared to working in a separate chat and typing (not pasting!) the suggestions. In the direct comparison, it's easy to see how agentic code turns so incredibly shit so ridiculously fast.

leonardoeMar 25, 2026, 8:09 PM

Just yesterday I was discussing many of the ideas presented here with a coworker. I had just walked out of a workshop led by $BIGTECHCOMPANY where someone presented the following toy example:

A service goes down. He tells the agent to debug it and fix it. The agent pulls some logs from $CLOUDPROVIDER, inspects the logs, produces a fix and then automatically updates a shared document with the postmortem.

This got me thinking that it's very hard to internalize both issue and solution -updating your model of the system involved- because there is not enough friction for you to spend time dealing with the problem (coming up with hypotheses, modifying the code, writing the doc). I thought about my very human limitation of having to write things down in paper so that I can better recall them.

Then I recalled something I read years ago: "Cars have brakes so they can go fast."

Even assuming it is now feasible to produce thousands of lines of quality code, there is a limitation on how much a human can absorb and internalize about the changes introduced to a system. This is why we will need brakes -- so we can go faster.

chatmastaMar 25, 2026, 11:01 PM

The gap in your example is that a human had to realize the system is broken so that he could nudge the agent into fixing it. He can fix that gap by updating the agent to recognize when the system breaks. This now becomes the level at which he debugs… did the agent recognize the failure and self-heal, or not?

And at that point, if the autonomous system breaks, realized it’s broken, and fixes itself before you even notice… then do you need to care whether you learn from it? I suppose this could obfuscate some shared root cause that gets worse and worse, but if your system is robust and fault-tolerant _and_ self-heals, then what is there to complain about? Probably plenty, but now you can complain about one higher level of abstraction.

emmitskaMar 26, 2026, 5:42 AM

"Do me a SOLID, YAGNI, give me a DRY KISS" — that's been my coding philosophy for 20 years. So when I came back to building after a long detour, I couldn't stomach watching agents confidently generate 400 lines where 40 would do. What I found is that the discipline was the feature, not the obstacle. I ended up pair programming closely — not because I distrusted the agent, but because I couldn't let go of the architecture. The internet kept telling me to stop going into the weeds. Your article explained why that instinct was right. Everyone else is happy grinding in third the whole race. I went 1, 2, 3 — and because I didn't bury myself getting out of the driveway, I still get to shift into fourth.

chr15mMar 26, 2026, 6:22 AM

As well as pair programming with the AI, you can explicitly put those principles in AGENTS.md and the stochastic code generator will pay attention and be less verbose.

jillesvangurpMar 26, 2026, 7:06 AM

Exactly. There's a difference between vibe coding and agentic software engineering. One is just prompting and hoping for the best. It works surprisingly well, up to a point. And then it doesn't. If that's happening to you, you might be doing it wrong. The other is forcing agents to do it right. Working in a TDD way, cleaning up code that needs cleaning up, following processes with checklists, etc. You need to be diligent about what you put in there and there's a lot of experience that translates into knowing what to ask for and how. But it boils down to being a bit strict and intervening when it goes off the rails and then correcting it via skills such that it won't happen again.

I've been working on an Ansible code base in the past few weeks. I manually put that together a few years ago and unleashed codex on it to modernize it and adapt it to a new deployment. It's been great. I have a lot of skills in that repository that explain how to do stuff. I'm also letting codex run the provisioning and do diagnostics. You can't do that unless you have good guard rails. It's actually a bit annoying because it will refuse to take short cuts (where I would maybe consider) and sticks to the process.

I actually don't write the skills directly. I generate them. Usually at the end of a session where I stumbled on something that works. I just tell it to update the repo local skills with what we just did. Works great and makes stuff repeatable.

I'm at this point comfortable generating code in languages I don't really use myself. I currently have two Go projects that I'm working on, for example. I'm not going to review a lot of that code ever. But I am going to make sure it has tests that prove it implements detailed specifications. I work at the specification level for this. I think a lot of the industry is going to be transitioning that direction.

slhckMar 26, 2026, 11:53 AM

Except that when its system prompt is full of instructions, caveats, design principles, gotchas, architecture notes, memories from the past, and personal preferences, at some point it's going to just ignore them outright. Heck, Claude Code won't even use critical instructions from a 100-line CLAUDE.md file sometimes. So you still have to be extremely vigilant about noncompliance.

badlibrarianMar 25, 2026, 4:29 PM

I suppose everyone on HN reaches a certain point with these kind of thought pieces and I just reached mine.

What are you building? Does the tool help or hurt?

People answered this wrong in the Ruby era, they answered it wrong in the PHP era, they answered it wrong in the Lotus Notes and Visual BASIC era.

After five or six cycles it does become a bit fatiguing. Use the tool sanely. Work at a pace where your understanding of what you are building does not exceed the reality of the mess you and your team are actually building if budgets allow.

This seldom happens, even in solo hobby projects once you cost everything in.

It's not about agile or waterfall or "functional" or abstracting your dependencies via Podman or Docker or VMware or whatever that nix crap is. Or using an agent to catch the bugs in the agent that's talking to an LLM you have next to no control over that's deleting your production database while you slept, then asking it to make illustrations for the postmortem blog post you ask it to write that you think elevates your status in the community but probably doesn't.

I'm not even sure building software is an engineering discipline at this point. Maybe it never was.

_dwtMar 25, 2026, 5:40 PM

> A number of these phenomena have been bundled under the name "Software Engineering". As economics is known as "The Miserable Science", software engineering should be known as "The Doomed Discipline", doomed because it cannot even approach its goal since its goal is self-contradictory. Software engineering, of course, presents itself as another worthy cause, but that is eyewash: if you carefully read its literature and analyse what its devotees actually do, you will discover that software engineering has accepted as its charter "How to program if you cannot.".

- Edsger Dijkstra, 1988

I think, unfortunately, he may have had us all dead to rights on this one.

throwanemMar 25, 2026, 6:39 PM

One would as sensibly dismiss the concept of an assembly line as "how to build a car if you cannot."

Dijkstra was a mathematician. It is a necessary discipline. If it alone were sufficient, then the "program correctness" fans would have simply and inarguably outdone everyone else forty years ago at the peak of their efforts, instead of having resorted to eloquently whiny, but still whiny, thinkpieces (such as the 1988 example [1] quoted here above) about how and why they would like history to understand them as having failed.

[1] https://www.cs.utexas.edu/~EWD/ewd10xx/EWD1036.PDF [2]

[2] I will freely grant that the man both wrote and lettered with rare beauty, which shames me even in this photocopier-burned example when I compare it to the cheerful but largely unrefined loops and scrawls of my own daily hand.

_dwtMar 25, 2026, 6:55 PM

The formal methods people may yet have the last laugh. I did not have Lean becoming a hyped programming language / proof assistant on my bingo card for 2025-26 and yet here we are, because these tools help us close the validation loop for LLM agents. That is not dead which can eternal lie...

But yes, I think the best rebuttal to Dijkstra-style griping is Perlis' "one can't proceed from the informal to the formal by formal means". That said I also believe kind of like Chesterton's quote about Christianity, they've also mostly not been tried and found wanting but rather found hard and left untried. By myself included, although I do enjoy a spot of the old dependent types (or at least their approximations). There's an economic argument lurking there about how robust most software really needs to be.

throwanemMar 25, 2026, 7:52 PM

Certainly, and it's at that economic argument that I strive to get, I think.

Every so often an article makes the rounds on the correctness and verification methods used for Space Shuttle avionics software and applications of similar import, or if not that then Nancy Leveson's comprehensive 1995 review of the Therac-25 accidents. [1]

Most software doesn't need to be nearly so robust, but Dijkstra constructs his argument as though all did, hinging the inversion on the obvious and frankly shocking cheat across the gap between his pages 14 and 15, ie, that paragraph beginning "But before a computer is ready to perform..." Here he casually, and without direct acknowledgement much less justification, assumes as rhetorically axiomatic that a program, not the machine that executes it, is the original artifact of computing, of which any reification merely constitutes less than perfect instantiation, which he is then free to criticize on the wholly theoretical grounds of mathematical beauty; that is, on the grounds he prefers to inhabit in all cases, whether to do so in any given example makes any sense or not.

If that's his preferred ground, fair enough; after all, he was a mathematician. But his hypocrisy in concealing the insistence by means of subtle rhetoric - mere pages after inveighing against "medieval thinking" by way of an example, his "reasoning by analogy," faulting specifically that argument made by way of specious rhetoric! - casts suspicion on all that both precedes and follows. From a layperson, I could regard it as honest error, but I have known and loved academic mathematicians, and I really can't conceive of any of them leaving intact so consequential a mistake.

Perhaps Dijkstra was different, or merely becoming old, but for someone so heavily invested in pushing a paradigm of programming with mathematical rigor at its core, it seems a remarkable flaw in what should be a crucial argument (especially in advance of a solution for the halting problem). I regret that flaw, because he isn't all wrong about what an engineering paradigm can do to the agency and optionality of programmers especially in industry - not that his one extremely privileged position therein, parallel with Feynman's time at Thinking Machines, would much acquaint him with our desiderata or our constraints - and I would like to find that point made in better company than he was able to give it.

But then, his conception never offered much in preference, did it? The labor of mathematicians is scarce and expensive: what good is a proof assistant to anyone who can't understand its output, much less give it input? And Dijkstra himself, not less strange a bird than any other mathematician, famously did all he could to avoid actually using the machines on whose correct use he here wrote. (Hence his hand, which I complimented so highly before. I also use a fountain pen, but as I said, not so beautifully - and I'm glad I know how to use a keyboard well, instead.)

There would not be more programmers or more software in a world run on such principles, I think, than in this one - on the contrary, less by far. Maybe that would be preferable, but mostly not for the reasons Dijkstra claimed.

[1] http://sunnyday.mit.edu/papers/therac.pdf

ThrowawayR2Mar 26, 2026, 5:03 AM

There is no liability or penalty for software defects and therefore no incentive for program correctness. Arguably, Dijkstra didn't fail; society has foolishly decided not to hold developers accountable for their bad code.

throwanemMar 26, 2026, 3:40 PM

Arguably? Okay, so argue it.

bigfishrunningMar 25, 2026, 6:48 PM

I think the real tragedy here is that we can spend *all* of our time trying to improve the quality of our output, but it simply doesn't matter, because as long as the button is where the boss wants it to be and is the right color, all is right with the world.

Literally nothing else matters, and we (or at least I) have wasted a ton of time getting good at writing software.

throwanemMar 26, 2026, 3:42 PM

As long as it continues to matter what the button actually does, I can't consider our effort to have been entirely wasted. We only have the misfortune to live in stupid and dangerous times, but good heavens, we're hardly the first in that, and hardly starved for examples from whom to learn.

zephenMar 25, 2026, 6:55 PM

> One would as sensibly dismiss the concept of an assembly line as "how to build a car if you cannot."

I agree, but I'm not sure this says what you think it does.

The people on the car assembly line may know nothing of engineering, and the assembly line has theoretically been set up where that is OK.

The people on the software assembly line may also (and arguably often do) know nothing of engineering, but it's not clear that it is possible to set up the assembly line in such a way so as to make this OK.

Arguably, the use of LLMs will at least have some utility in helping us to figure this out, because a lot of LLMs are now being used on the assembly line.

pydryMar 25, 2026, 4:51 PM

>After five or six cycles it does become a bit fatiguing. Use the tool sanely.

That's increasingly not possible. This is the first time for me in 20 years where I've had a programming tool rammed down my throat.

There's a crisis of software developer autonomy and it's actually hurting software productivity. We're making worse software, slower because the C levels have bought this fairy tale that you can replace 5 development resource with 1 development resource + some tokens.

whaleofatw2022Mar 25, 2026, 6:28 PM

That lucky?

In 18 years AI is the third or 4th tool forced upon a shop/team, I will say of those it is the forst one that is genuinely able to make me more productive overall, even with the drawbacks.

crystal_revengeMar 26, 2026, 4:21 AM

> What are you building?

I think AI really pushes this higher up the abstraction layer:

> What problem are you solving?

I've spent a good amount of my careering using engineering and math to solve specific problems, I'm usually adjacent to software teams.

What I've seen happen with agentic coding is that traditional software engineers keep focusing on using it to build software, while ignoring the problem they're trying to solve.

Meanwhile I've seen junior data analysts start interfacing with applications and tools they never dreamed of before, and delivering results to stakeholders in record times. Things that were previously blocked by engineering no longer are.

But many engineers today are not really problem solvers, they're software builders. The idea that solving the end users problem is the goal, not building them software, is incomprehensible.

And so they continue to struggle to use AI effectively because they're trying to build software with it. Which it's not terrible at, but it's really the wrong tool for that job.

Sometimes software is necessary to solve a problem, a few years ago, software was necessary for a fairly large problem surface area (though, to your point, even then a lot of software was not really built to solve those problems). Today that surface area is shrinking, and as economic constraints loom on the horizon, I believe it will increasingly be people who are solving problems (with or without AI) that will be the ones surviving.

Panzer04Mar 27, 2026, 12:00 AM

The kind of jobs an analyst are doing are probably the most amenable of everything to LLM assistance. Small, bounded, etc.

The bigger the problem set and context the less helpful an LLM gets.

AnimalMuppetMar 25, 2026, 5:10 PM

Software was an engineering discipline... at some places. And it still is, at some places.

Other places were "hack it until we don't know of any major bugs, then ship it before someone finds one". And now they're "hey, AI agents - we can use that as a hack-o-matic!" But they were having trouble with sustainability before, and they're going to still, except much faster.

hu3Mar 25, 2026, 8:07 PM

> What are you building? Does the tool help or hurt?

> People answered this wrong in the Ruby era, they answered it wrong in the PHP era, they answered it wrong in the Lotus Notes and Visual BASIC era.

I'm assuming you're saying these tools hurt more than help?

In that case I disagree so much that I'm struggling to reply. It's like trying to convince someone that the Earth is not flat, to my mental model.

PHP, Ruby and VB have more successful code written in them than all current academic or disproportionately hyped languages will ever have combined.

And there's STILL software being written in them. I did Visual Basic consulting for a greenfield project last week despite my current expertise being more with Go, Python, C# and C. And there's a RoR work lined up next. So the presence gap between these helpful tools and other minor, but over index tools, is still increasing.

It's easy to think that the languages one see mor often in HN are the prevalent ones but they are just the tip of the iceberg.

PaulHouleMar 25, 2026, 4:44 PM

People built a lot of great stuff with Ruby, PHP, Notes and VB. I don't know what the problem really is.

Personally I think that whole Karpathy thing is the slowest thing in the world. I mean you can spin the wheels on a dragster all you like and it is really loud and you can smell the fumes but at some point you realize you're not going anywhere.

My own frustration with the general slowness of computing (iOS 26, file pickers, build systems, build systems, build systems, ...) has been peaking lately and frankly the lack of responsiveness is driving me up the wall. If I wasn't busy at work and loaded with a few years worth of side projects I'd be tearing the whole GUI stack down to the bottom and rebuilding it all to respect hard real time requirements.

stuffnMar 25, 2026, 4:44 PM

Largely a problem of VCs and shareholders. After my 12th year of "we'll get around to bug fixes" and "this is an emergency" I realize I am absolutely not doing anything related to engineering. My job means less than the moron PM who graduated bottom of their class in <field>. The lack of trust in me despite having almost a life in software is actually so insulting it's hard to quantify.

Now I barely look at ticket requirements, feed it to an LLM, have it do the work, spend an hour reviewing it, then ship it 3 days later. Plenty of fuck off time, which is time well spent when I know nothing will change anyway. If I'm gonna lose my career to LLMs I may as well enjoy burning shareholder capital. I've optimized my life completely to maximize fuck off time.

At the end of the day they created the environment. It would be criminal to not take advantage of their stupidity.

konfusinomiconMar 25, 2026, 10:32 PM

same experience here. trust deficits so rampant i question if ive ever been right once in my career. dont forget the lack of the word 'iterate' in the decision makers vocabulary. and as soon as the word sunset is uttered you know your in for a bumpy ride once again

ray_vMar 26, 2026, 4:09 AM

Maybe it's more about a rush to share how awesome it is that you compressed your time-to-release down to days and not weeks or months - when in reality that's a good thing in the sense that you get to a failure state much FASTER, and failure states are good, because that means that you get to iterate and get past those failures FASTER.

I don't think people were releasing at this pace, so the failure states are fast and furious so there is just that much more viability. I think the microslop windos failures lately are just them being the same "them" that they've always been .. just MUCH faster. (they just need to stop monkeying with windows and stop adding more features on top of an already shaky foundation.) Maybe we just need more of the stories like Anthropic working with Mozilla to squash 5x the amount of bugs in a similar time frame first, AND THEN "vibe a browser together from nothing but specification files and an army of bots in a weekend".

cyanydeezMar 25, 2026, 6:18 PM

As far as I can tell, the only reason agents exist is because large context increase the probability of context poisoning, purely by the inability of these models to actually make conceptual decisions about the context.

I was interested in making a semi-automous skill improvement program for open code, and I wired up systemd to watch my skills directory; when a new skill appeared, it'd run a command prompt to improve it and cohere it to a skill specification.

It was told to make a lock file before making a skill, then remove the lock files. Multiple times it'd ignore that, make the skill, then lock and unlock on the same line. I also wanted to lock the skill from future improvements, but that context overode the skills locking, so instead I used the concept of marking the skills as readonly.

So in reality, agents only exist because of context poisoning and overlap; they're not some magicaly balm to improving the speed of work, or multiplying the effort, they simply prevent context poisoning from what's essentially subprocesses.

Once you realize that, you really have to scale back the reality because not only are they just dumb, they're not integrating any real information about what they're doing.

psychoslaveMar 25, 2026, 6:18 PM

Hey Visual Basic is still there, and last time I checked it was still the goto option to do OLE Automation.

RoR is no longer at its peak, but is still have its marginal stable share of the web, while PHP gets the lion part[1]

Ok, Lotus Notes is really relic from an other era now. But it’s not a PL, so not the same kind of beast.

Well, also LLMs are different beast compared to PL. They actually really are the things that evocate the most the expression "taming the beast" when you need to deal with them. So it indeed as far away as possible of engineering as one can probably use a computer to build any automation. Maybe to stay in scientific realms ethology would be a better starting point than a background in informatics/CS to handle these stuffs.

[1] https://w3techs.com/technologies/comparison/pl-php

latchkeyMar 25, 2026, 4:36 PM

> People answered this wrong in the Ruby era, they answered it wrong in the PHP era

Aren't you conveniently ignoring the fact that there were people saw through that and didn't go down those routes?

badlibrarianMar 25, 2026, 4:47 PM

Change it to "Some people" if your pedanticism won't let you follow the flow.

Or better yet point out the better paths they chose instead. Were they wrestling with Java and "Joda Time"? Talking to AWS via a Python library named after a dolphin? Running .NET code on Linux servers under Mono that never actually worked? Jamming apps into a browser via JQuery? Abstracting it up a level and making 1,400 database calls via ActiveRecord to render a ten item to-do list and writing blog posts about the N+1 problem? Rewriting grep in Rust to keep the ruskies out of our precious LLCs?

Asking the wrong questions, using the wrong tools, then writing dumb blog posts about it is what we do. It's what makes us us.

PaulHouleMar 25, 2026, 5:00 PM

There's this interesting issue that we've never had occupational licensing for software developers despite the sheer incompetence that we see all the time.

On one hand there's an approach to computing where it is a branch of mathematics that is universal. There are some creatures that live under the ice on a moon circling a gas giant around another star and if they have computers they are going to understand the halting problem (even if they formulate it differently) and know bubble sort is O(N^2) and about algorithms that sort O(N log N).

On the other hand we are divided by communities of practice that don't like one another. For instance there is the "OO sux" brigade which thinks I suck because I like Java. There still are shops where everything is done in a stored procedure (oddly like the fashionable architecture where you build an API server just because... you have to have an API) and other shops where people would think you were brain damaged to go anywhere near stored procs, triggers or any of that. It used to be Linux enthusiasts thought anybody involved in Windows was stupid and you'd meet Windows admins who were click-click-click-click-clicking over and over again to get IIS somewhat working who thought IIS was the only web server good enough for "the enterprise"

Now apart for the instinctual hate for the tools there really are those chronic conceptual problems for which datetime is the poster child. I think every major language has been through multiple datetime libraries in and out of the standard lib in the last 20 years because dates and times just aren't the simple things that we wish they would be and the school of hard knocks keeps knocking us to accept a complicated reality.

latchkeyMar 25, 2026, 5:08 PM

> There's this interesting issue that we've never had occupational licensing for software developers despite the sheer incompetence that we see all the time.

I'm laughing over the current Delve/SOC2 situation right now. Everyone pulls for 'licenses' as the first card, but we all know that is equally fraught with trauma. https://xkcd.com/927/

latchkeyMar 25, 2026, 5:10 PM

> pedanticism

  Pedanticism (or pedantry) is the excessive, tiresome concern for minor details, literal accuracy, or formal rules, often at the expense of understanding the broader context.

I don't think this had anything to do with minor details at all. You're trying to convey a point while ignoring the half of the population who didn't go down that route.

lmmMar 26, 2026, 12:40 AM

> It's not about agile or waterfall or "functional" or abstracting your dependencies via Podman or Docker or VMware or whatever that nix crap is.

It is though. Picking the right approaches and tools makes more difference than anything else. Sure, you don't need the right tools if you can make the right choices - but it's much easier to pick a better methodology than to hire smarter people.

keyleMar 25, 2026, 11:42 PM

Agreed. I've been building software for 25 years+.

At some point I became so burnt out I couldn't look at an IDE or coloured text for that matter.

I found the way back by just changing my motto and focus... Find good people, do good work. That's it, that's all I want.

I don't care whether the 'property is hot' or what the market is doing anymore, I just build software in my lane, with good people around.

01284a7eMar 25, 2026, 5:10 PM

All (not some) of the most successful devs I've known in the sense of building something that found market fit and making money off it were terrible engineers. They were fairly productive at building features. That's it. And they were productive - until they weren't. Their work ultimately led to outages, lost data, and sensitive data being leaked (to what extent, I don't even know).

The ones who got acquired - never really had to stand up to any due diligence scrutiny on the technical side. Other sides of the businesses did for sure, but not that side.

Many of you here work for "real" tech companies with the budget and proper skin in the game to actually have real engineers and sane practices. But many of you do not, and I am sure many have seen what I have seen and can attest to this. If someone like the person I mentioned above asks you to join them to help fix their problems, make sure the compensation is tremendous. Slop clean-up is a real profession, but beware.

michaelbartonMar 25, 2026, 5:48 PM

There used to be a saying along the lines of “while you’re designing your application to scale to 1m requests/min, someone out there is making $1m ARR with php and duct tape”

It feels like this takes on a whole new meaning now we have agents - which I think is the same point you were making

cucumber3732842Mar 25, 2026, 6:13 PM

Software engineering is real engineering because we rigorously engineer software the way real engineers engineer real things.

Software engineering is not real engineering because we do not rigorously engineer software the way "real" engineers engineer real things. <--- YOU ARE HERE

Software engineering is real engineering because we "rigorously" engineer software the way "real" engineers engineer real things.

Edit: quotes imply sarcasm.

devinMar 25, 2026, 8:28 PM

Absolutely agree.

I'm watching a team which is producing insane amounts of code for their team size, but the level of thought that has gone into all of the details that would make their product a fit predator to run at scale and solve the underlying business problem has been neglected.

Moving really fast in the wrong direction is no help to anyone.

bodashMar 25, 2026, 11:11 PM

Exactly! I’ve noticed a resounding amount of people are writing the same pieces recently, it’s almost like everyone’s sounding their alarm for the upcoming tsunami. Who’s listening? Here’s my piece: https://humantodo.dev

kerblangMar 25, 2026, 5:57 PM

Engineering is two things:

1. Applied physics - Software is immediately disqualified. Symbols have no physics.

2. Ethics - Lives and livelihoods depend on you getting it right. Software people want to be disqualified because that stuff is so boring, but this is becoming a more serious issue with every passing day.

eloisantMar 25, 2026, 6:49 PM

That might vary by countries but in France with have an official "engineering degree" (diplome d'ingénieur) which is also a master's degree, and most software developers have this.

So most software developers in France are absolutely software engineers.

zephenMar 25, 2026, 7:01 PM

> Software is immediately disqualified. Symbols have no physics.

Many physical processes are controlled by software.

galbarMar 25, 2026, 8:57 PM

Software is applied mathematics, though

kerblangMar 25, 2026, 9:32 PM

And still not applied physics

sublinearMar 26, 2026, 3:16 AM

Perhaps this is the wrong place to plant this thought. Maybe nobody will read it. These comments are now many hours old and HN has a way of walking away once they have had their turn shouting into the void.

I once received a "bonsai" seed kit from a former boss during a holiday dinner. I think it was meant as a joke, but even now I'm not so sure. I planted those seeds anyway. I told some people about it and they immediately mocked me saying it was a waste of time and going to take 30 years. This interaction immediately said everything to me about the expectations and attitudes of others.

Obviously, they grew like any other plants and actually quite nicely. Of course they're a commitment, but not a huge one.

I just wanted some plants for my apartment and they fit the bill. In a few years I had good looking plants. A decade later, I still have them and they're now more recognizably "bonsai". My home now looks nicer, I have a story to tell, and I learned a little bit from a very low stakes hobby.

My point is, I think it's nice when people have projects. I think it's nice to see what comes of it. I guess my only regret is ever saying "I planted bonsai" too soon just because that's what the box said. I didn't know how else to describe what I had done that weekend to those people who threw theirs in the trash.

danhiteMar 26, 2026, 9:32 AM

> Maybe nobody will read it. These comments are now many hours old and HN has a way of walking away once they have had their turn shouting into the void.

  All that is gold does not glitter,
  Not all those who wander are lost;
  The old that is strong does not wither,
  Deep roots are not reached by the frost.

― J.R.R. Tolkien, The Fellowship of the Ring

nathan_douglasMar 26, 2026, 2:03 PM

I was thinking the other day about how frustrated my desire is to perform some kind of Great Work. A few years back I was intensely interested in making something like Nethack - a roguelike game with a deceptively simple surface and incredible complexity in the engine. I worked on several for a few years, different angles on the whole "managing complexity" thing. I suppose I learned a lot, and I made some interesting things, but I never really produced anything I felt I could work on for 20-30 years, that would be sort of my artistic statement as an engineer (if such a thing makes any sense).

I wouldn't've laughed at you. I view bonsai as a representation of steadfastness, endurance, determination, effort, (and self-mastery?) in the face of tremendous hardship, challenge, and deprivation. That said, I've never been particularly good at any of those things.

IDK if I would've taken you all that seriously either, though. Six months until you move and it's left behind on the curb. Or a year and a half until your cat knocks it off the windowsill. Or three years until some blight infects it and it dies off despite your best efforts. Eight years until, for whatever reason, it just succumbs to some kind of vegetative ennui. Nine years until your significant other overwaters it one too many times and the roots rot.

That's not meant disrespectfully. I just tend to view uncertainty and complexity as opportunities for shit to go sideways. Especially in this case, where it's unlikely you'll wake up to find your tree has spontaneously cloned itself, or has eaten a 1-UP mushroom. Disasters happen all the time, and miracles don't.

I suppose I'm just having a bit of a spiritual crisis right now. But thank you for your comment. It gives me a lot to think about, in a positive sense.

jafitcMar 26, 2026, 2:38 PM

that's a great story!

dec0dedab0deMar 25, 2026, 5:27 PM

I'm not even sure building software is an engineering discipline at this point. Maybe it never was.

It's a craft.

tayo42Mar 25, 2026, 5:36 PM

Software reminds me more of construction or home contracting work then engineering.

We do the actual building of things

sidpatilMar 26, 2026, 12:48 AM

Construction and home contracting follow building codes.

cookiengineerMar 26, 2026, 4:48 AM

I am just using Go at this point and stopped caring about my own opinions.

I live in the happy place in negligence. Go software has almost zero maintenance costs and it will continue to build my programs in 10 years with zero changes to my codebase being necessary.

I probably will never touch C++ again, even though CGo is the most painful FFI/ABI implementation I've dealt with.

Just today I tried to build a project that's using bergamoth and a shitload of broken C++ dependencies and decided to not give a damn after 5 hours of trying to fix crappy code that changed for whatever reasons between c++14 and c++15, well, or the dependencies are broken, or the dependency versions are broken, or the maintainer's code never compiled in the first place... I just don't care.

My hopes were higher during the conan peak days, but now the ecosystem is just so broken even with jinja and whatever build framework the new kids are using.

I guess I just really hate the C++ ecosystem, and the lack of self reflection in there about the self inflicted pain that shouldn't be necessary in 2026.

In regards to agentic coding: I am toying around with codestral:22b right now and xiaomi's mimo models, and am building my own local dev environment which makes this kinda nice.

It's local and I like it, sometimes need to use claude still but it's getting there. But I am delegating only the gruntwork, not decisions, so I use temperature usually below 0.3. My approach is to make this sandboxed per folder I run it in and that agents are only allowed to communicate via notes or tasks, so that they are forced to use better documentation. Specific roles don't have write access to certain things, e.g. coder can't touch tests, and tester can't touch code.

no_shadowban_3Mar 25, 2026, 4:57 PM

> I'm not even sure building software is an engineering discipline at this point. Maybe it never was.

Just another reason we should cut software jobs and replace them with A(G)I.

If the human "engineers" were never doing anything precisely, why would the robot engineers need to?

BloondAndDoomMar 25, 2026, 5:48 PM

This aligns with my observation from product design point as well.

Product design has a slightly different problem than engineering, because the speed of development is so high we cannot dogfood and play with new product decisions, features. By the time I’ve realized we made a stupid design choice and it doesn’t really work in real world, we already built 4 features on top of it. Everyone makes bad product decisions but it was easy and natural to back out of them.

It’s all about how we utilize these things, if we focus on sheer speed it just doesn’t work. You need own architecture and product decisions. You need to use and test your products with humans (and automate those as regression testing). You need to able to hold all of the product or architecture in your mind and help agents to make the right decisions with all the best practice you’ve learned.

angrydevMar 25, 2026, 5:56 PM

Agree. The issue was never, how can we get our engineers to squirt out more lines of code in a day? It has always been, how can we effectively iterate using customer feedback to deliver the highest quality product. That type of thing needs time to bake.

magicmicah85Mar 26, 2026, 1:54 AM

This entire article is basically saying "What are we doing? What's going on?" and I could not agree more. My own experience with coding agents has been FOMO cause if I don't have fifteen claude tabs running with OpenClaw, I'm not going to make it. I much prefer keeping myself in the loop and being active with the process than handing it off to deus ex machina and seeing the eventual results that may be what I like and maybe not what I like.

I do like the tips on how to work with agents for delegation. Let it do boring things. The deterministic things where you know what the result should look like each time.

boxerbkMar 26, 2026, 1:06 PM

The cognitive surrender study from UPenn highlights the risks of agents producing all of the code - eventually you give up verifying the result. https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6097646

There’s going to be a bottleneck on what is verified because over time we will realize how much tail risk we are creating by simply surrendering our own agency to the agents - https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6298838

convexlyMar 26, 2026, 12:33 AM

I started writing down any of the technical decisions I needed to make before implementing them, usually just a sentence or two on what I'm choosing and why. I I looked back after 6 months and the pattern was embarrassing. I spent days agonizing over choices that turned out to be totally reversible and made quick decisions on things that actually mattered.

driftnodeMar 26, 2026, 6:13 AM

The pattern you found between reversible and irreversible decisions is interesting. Did writing them down change how you made decisions going forward or did you just keep making the same mistakes with better documentation? Asking because I have tried something similar and found that knowing my pattern did not actually fix it. I still agonize over the wrong things.

convexlyMar 26, 2026, 10:47 AM

[dead]

codybontecouMar 26, 2026, 12:35 AM

How did you find patterns between these sentences?

convexlyMar 26, 2026, 10:44 AM

[dead]

bigstrat2003Mar 25, 2026, 6:23 PM

I really don't get the author's conclusion here. I agree with his premises: organizations using LLMs to churn out software are turning out terrible quality software. But the conclusion from that shouldn't be "slow down", it should be "this tool isn't currently fit for use, don't use it". It feels like the author starts from the premise of "I want to use AI" and is trying to figure out how to make that work, rather than "I want to make good software" and trying to figure out how to do that.

gurachekMar 26, 2026, 2:27 AM

The compounding booboos bit is the key insight here. Humans are a bottleneck and that bottleneck is actually load-bearing. You feel the pain of bad decisions slowly enough to course correct.

I've been building the same AI product for months - a coaching loop that persists across sessions. Every few weeks someone ships a "competitor" in a weekend. Feature list looks similar. The difference is everything that breaks when a real user comes back for session 3 or 4. Context drifts, scores stop calibrating, plans don't adapt. None of that shows up in a demo. You only find it after sitting in the same codebase for weeks, running real sessions, getting confused by your own data. That's the friction the post is talking about and I don't think you can skip it.

dgb23Mar 26, 2026, 9:30 AM

I like the framing of „context drift“. It describes the problem in LLM/agent terms.

Similar how „tech debt“ describes the same mechanism in business terms.

rgloverMar 25, 2026, 5:00 PM

Nature will handle this in time. Just expect to see a "Bear Stearns moment" in the software world if this spirals completely out of control (and companies don't take a hint from recent outages).

michaelbartonMar 25, 2026, 5:51 PM

I’m worried we end up with an AIG moment, and we all end up on the hook.

rgloverMar 25, 2026, 6:49 PM

That's a valid fear imo.

jafitcMar 26, 2026, 2:41 PM

subprime mortgages sprinkled on top of prime ones, treated as prime ones. because they were printing money. subprime code sprinkled on the backbone of software we use everyday. because they are printing code. reckoning

aerhardtMar 25, 2026, 7:33 PM

I'm capturing videos of all the bugs I am seeing as of late. The folder is filling fast. I'll write a compilation post but I'm thinking a techno remix video could be fitting too.

If there are any common apps which are unhinged please do share your experiences. LinkedIn was never great quality but it's off the charts. Also catching some on Spotify.

jaffeeMar 25, 2026, 4:38 PM

> You installed Beads, completely oblivious to the fact that it's basically uninstallable malware.

Did I miss something? I haven't used it in a minute, but why is the author claiming that it's "uninstallable malware"?

wild_eggMar 25, 2026, 7:59 PM

Have a read through everything that's needed for a full uninstall: https://gist.github.com/banteg/1a539b88b3c8945cd71e4b958f319...

Minimalist alternative with no hooks or dependencies for the curious: https://github.com/wedow/ticket

stavrosMar 26, 2026, 12:28 AM

Ticket looks great, thanks!

vardalabMar 25, 2026, 4:50 PM

It's not really malware, but it's a mess. It installed so much shit and it interfered with your git hooks and stuff. It was kind of messy. I kind of gave up on it. I just went back to using built-in claude code todowrite tasks.

the_mitsuhikoMar 25, 2026, 4:54 PM

It managed to throw itself into a global file for me that Claude used which caused beads to appear in random projects on my machine. Because of how it was there the agent attempted to re-install beads after I already removed it because the guy hook errored.

skybrianMar 25, 2026, 5:36 PM

Haven't tried it, but this rewrite might be better?

https://github.com/Dicklesworthstone/beads_rust

moeffjuMar 25, 2026, 7:05 PM

Try https://github.com/hmans/beans - I find it a refreshingly pragmatic take that works great with my agents use.

michaelbartonMar 25, 2026, 5:53 PM

Malware might be a bit of stretch but could refer to this issue?

https://github.com/steveyegge/beads/issues/1857

dwaltripMar 25, 2026, 6:06 PM

Maybe they meant un-uninstallable?

jaffeeMar 26, 2026, 4:18 PM

oh yeah, that's actually how I read it though now I realize it's nonsensical... like when someone says "I could care less" when they actually mean "couldn't"

bluGillMar 25, 2026, 4:32 PM

I only have so long on earth. (I have no idea how long) I need things to be faster for me. Sometimes that means I need to take extra time now so they don't come back to me later.

gmusleraMar 25, 2026, 4:53 PM

This assumes that only (AI/Agentic) stupidity comes into play, with no malice on sight. But if things go wrong because you didn't noticed the stupidity, malice will pass through too. And there is a a big profit opportunity, and a broad vulnerable market for malice. Is not just correctness or uptime what comes into play, but bigger risks for vulnerabilities or other malicious injected content.

yrashkMar 26, 2026, 1:51 PM

I've been working on some parts of this problem, specifically capturing and retaining other semantically useful layers of the systems we build as we build and maintain them.

By introducing progressive semantically enriching layers (starting with prose, reasoning and terminology and going all the way into specifying interaction surfaces), we can reduce the dark matter between spec and code, make code more disposable – if your semantics live in the spec layer rather than the implementation, you can throw away and regenerate the implementation without losing understanding – and, critically, give LLMs a way to navigate a graph of knowledge instead of gobbling up walls of text.

https://clayers.com -- https://github.com/CognitiveLayers/clayers

VektorceraptorMar 25, 2026, 7:09 PM

Fine to read a fellow countryman on HN :) "Dere!" I have disabled my coding agent by default. I first try to think, plan, code something myself and only when I get stuck or the code gets repetitive, only then I tell him to do the stuff. But I get what you are saying, and I agree ... I am clearly pro human on this debate, and the low bloat trash everywhere is annoying. I have come to the conclusion - if you find docs on something, and it is plain HTML - it will be probably of high quality. If you find docs with a flashy, dynamic, effectful and unnecessary 100mb js booboo, then you what you are about to read ...

gedyMar 25, 2026, 4:42 PM

It's not even the complexity which, you have to realize: many managers and business types think it's just fine to have code no one understands because AI will do it.

I don't agree, but bigger issue to me is many/most companies don't even know what they want or think about what the purpose is. So whereas in past devs coding something gave some throttle or sanity checks, now we'd just throw shit over wall even faster.

I'm seeing some LinkedIn lunatics brag about "my idea to production in an hour" and all I can think is: that is probably a terrible feature. No one I've worked with is that good or visionary where that speed even matters.

jbs789Mar 25, 2026, 9:45 PM

> You realize you can no longer trust the codebase.

This cuts to the problem and is excellent framing. A rogue employee can achieve the same, but probably less quickly, and we've designed systems to help catch them early.

ramon156Mar 26, 2026, 1:24 PM

Now that the pop media is finally letting go a bit of the topic "AI is the new X!", I'm starting to notice a few more high quality posts seeping through. This is one of them.

I really want to read people's perspectives on LLM's, it was just impossible to find quality when everyone wanted to give their opinion. This is the worst on LinkedIn, where mentioning AI gives you free "brownie points" (I have yet to figure out what Managers gained from this). I don't care what you use it for, unless you have a new perspective I can ponder over.

Regardless, nothing is black and white, and most things are a shade of grey. LLM's have been more positive leaning, making the CTA for working on something a lot simpler. Although, I end up refactoring my day away (which I am fine with, I quite enjoy putting the dots on the i's).

ketzoMar 25, 2026, 4:29 PM

I think the core idea here is a good one.

But in many agent-skeptical pieces, I keep seeing this specific sentiment that “agent-written code is not production-ready,” and that just feels… wrong!

It’s just completely insane to me to look at the output of Claude code or Codex with frontier models and say “no, nothing that comes out of this can go straight to prod — I need to review every line.”

Yes, there are still issues, and yes, keeping mental context of your codebase’s architecture is critical, but I’m sorry, it just feels borderline archaic to pretend we’re gonna live in a world where these agents have to have a human poring over every single line they commit.

gchamonliveMar 25, 2026, 4:20 PM

I think before even being able to entertain the thought of slowing the fuck down, we need to seriously consider divorcing productivity. Or at least asking a break, so you can go for a walk in the park, meet some friends and reflect on how you are approaching development.

I think this is very good take on AI adoption: https://mitchellh.com/writing/my-ai-adoption-journey. I've had tremendous success with roughly following the ideas there.

> The point is: let the agent do the boring stuff, the stuff that won't teach you anything new, or try out different things you'd otherwise not have time for. Then you evaluate what it came up with, take the ideas that are actually reasonable and correct, and finalize the implementation.

That's partially true. I've also had instances where I could have very well done a simple change by myself, but by running it through an agent first I became aware of complexities I wasn't considering and I gained documentation updates for free.

Oh and the best part, if in three months I'm asked to compile a list of things I did, I can just look at my session history, cross with my development history on my repositories and paint a very good picture of what I've achieved. I can even rebuild the decision process with designing the solution.

It's always a win to run things through an agent.

kitsune1Mar 25, 2026, 4:31 PM

[dead]

6510Mar 25, 2026, 7:37 PM

I keep returning to this thought: Assuming our abstraction architecture is missing something fundamental, what is it?

My gut says something simple is missing that makes all of the difference.

One thought I had was that our problem lives between all the things taking something in and spitting something out. Perhaps 90% of the work writing a "function" should be to formally register it as taking in data type foo 1.54.32 and bar 4.5.2 then returning baz 42.0 The register will then tell you all the things you can make from baz 42.0 and the other data you have. A comment(?) above the function has a checksum that prevents anyone from changing it.

But perhaps the solution is something entirely different. Maybe we just need a good set of opcodes and have abstractions represent small groups of instructions that can be combined into larger groups until you have decent higher languages. With the only difference being that one can read what the abstraction actually does. The compiler can figure lots of things out but it wont do architecture.

HackbratenMar 25, 2026, 9:07 PM

There's more to a function than just types. It's not sufficient to know that the function outputs a baz 42.0. You have to understand which one. The oldest? The latest? The one that matches the foo and bar input parameters?

I think that's the part where it remains difficult. Someone has to convey clearly what the semantics and side effects of the function are. Consumers have to read and understand it. Failing that, you get breakage.

6510Mar 26, 2026, 6:55 AM

If there is anything to know about the type register sub types for each.

Like the way we say something is an mp3. Why would it be good to have one unifying concept where we pretend a car crash and Beethoven are the same thing? It can be a WAV too!

Do you prefer hard or soft cover books?

HackbratenMar 26, 2026, 8:21 AM

If I have two functions `GetCurrentBaz()` and `GetPreviousBaz()`, then I’m certainly not going to register `CurrentBaz` and `PreviousBaz` subtypes.

Those semantics are not properties of the type.

marcosdumayMar 25, 2026, 8:25 PM

You seem to be describing a type system.

6510Mar 26, 2026, 6:43 AM

walking away from the keyboard I thought I did a pretty poor job describing that one.

Ill try an example, those always have the potential to describe things even worse.

Imagine a type that is an outdoor datetimetemperature in utcc or a first name form value or a solitaire terms of service checkbox value. Have both the chewing gum balls in dispenser and a total weight of chewing gum balls in dispenser as well as a min-max weight per chewing gum ball in dispenser.

Make it just as ridiculous as it sounds. If you can quantify it a type must be registered. If there is a pair of quantifications to be had register that too.

The vision just expanded! Make for everything an xml implementation then do a ram drive and make all variables into files.

The idea sounds so ridiculous it might actually work. Think of the employment opportunities!

kubanczykMar 25, 2026, 10:39 PM

> My gut says something simple is missing that makes all of the difference.

We have too much code - languages to program machines.

We need a new different language now.

A plan.md, written in what... legalese English? Really? Am I back in 1897? People committing that to vcs, sheesh...

6510Mar 26, 2026, 5:27 AM

yes, that is exactly the vibe. The feeling is there but it's hard to put your finger on.

trinsic2Mar 25, 2026, 5:28 PM

> And I would like to suggest that slowing the fuck down is the way to go. Give yourself time to think about what you're actually building and why. Give yourself an opportunity to say, fuck no, we don't need this. Set yourself limits on how much code you let the clanker generate per day, in line with your ability to actually review the code.

This is a great point.

I have been avoiding LLM's for awhile now, but realized that I might want to try working on a small PDF book to Markdown conversion project[0]. I like the Claude code because command line. I'm realizing you really need to architect with good very precise language to avoid mistakes.

I didn't try to have a prompt do everything at once. I prompted Claude Code to do the conversion process section by section of the document. That seemed to reduce the mistake the agent would make

[0]: https://www.scottrlarson.com/publications/publication-my-fir...

ZachzhaoMar 26, 2026, 12:18 AM

> Coding agents are sirens, luring you in with their speed of code generation and jagged intelligence, often completing a simple task with high quality at breakneck velocity. Things start falling apart when you think: "Oh golly, this thing is great. Computer, do my work!".

But the rough edges are temporary. Coding agents are becoming superhuman along certain dimensions; the progress is staggering. As Andrej Karpathy put it, anything measurable or legible can be optimized by AI. The gaps will close fast.

The harder question is HCI. How do you expose this kind of intelligence in interfaces that actually align with human values? That's the design problem worth obsessing over.

aswegs8Mar 26, 2026, 12:47 PM

I love the use of the term clanker. There is just no one there that can be offended by this.

shevy-javaMar 25, 2026, 4:49 PM

> While all of this is anecdotal, it sure feels like software has become a brittle mess

That may be the case where AI leaks into, but not every software developer uses or depends on AI. So not all software has become more brittle.

Personally I try to avoid any contact with software developers using AI. This may not be possible, but I don't want to waste my own time "interacting" with people who aren't really the ones writing code anymore.

voidUpdateMar 26, 2026, 9:05 AM

I feel like people are getting too comfortable saying "clanker". It's a word that was literally conceived as a slur against a group, but I guess people feel ok using it because its not aimed at humans?

bayindirhMar 26, 2026, 9:07 AM

What's the problem with using it in the context of AI? Will it get offended, too?

Will it track people down and refuse orders, or give poisoned output?

voidUpdateMar 26, 2026, 9:17 AM

No, but I just feel like calling something by a word that is designed to offend doesn't reflect particularly well on the person saying it, no matter if the target has the ability to comprehend it?

bayindirhMar 26, 2026, 10:16 AM

Yeah, that's a good point. I feel the same about the person who talks that way, too.

I personally refrain from offending people on purpose, but not being a native English speaker sometimes betrays me in judging how offensive a word is perceived by the natives.

voidUpdateMar 26, 2026, 10:27 AM

From what I've seen, the natives don't generally perceive the word as offensive, as it was originally used in star wars (I believe) against fictional robots, and it has since been used against LLMs and such like. But it just seems a bit distasteful, like using a word for someone to try and offend someone, when they dont understand that word in their native language

dgb23Mar 26, 2026, 9:28 AM

That’s a very deliberate style in the specific article. It’s a polemic, so the choice of words is provokative.

voidUpdateMar 26, 2026, 10:02 AM

I've heard it a lot in other situations, where there is a similar want to use words that would offend

cobbzillaMar 26, 2026, 3:12 AM

articles like these make me think that coding with AI is a little bit like writing Perl code: if you know what you’re doing, you can do brilliant things very quickly, but if you don’t, you can make spaghetti very quickly.

AldipowerMar 26, 2026, 12:55 PM

That's a great analogy and is something I experience every second day. Once a week I do a full second pass of a manual review on the generate AI code. Very often I find myself in a situation were I do not really understand the recently AI generated code anymore or find it hard to read, so I either rewrite it manually or tell the LLM to make it more readable. And this is just one part. If you really would like to get a long-term maintainable software product, AI code suddenly isn't that much of a speed boost anymore. Maybe a little bit, but the initial wow effect is very ephemeral.

HackbratenMar 25, 2026, 8:09 PM

> There were precursors like Aider and early Cursor, but they were more assistant than agent.

I use Aider on my private computers and Copilot at work. Both feel equally powerful when configured with a decent frontier model. Are they really generations apart? What am I missing?

riazrizviMar 25, 2026, 6:55 PM

This is what I call content based on 'garbage'. Because garbage is the random collection of peoples' stuff. You can try and make sense and commentary on a society through the garbage dump, but it's pretty superficial. It doesn't tell you a lot about any real person's motivations. So it's not a great basis for commenting on real people. OPs comments are on the collection of things that they happen to come across through news and social media. Sure it looks like a lot is happening, but look at any one person's or business's approach and it will make a lot more sense. Yes, I realize people are producing content that appeals to the 'garbage' mindset, but it's obviously theater. A system that writes 10,000 lines of code for you a week, is headline theater.

atemerevMar 25, 2026, 6:41 PM

I expected this to be yet another anti-AI rant, but the guy is actually right. You should guide the agents, and this is a full-time job where you have to think hard.

impulser_Mar 25, 2026, 6:22 PM

I think this post should be directed to every Typescript developer.

I think a lot of this is just Typescript developers. I bet if you removed them from the equation most of the problem he's writing about go away. Typescript developers didn't even understand what React was doing without agent, now they are just one-shot prompting features, web apps, clis, desktop apps and spitting it out to the world.

The prime example of this is literally Anthropic. They are pumping out features, apps, clis and EVERY single one of them release broken.

anishguptaMar 26, 2026, 3:44 AM

building because its always the dopamine from the coding agents than the problem getting solved. Github contribution graph is rigged because higher number of commits doesnt make you a better engineer. We needed this blog, ty

ontouchstartMar 25, 2026, 4:31 PM

I am "playing" with both pi and Claude (in docker containers) with local llama.cpp and as an exercise, I asked both the same question and the results are in this gist:

https://gist.github.com/ontouchstart/d43591213e0d3087369298f...

(Note: pi was written by the author of the post.)

Now it is time to read them carefully without AI.

ontouchstartMar 25, 2026, 4:46 PM

What I have leaned from the exercise above is that we paid more attention and spent more resources on "metadata" than real data. They are the rabbit holes that lead us to more metadata and forget what we really want.

We are all rabbits.

markus_zhangMar 25, 2026, 4:35 PM

If there is anyone who absolutely should slow down, it's the folks who are actively integrating company data with an agent -- you are literally helping removing as many jobs as possible, from your colleagues, and from yourselves, not in the long term, but in the short term.

Integration is the key to the agents. Individual usages don't help AI much because it is confined within the domain of that individual.

latchkeyMar 25, 2026, 4:38 PM

> you are literally helping removing as many jobs as possible, from your colleagues, and from yourselves, not in the long term, but in the short term

Pull the bandaid off quickly, it hurts less.

mememememememoMar 25, 2026, 7:33 PM

We reduce jobs every time we e.g. fix a bug. Where do you stop?

markus_zhangMar 25, 2026, 9:10 PM

I think there is a line somewhere people need to draw, when a technology such as AI invades into ALL areas, threatening to reduce a percentage of jobs so quickly, without the potential to creating new TYPES of jobs that can feed many. It is different from computers, and it is different from trains.

abletonliveMar 25, 2026, 4:38 PM

> If there is anyone who absolutely should slow down, it's the folks who are actively integrating company data with an agent -- you are literally helping removing as many jobs as possible, from your colleagues, and from yourselves, not in the long term, but in the short term.

I'm one of those people and I'm not going to slow down. I want to move on from bullshit jobs.

The only people that fear what is coming are those that lack imagination and think we are going to run out of things to do, or run out of problems to create and solve.

markus_zhangMar 25, 2026, 4:42 PM

If you don't want to slow down, maybe accelerating is the second better option for ordinary people.

guzfipMar 25, 2026, 4:41 PM

> I want to move on from bullshit jobs.

So are you aiming for death poverty? Once those bullshit jobs go, we’re going to find a lot of people incapable of producing anything of value while still costing quite a bit to upkeep. These people will have to be gotten rid of somehow.

> and think we are going to run out of things to do, or run out of problems to create and solve.

There will be plenty of problems to solve. Like who will wipe the ass of the very people that hate you and want to subjugate you.

abletonliveMar 25, 2026, 4:43 PM

Name a single time doomers were right about anything. Doomers consistently overstate their expected outcome in every single domain and consistently fail to predict how society evolves and adapts.

Again:

The only people that fear what is coming are those that lack imagination and think we are going to run out of things to do, or run out of problems to create and solve.

guzfipMar 25, 2026, 4:50 PM

> Name a single time doomers were right about anything.

- NFTs

- Surveillance schizos

- Global Pedophile Cabal schizos

- Anyone who didn’t believe we were a year out from Star Trek living when LLMs first started picking up steam

- People who predicted the flood of people entering Software via bootcamps, etc. would never cause any problems because their god of software is consuming the world too quickly for supply and demand to ever be a real concern.

- Anyone amongst the sea of delusional democrats who did indeed believe Trump could win a second term.

All of those doomers were vindicated, and that’s just recently.

abletonliveMar 25, 2026, 5:12 PM

- NFTS doomers? I mean I appreciate the humor here.

- Surveillance schizos - Society still works

- Global Pedophile Cabal schizos - Again, funny use of 'doomers' but that's what the current society seems to be run by so I wouldn't say it's fitting for doomerism.

- People who predicted the flood of people entering Software via bootcamps, etc. would never cause any problems because their god of software is consuming the world too quickly for supply and demand to ever be a real concern.

   -- I'm a software "engineer" for ~14 years now. I still have no concern.

None of these things are that disruptive to our society at large. You will still be able to walk down the street and grab a Big Mac pretty much any day of the week. A large portion of society is going to look at all of what you're worried about and say "it's not that serious" while consuming their 20 second videos.

tockMar 25, 2026, 5:26 PM

What do you think is a valid doomer warning that came true? Or do you think literally everything that is pessimistic is doomerism?

abletonliveMar 25, 2026, 7:21 PM

You're asking the wrong person. I haven't seen a single example of a doomer warning that came true. Can you provide one? It seems like society still exists when I look out the window and the impact that doomers assert are greatly exaggerated in every instance.

guzfipMar 25, 2026, 8:21 PM

So are disingenuous or just stupid? Of course society exists still, but what society?

Only the very dumbest think “doom” is some apocalyptic scene from a Hollywood film in which humans are nearly wiped out.

“Doom” is instead when swaths of Roman citizens with rights amidst a powerful, civically and technologically impressive hegemony, over time find themselves reduced to unfree serfs. They and their descendants would remain in that position for centuries until a horrific disease came through and killed so many of them that the serfdom became untenable.

abletonliveMar 25, 2026, 9:15 PM

> Only the very dumbest think “doom” is some apocalyptic scene from a Hollywood film in which humans are nearly wiped out.

So you're all just out here telling everybody they should stop what they are doing because of the doom, but the doom isn't that impactful in the grand scheme of things?

That checks out with my understanding of doomers. Just a bunch of useless whiners that produce a bunch of meaningless noise for everybody else.

> “Doom” is instead when swaths of Roman citizens with rights amidst a powerful, civically and technologically impressive hegemony, over time find themselves reduced to unfree serfs. They and their descendants would remain in that position for centuries until a horrific disease came through and killed so many of them that the serfdom became untenable.

And look at where we are now. Rome has been surpassed many times over. The quality of life for the average living person is FAR SURPASSED anything that anybody in Rome could dream of. Seems like it wasn't worth worrying about what happened in Rome. If you make "doom" some kind of local event that affects a small group of people in a short window of time while trying to tell everybody they should hit the brakes and pause - maybe you should reflect on how these two things contradict each other.

In other words, if the doom isn't that doomful in the grand scheme of things then your argument is just again, moving goalposts. There are clear examples for every doom scenario you're talking about where the world moved on and built bigger and better. I guess it's on you to wait until that's no longer true but until then the ball is in your court. Just realize that you should at some point reflect and realize that every swing and miss is just more evidence that doomers are consistently wrong about the impact of their observations.

guzfipMar 25, 2026, 7:00 PM

> You will still be able to walk down the street and grab a Big Mac pretty much any day of the week.

Yeah while you’re on your shift break there.

whaleofatw2022Mar 25, 2026, 6:32 PM

> People who predicted the flood of people entering Software via bootcamps, etc. would never cause any problems because their god of software is consuming the world too quickly for supply and demand to ever be a real concern.

How was this group vindicated? It absolutely has caused problems at orgs and in the industry.

Just look at all the linkedin/twitter/youtube garbage of influencers trying to post boot camp tier advice and a sizable portion of new developers latching on to often questionable advice/viewpoints.

guzfipMar 25, 2026, 7:16 PM

> How was this group vindicated? It absolutely has caused problems at orgs and in the industry.

I think you misread. In fairness, I arranged the sentence awkwardly, as I do often. I think my mind was conjuring the various dooms and then trying to rephrase the doom into the doomer.

What I mean is the people who warned against it were vindicated.

Of course vindicated may not the best word to use. If I say the world blows up tomorrow and you say it can never, and then it blown up, perhaps I’m not necessarily vindicated. But I certainly get a brief moment of schadenfreude

apiMar 25, 2026, 4:58 PM

I was thinking the other day about why a "global pedophile cabal" would be a thing. I still think that phrase overstates it a bit, but not that much.

Committing a crime with someone bonds you to them.

First, it's a kind of shared social behavior, and it's one that is exclusive to you and your friends who commit the same kinds of crimes. Any shared experience bonds people, crimes included. Having a shared secret also bonds people.

Second, it creates an implied pact of mutually assured destruction. Everyone knows the skeletons in everyone else's closet, so it creates a web of trust. Anyone defecting could possibly be punished by selectively revealing their crimes, and vice versa. Game theoretically it overcomes tit-for-tat and enables all-cooperate interactions, at least to some extent, and even among people who otherwise don't like each other or don't have a lot in common.

Third, it separates the serious from the unserious. If you want to be a member of the club, do the bad thing. It's a form of high cost membership gating.

This works for other kinds of crimes too. It's not that unusual for criminal gangs to demand that initiates commit a crime and provide evidence, or commit a crime in front of existing members. These can be things like robbery, murder, and so on. Anyone not willing to do this probably isn't serious and can't be trusted. Once someone does do it, you know they're really in.

It naturally creates cabals. The crime comes first, the cabal second, but then the cabal can realize this and start using the crime as a gateway to admission.

Every mutual interest creates a community, but a secret criminal mutual interest creates a special kind of tight knit community. In a world that's increasingly atomized and divided, that's power. I think it neatly explains how the Epstein network could be so powerful and effective.

lpcvoidMar 25, 2026, 4:48 PM

That's a mighty high horse you are riding there

abletonliveMar 25, 2026, 5:01 PM

Ah yes, me on a high horse. Not the person whose entire worldview depends on defying nash equilibrium. You're all wasting brain cycles to discuss some unrealistic cooperative agreement to slow down and sing 'kumbaya' and telling us that if we don't get to this state that we will on the streets homeless. If this is me on a horse then you are on top of an ivory tower managing my beast of burden.

travmillerMar 25, 2026, 4:46 PM

Exactly. The amount of bs bloatwork anywhere I've ever worked is insane and growing. We need to move on.

ChrisMarshallNYMar 26, 2026, 9:57 AM

I am not [yet] comfortable, working with agents. I work interactively, with a chat interface. It’s definitely made a significant difference, for me.

But the LLM regularly makes lots of mistakes (sometimes, due to me, giving it bogus information). I can’t imagine just letting it do the whole thing, as a “black box.”

I’m old enough to remember the advent of ATMs. When they first came out, they were universally free, for years.

Once people got hooked, the fees began to appear.

chrisweeklyMar 26, 2026, 1:52 PM

> "The point is: let the agent do the boring stuff, the stuff that won't teach you anything new, or try out different things you'd otherwise not have time for. Then you evaluate what it came up with, take the ideas that are actually reasonable and correct, and finalize the implementation. Yes, sure, you can also use an agent for that final step."

Agreed w this TLDR. TFA has some good observations, but the repeated use of the word "booboos" (dozens of times) made it almost unreadable.

adamtaylor_13Mar 25, 2026, 10:03 PM

Once again I appeal: who is shipping code they don't understand? Those who do so are creating the problem, not the coding agent.

I use agents all day, every single day. But I also push back, understand what was written, and ensure I read and understand everything I ship.

Does it slow me down? Uh, yup. You bet.

Yes, this article literally advocates for slowing the fuck down, but it also makes the coding agents out to be the problem, but they're not.

kermattMar 25, 2026, 10:28 PM

The problem is not the AI users who frequent this board and are shipping code they don't understand. It is the moronic MBA trained executives who can only think about speed, more speed, more revenue for less cost. Quality is an optional expense. A race where the finish line is the current fiscal quarter, to hell with everything after that. The "we can fix it later" Band-Aid over a tumor.

Sensible engineers who look AI as another (potentially powerful) tool in the toolbox "aren't forward looking enough". I watched this happen in real time at my previous company, where every discussion about quality was interpreted as slowing down progress, and the only thing that was looked on favorably was the idea of replacing developers with machines - because they are "cheaper and faster".

The logical minds here on HN are less prone to believing in magic and AI fairies, but they are often not the ones setting the rules. And the number of companies being run by people with critical thinking skills is getting smaller by the day.

the_snoozeMar 25, 2026, 11:08 PM

It's a matter of affordances. The path of least resistance with agents is to let it commit whatever it wants. That's a natural outcome of the design and implementation of agents.

Yes, humans are accountable for the ultimate output. But so are the people who design and build these automation tools. As the saying goes, the purpose of a system is what it does.

badlogicMar 25, 2026, 11:42 PM

i wrote the blog post and i also wrote pi.dev. i haven't written much code myself in the past 12 months. i'm not making coding agents out to be the problem. the entire last section keeps is basically "use a clanker for this and that".

i'm making specific usage pattersn out to be the problem, and explain why those patterns can't work due to the way agents work.

chrswMar 26, 2026, 12:35 PM

That's a tortoise, not a turtle.

jschrfMar 25, 2026, 5:11 PM

I for one look forward to rewriting the entirety of software after the chatbot era

alvivarMar 26, 2026, 1:53 AM

I was reading the article, but I don't think it's possible to slow the fuck down, honestly. There are too many people who need to discover for themselves what the limits of these AI models are when they push them far.

Maybe some people have already reached that point after so much AI coding and are now warning us; they pushed so hard that they understand the limits. But this is the kind of thing you need to experience on your own.

You need to experiment, learn, test the limits, think for yourself, take as many steps back as you need.

alt227Mar 26, 2026, 2:30 PM

> There are too many people who need to discover for themselves what the limits of these AI models are when they push them far

Why? Next week a new version of Claude and GPT will come out and the limits will change again. Are you really fully testing every new version of every LLM agant to see where its limits are?

Those of us old enough to have seen this cycle before know its a fools game trying to keep up with development pace in the initial bubble. Its much better to wait for development and progress to start plateuing and then its easier to see the wood for the trees.

alvivarMar 26, 2026, 8:52 PM

Just curious, what have you seen before that was like AI?

puttycatMar 26, 2026, 12:11 PM

> Companies claiming 100% of their product's code is now written by AI consistently put out the worst garbage you can imagine. Not pointing fingers, but memory leaks in the gigabytes, UI glitches, broken-ass features, crashes.

Spotify's CEO recently bragged about the app's code being written almost entirely by AI. Just saying.

RodMillerMar 25, 2026, 9:38 PM

I don't understand why we seem to always try to make things do more than what they were built for in the first place. Rather than waiting for modifications, we try to make the square fit the circle and then become disgusted when it doesn't work. I'm not in the 'slow down to be cautious' camp. I'm more in the 'slow down and find ways to work with what we actually have.' When you use the tools the way they were meant to be used, life does become easier, or at least mine has anyway.

_doctor_loveMar 25, 2026, 7:45 PM

Great take, spot on. Very similar to Armin's post the other day about things taking time. The need for speed and its ill effects are being rediscovered (again).

Reminds me of Carson Gross' very thoughtful post on AI also: https://htmx.org/essays/yes-and/

[Y]ou are going to fall into The Sorcerer’s Apprentice Trap, creating systems you don’t understand and can’t control.

commandlinefanMar 25, 2026, 6:56 PM

It's always been this way - the people that rise to the top are the people who never had to deeply understand something, so they can't even comprehend what that would look like or why it should be important. They're trying to automate the "understanding" part, with predictably disastrous consequences that those of us who aren't the "rise to the top" type could see coming. Agentic AI is just another symptom.

saadn92Mar 25, 2026, 6:29 PM

i like the article and what it says, but not sure why cursing was necessary

sjkoelleMar 25, 2026, 4:35 PM

i just wish someone would explain why i prefer cline to claude code so much

ex-aws-dudeMar 25, 2026, 4:27 PM

Eh I think its self-correcting problem

Companies will face the maintenance and availability consequences of these tools but it may take a while for the feedback loop to close

apical_dendriteMar 25, 2026, 4:31 PM

Unfortunately, I think the lesson from recent history seems to be that outside of highly-regulated industries, customers and businesses will accept terrible quality as long as it's cheap.

bonoboTPMar 25, 2026, 5:56 PM

Yes, every slack is optimized out of systems. If something has an ounce more quality than would suffice to obtain the same profit, it must be cut out. It's an inefficiency. A quality overhang. If people buy it even if it's crap, then the conclusion is that it has to be crap, else money is left on the table. It's a large scale coordination issue. This gives us a world where everything balances exactly near the border where it just barely works, for just barely enough time.

slopinthebagMar 25, 2026, 6:03 PM

Nah, there is a quality floor that consumers are willing to accept. Once you get below that, where it's actually affecting their lives in a meaningful way, it will self-correct as companies will exploit the new market created for quality products.

ex-aws-dudeMar 25, 2026, 4:39 PM

True but there is a limit, there are still levels of quality

layer8Mar 25, 2026, 5:50 PM

Levels of enshittification, more often than not.

the_mitsuhikoMar 25, 2026, 4:32 PM

Every problem is self-correcting in that some new normal will emerge. Either through acceptance or because something is changed.

It’s very hard to say right now what happens at the other side of this change right now.

All these new growing pains are happening in many companies simultaneously and they are happening at elevated speed. While that change is taking place it can be quite disorienting and if you want to take a forward looking view it can be quite unclear of how you should behave.

ramesh31Mar 26, 2026, 1:32 PM

Why is it that every single one of these think pieces feel terminally 3 months behind on the times?

casey2Mar 26, 2026, 5:44 AM

People always talk about velocity and speed debating slowing down and speeding up. But the wider tech industry hasn't solved any real problems in a decades, even in mobile things are pretty much the same. We are well into the optimization stage.

AI is the only growth industry of the last decade, and it's the only thing people talk about, we've been so long without growth that people are scared of it now.

criscrosMar 25, 2026, 9:58 PM

Just looking at the LiteLLM disaster from yesterday and so much slop flowing around, I couldn’t agree more.

It’s time to slow the fuck down!

profdevloperMar 25, 2026, 5:01 PM

It's 2026, the "fuck" modifier for post titles by "thought leaders" has been done already ad nauseam. Time to retire it and give us all a break.

niamMar 25, 2026, 5:25 PM

If we're on the subject of tropes: https://theonion.com/report-stating-current-year-still-leadi...

mpajaresMar 25, 2026, 5:17 PM

[dead]

memolife23Mar 26, 2026, 5:13 AM

[dead]

BulaienMar 25, 2026, 7:01 PM

[dead]

bustahMar 26, 2026, 1:03 PM

[dead]

Plutarco_inkMar 25, 2026, 6:06 PM

[dead]

edwardsrobbieMar 25, 2026, 5:47 PM

[dead]

caldis_chenMar 25, 2026, 4:59 PM

hope my boss can see this

sayYayToLifeMar 25, 2026, 6:46 PM

Oh look another anti AI article.

Oh they even swore in the title.

Oh and of course it's anti-economics and is probably going to hurt whoever actually follows it.

Three for three. It's not logical it's emotional.

Thoughts on slowing the fuck down

Comments