Hacker News Clone

gpmMay 24, 2026, 6:32 PM

An interesting implication of this is that AI inference and training has a path to a ~3x hardware cost reduction (and maybe ~2x total cost reduction) without any technical innovation whatsoever, we just need to wait for dram supply to meet demand (either by manufacturing scaling or just waiting for the current rate of manufacturing to fill the demand spike).

radialstubMay 24, 2026, 7:23 PM

The memory makers will not expand demand drastically. It is in the nature of their business to keep the market under-supplied, otherwise the following oversupply will kill them. Instead, supply is just rerouted from less profitable segments such as mobile and personal computing.

tooltalkMay 24, 2026, 8:59 PM

This is wrong. It is NOT in their nature to keep the market under-supplied -- eg, Samsung, the industry's largest company, was notorious for expanding their capacity during the industry downturn to gain market share while everyone else was cutting back to minimize loss.

I'm guessing you are also probably unfamiliar with the terms like "chicken game" which refers to the cutthroat, high-stakes price wars where dominant semiconductor manufacturers intentionally overproduce and slash prices. This is literally how the industry went from dozens to just three majors today since the 80's.

CGMthrowawayMay 24, 2026, 9:18 PM

You're making the point for him. Undersupply in a boom, store cash to ramp up capacity in a downturn. Presevres capital and avoids overcapacity during the turning

roenxiMay 25, 2026, 12:13 PM

This sounds like a plan to sell less when prices are high and more when prices are low. That is one of the stupidest strategies a company could adopt. I assure you, the RAM makers are pumping out as much as they can and increasing capacity as fast as they think the market can handle.

I'm not sure what world we live in when the scheming capitalists are all hunched around their table working out how to dodge selling their products into an enormous price boom. Do they not like money all of a sudden?

Tuna-FishMay 25, 2026, 6:48 PM

Building new capacity takes years. The idea is that the market is reliably cyclical, so you should expand when there is a downturn, when costs are low and you can afford the short-term capacity hits that expansion causes (fe. when you divide productive teams in two and fill both halves to full strength with new hires).

roenxiMay 26, 2026, 8:51 AM

If you prefer. But we seem to have gone from "undersupply in a boom" to a strategy of oversupplying so aggressively that manufacturers would finish ramping up supply well before the boom before it even happens. And that would be a better strategy.

curiousllamaMay 24, 2026, 9:21 PM

Sure, but the key word here is "was"

The industry is so naturally prone to oversupply that the only stable equilibrium is undersupply. Aggressive expansion kicks off a price war, which immediately undercuts the logic of the expansion.

This only changes with new entrants, which will come, especially from China. But it takes time to build fab capacity, so the medium-term modal outcome is consistent undersupply.

KptMarchewaMay 25, 2026, 3:15 PM

That works when there are dozen suppliers. Does not when there are three.

mlinseyMay 24, 2026, 8:04 PM

If the existing memory makers retains control of the market and don't defect from the optimal-long-term equilibrium for themselves, that's true. It just takes one player to defect for short term gains as we've seen with some past boom-and-bust cycles. Alternatively, it takes a sufficiently-resourced player with enough incentive to enter the market themselves (NVidia, Google, Amazon, the PRC government through one of many companies...)

dev1ycanMay 24, 2026, 9:16 PM

CXMT is scaling up incredibly fast, they are on a clock (south koreans) their monopoly will end relatively soon, although I'm guessing that the AI companies will crash before that anyways.

topspinMay 25, 2026, 10:15 AM

> their monopoly will end relatively soon

Corsair DDR5 DIMM modules with CXMT RAM started appearing on Friday.

djeastmMay 24, 2026, 8:13 PM

Relevant article posted on HN about this a few days ago: https://davidoks.blog/p/ai-is-killing-the-cheap-smartphone

gimmeThaBeetMay 25, 2026, 6:12 AM

I struggle to think of a line of business as cyclical as DRAM, maybe like certain kinds of mining would be my only thought.

The DRAM fabs have been on a roundabout for 40 years going from getting accused of price fixing and cartel behavior, to struggling to keep the lights on.

And imo it's not really their fault, it's all the lead time of advanced semiconductors, combined with the commodity dynamics of oil. And the goal is to match that supply to the demand of everything from consumer electronics to more datacenters than you can shake a stick at.

It's maddening to try and solve that, so at this point I really don't fault them for prioritizing survival.

theshacklefordMay 25, 2026, 11:18 AM

> from getting accused of price fixing and cartel behavior

"Accused" makes it sound like these things may still be up in the air, when they very much are not. I would choose instead the much clearer "A number of those involved in DRAM production have a proven history of cartel behavior and price fixing."

For those who may not be familiar with some of the history in this area:

https://en.wikipedia.org/wiki/DRAM_price_fixing_scandal

gimmeThaBeetMay 25, 2026, 2:41 PM

I said accused mainly because the big 3 won their last antitrust suit in the US, sort of "what have you caught me for, lately?" approach.

For all I know, maybe they are dumb enough to try and actually coordinate again, my hunch would be no, or they've tried something new and inventive. Like Matt Levine talked about how so many landlords were using the same software to set prices, that one was pretty shady.

But it is interesting where it is popping up at the moment, like power transformers is another area. These companies have lived through these cycles before, and know there is no one to save them if they overleverage and get it wrong.

itopaloglu83May 24, 2026, 8:07 PM

Reminds me of how Samsung is giving out $340,000 per person bonuses. Shows you how much of a stronghold they have in market.

zarzavatMay 24, 2026, 11:37 PM

They did that to avoid losing even more money in a strike, not because they wanted to.

Dylan16807May 25, 2026, 2:31 AM

No company ever wants to give out big bonuses, but it's only 10% of their profits. So it still shows the scale of the money they're making right now.

ViacolMay 25, 2026, 2:13 AM

I think you're probably referring to SK Hynix. Samsung's situation was more about dealing with the fallout from the labor strike.

cromkaMay 25, 2026, 10:52 AM

What you described only works if the manufacturers agree to price fix. Otherwise, in a free market, they'll race to increase their earnings by meeting the demand.

Ey7NFZ3P0nzAeMay 26, 2026, 6:19 AM

> It is in the nature of their business to keep the market under-supplied

What?! If they did an anti competitive agreement sure. Otherwise no as each supplier is incentivized to produce more than its competitor and less than the demand, while divesting just enough to survive the oversupply risk.

ec109685May 24, 2026, 7:36 PM

Supply and demand always balance out. There is no way manufacturers aren’t going to compete away these inflated margins, as long as they feel like this demand is sustainable.

kristopolousMay 24, 2026, 7:47 PM

You know there's other strategies? Companies can be more clever than naively undercutting each other...

Memory in particular ... https://en.wikipedia.org/wiki/DRAM_price_fixing_scandal

The entry-cost to getting into memory is on the order of $billions and years - you can do just about anything...

byzantinegeneMay 25, 2026, 2:22 AM

not if china gets into the picture

kristopolousMay 25, 2026, 6:33 AM

why not? i'm sure they can jump into the hustle.

Increasing the availability doesn't mean decreasing the price ... people think those are intrinsically related - not so much.

You can get a prada shirt for $2,000 ... as many as you'd like, for $2,000 a piece. No problem. They'll make the factories go burr all night long. Still $2,000.sweeping

There's a bunch of things like this. $100 bills for instance ...

a new entrant might yield a price drop, or, it might not.

exceptioneMay 25, 2026, 9:01 AM

  > why not? i'm sure they can jump into the hustle.

Not so quick. Critical difference is the relationship between enterprises and the state. In China, the state owns the enterprise, in one way or another. High costs of memory is a threat to the established Chinese electronics manufacturers. The Chinese state can optimize returns at a higher level than the one some petty chip manufacturer operates at, especially if doing so means it could gain coercive geopolitical strength, aka blackmailing.

array_key_firstMay 24, 2026, 7:38 PM

There's very few manufacturers, I believe 3 globally? And there's a large moat. Nobody can compete with them in the next 10 years. It's really not hard to coordinate action between 3 companies.

KeplerBoyMay 24, 2026, 7:47 PM

There are trillions to be made. That moat won't be as insurmountable in hindsight.

array_key_firstMay 24, 2026, 7:58 PM

There really aren't though. The reason there's only three is because memory is a commodity and margins are historically very low. It's not a very good business to be in, generally.

In the past when memory supply was short and then rebounded, many companies went out of business because making memory was no longer profitable.

roenxiMay 25, 2026, 12:22 PM

And margins will continue to be low, otherwise they'll discover they don't have a moat. Commodity markets being competitive is a self fulfilling prophecy.

The companies have two choices. They either produce RAM cheaply and in large quantities, or they get replaced by someone who will produce RAM cheaply and in large quantities. Current incumbents are free to pick which of those two scenarios they prefer.

overfeedMay 24, 2026, 8:29 PM

There used to be over 50 memory manufactures in the US alone. Everytime there was a bust (following a boom) there'd be bankruptcies. The lucky ones got bought out and consolidated. Empirically, attempting to capitalize on memory booms is a losing strategy.

ccoMay 24, 2026, 7:56 PM

Only in the most naive sense.

If it costs you $1B and five years to build out new supply and you think demand will not sustain for more than three years, it does not make sense to expand supply.

Instead you will maintain your margins currently and await demand to decrease back to your current supply.

This is pretty common and as others have pointed out is even more common in markets where competition is slow and lead times are long.

Ammunition is a great example over the last decade or so as political turnover caused relatively short lived demand spikes and manufacturers didn't expand supply because they knew once political winds shift, demand would decrease.

crazygringoMay 24, 2026, 8:28 PM

...which is presumably why GP said "as long as they feel like this demand is sustainable."

jayd16May 24, 2026, 7:34 PM

Apple could always decide to build their own fab or some such thing.

simonhMay 24, 2026, 8:21 PM

That’s not the Apple way, but they might fund a supplier to build out capacity in return for priority access.

The thing is they tend to only do that when they can get a technological competitive advantage. The priority access gives them a locked in competitive edge, for a while. It’s not clear there is an opportunity like that in memory.

jayd16May 24, 2026, 10:39 PM

It wasn't their way to design CPUs until it was their way.

nextaccounticMay 25, 2026, 5:51 AM

Apple doesn't want to enter low margin business

golem14May 25, 2026, 12:47 AM

Designing and producing are separate

weitendorfMay 24, 2026, 10:23 PM

If you factor in Nvidia’s profit margin due to the scarcity of the current bleeding-edge chips there is a path to a much larger cost reduction still.

There’s a lot to criticize Sam Altman for saying or popularizing culturally but I’ve come to think his “this is the worst it will ever be” is, in the long run, actually a very intriguing and underrated point.

In a decade training LLMs to the current level of sophistication, which is in my opinion rather advanced and probably has lots of additional upside just from constructing better RL training regime independently of hardware advancement, will become just as table stakes as running a database is now. I highly recommend everyone look into the Allen Institute’s projects in GitHub and HF because they have open source training materials (including an LLM from scratch off common crawl, and some quite interesting tunes of qwen) to get a taste for what will be in the near future afternoon projects or educational material. The future is going to be wild

oblioMay 25, 2026, 7:19 AM

These crazy hardware price increases will probably delay everything by at least 2-5 years. Then add at least 5-10 years for all these refinements and optimizations to permeate universally.

Until everything matures, most likely the current iteration of OpenAI and Anthropic will be long gone, along with their current business models.

andrepdMay 24, 2026, 7:15 PM

I wonder if we will see an adoption of alternative floating point formats. IEEE floats are notoriously terrible at lower widths (<= 16 bits). Floating point formats such as posits do much better at 16 or 8 bits. If you could train at 16 bits per value instead of 32, and suffer a much smaller inaccuracy penalty than you would from IEEE32 to IEEE16...

refulgentisMay 25, 2026, 2:57 AM

This has been around for quite some time, to the point I had to read this a couple times to understand what you meant. Mighta predated LLMs even.

jfimMay 25, 2026, 12:14 AM

That's already the case with say bf16

Dylan16807May 25, 2026, 3:10 AM

Notoriously terrible?

Posits do a little better if your numbers are biased enough toward 1, but not much better. A 16 bit posit in a near-ideal situation matches an 18 bit IEEE float, and in a pretty wide range of situations loses to either fp16 or bf16.

Training anything at 8 bits is going to be tough, and it's hard to say if the flexible exponent is worth the precision tradeoffs.

andrepdMay 26, 2026, 5:46 AM

> A 16 bit posit in a near-ideal situation matches an 18 bit IEEE float

Unsure what you mean by this... A posit16 has up to 11 bits of precision. There's no such thing as an 18 bit IEEE float.

> and in a pretty wide range of situations loses to either fp16 or bf16

Many papers have compared neural networks at 16 bits or 8 bits, and posits beat the hell out of floats and it's not even close. Which is very much expected. As they're particularly suited to this task. But also in other domains, like numerical weather simulations, where tests have shown 16-bit posits can replace 32-bit floats.

Dylan16807May 26, 2026, 2:28 PM

> A posit16 has up to 11 bits of precision.

Is this excluding the implied bit?

In that case a short float has 10, but if you're messing with formats you can staple on an extra bit of precision and an extra bit of exponent.

> There's no such thing as an 18 bit IEEE float.

There's a lot of custom sizes out there. But if you keep following IEEE rules then there's no special circuitry needed, just a small scaling factor.

nVidia also laid out a 19 bit format that's a superset of both fp16 and bf16.

> Many papers have compared neural networks at 16 bits or 8 bits, and posits beat the hell out of floats and it's not even close.

Can you link a paper that shows posits beating floats at different sizes?

I found a 2021 paper that compares various posits to 32 bit floats, and finds that the model quality is close for some of them. It does not compare any smaller floats.

> Which is very much expected. As they're particularly suited to this task.

Posits show their value when you need a huge exponent range and your numbers focus very closely around 1. How strongly do neural nets fit that pattern?

And how often is their advantage better than 1 or 2 bits?

If you can keep your weights within a range of 9 orders of magnitude, I expect fp16 to do just fine since it loses a bit on some numbers and gains a bit or two on other numbers.

> But also in other domains, like numerical weather simulations, where tests have shown 16-bit posits can replace 32-bit floats.

Can you link this too? I found a 2019 paper that shows them beating fp16 and falling short of fp64, but no fp32 comparison. They also noted that 16,0 posits and bf16 did badly.

They did conclude that 16 bit posits were probably good enough to beat out measurement error and be suitable for the bulk of simulation, but that same chart showed that fp16 was almost good enough. So again I wonder how many bits you'd actually need, since if you're considering rebuilding your FPUs it would be silly to exclude "float sizes that aren't powers of two".

willis936May 25, 2026, 10:02 AM

This line of thinking makes sense if we're talking about opex like power usage. This is capex though and we'll be financing this overpaying for a long time after the hardware has "aged out". Not really sure there is an upside to it.

Also, inference cost predictions were made before this price jump, so we really haven't started paying for it yet. Inference will not be getting cheaper.

overfeedMay 24, 2026, 7:29 PM

It sure looks like Sam Altman's masterful gambit to corner the memory market has had unforeseen consequences.

roxolotlMay 24, 2026, 7:46 PM

Is any of this actually unforeseen? Buying the vast majority of the world’s supply of something does have mostly predictable consequences.

HDBaseTMay 25, 2026, 2:12 AM

Yes, but that's not what they are insinuating.

dragonwriterMay 25, 2026, 2:44 AM

“Unforeseen consequences” in the same way death of the target is when someone aims a loaded gun at their head and pulls the trigger.

liccilMay 25, 2026, 7:37 AM

What demand? Can't shake the notion that it's fictive considering the amount od data centers being built and GPUs sitting in containers, where they will spend quite some time before being even integrated, even more until used...

WaterluvianMay 24, 2026, 6:51 PM

What’s the lifespan/refurbishability of the capex elements like the “GPU” modules or even the DRAM soldered into them?

jmalickiMay 24, 2026, 7:28 PM

For lifespan, AWS is still running a ton of T4 GPUs from 2018, that power a lot of computer vision models. A ton of these will have a long life, not all ML is about frontier LLMs.

epolanskiMay 24, 2026, 11:00 PM

How can it be economically viable to still run them?

You can get 100x the output with the same energy use.

dragonwriterMay 25, 2026, 2:47 AM

While the 100× is, I think, rather hyperbolic, there is a real and large efficincy difference, but its economically viable to run them because the supply of newer GPUs is insufficient to meet the demand for compute, so they can charge enough to cover costs for the old ones and a premium (relative to operating costs) for the newer ones.

It would be economically unviable to run the older ones if the supply of newer ones were unconstrained, but that’s not the world we live in.

Dylan16807May 25, 2026, 3:21 AM

Going by the stats on wikipedia, T4 and B300 both do about one teraflop of half-precision math per watt? Where are the efficiency gains?

Edit: It looks like they replaced INT8 and INT4 with FP8 and FP4, with the same speedups of 2x and 4x relative to FP16. That's an improvement but not that big of an improvement.

EkarosMay 25, 2026, 7:59 AM

As long as you have customers that are willing to pay more than it cost you are fine. And with AWS seemingly there is plenty of those. So question isn't is this most efficient way but will someone pay at price that is above what new hardware could attain.

MarsymarsMay 25, 2026, 12:27 AM

Presumably people using AWS are paying more than they cost to run, and AWS has finite bandwidth to upgrade things due to personel, etc.

jmalickiMay 25, 2026, 12:08 AM

Good question!

Maybe the capabilities of newer GPUs allow AWS to charge higher margins for them? I don't actually know.

HDBaseTMay 25, 2026, 2:14 AM

There has not been a "100x" in efficiency in the past 6-8 years.

fittingoppositeMay 25, 2026, 2:59 AM

Really wondering what this might mean for local LLMs when RAM costs plummet...

refulgentisMay 25, 2026, 12:28 AM

Well, no: manufacturers charge more than input price generally, here specifically, Nvidia wouldn’t lower prices because RAM went down.

eldenringMay 24, 2026, 7:08 PM

2-3x is completely dwarfed by the remaining improvements in training which is still in its infancy relatively

BearOsoMay 24, 2026, 7:17 PM

Unless there's a new paradigm, scaling up is all they can do to improve performance. They've shrunk down all the way to 1-bit models and all the low-hanging fruit is gone. There's no way for them to get much smaller, so they have to get bigger and faster to meet expectations.

intelkishanMay 25, 2026, 2:56 AM

This hasn’t been true for the past 2 years

oblioMay 25, 2026, 7:24 AM

Is this based on an assumption that Opus 4.7 & co are equivalent or smaller to Opus 4.5 & co? I highly doubt the advanced models (Opus, Pro, etc) aren't biggen than the standard ones (Sonnet, Flash, etc) and fairly sure newer models are bigger than older ones.

eldenringMay 24, 2026, 7:49 PM

this is just not true at all, there are massive leaps from algorithms, data, etc. every year. scale is one axis of many and you need to get them all correct.

BearOsoMay 25, 2026, 6:52 PM

What novel data hasn't already been used in training? What new algorithms are there? Can you post some links so we can read about them?

gpmMay 24, 2026, 7:18 PM

Probably, but at some point we're very likely to run out of significant training improvements and it's not clear that we'll see that point coming from a long way out.

Likewise it's probably dwarfed by improvements in how we make dram - continuing the roughly exponential (maybe a bit less recently) scaling of chips - but not necessarily.

The 2x from returning to previous costs is interesting because it's practically guaranteed, and it's on top of everything else. We're just currently "overpaying" (relative to the stable market price) for the manufacture of dram because of a sudden increase in demand.

eldenringMay 24, 2026, 9:14 PM

my reply from the other thread fits here too:

> this is just not true at all, there are massive leaps from algorithms, data, etc. every year. scale is one axis of many and you need to get them all correct.

fittingoppositeMay 25, 2026, 3:05 AM

> either by manufacturing scaling or just waiting for the current rate of manufacturing to fill the demand spike

Or the more likely scenario that the AI bubble bursts and the hyperscalars realize they have built too many data centers.

shevy-javaMay 24, 2026, 7:19 PM

> a path to a ~3x hardware cost reduction

Really?

How long do we have to wait until that ... cost reduction hits us?

gpmMay 24, 2026, 7:34 PM

For supply to meet demand. Depends very much on how aggressively producers scale and on how demand grows or shrinks.

Safe to say at least a year or two. It'd be shocking if it took a decade.

da_chickenMay 25, 2026, 3:25 AM

All the projections I've seen have said that the earliest we might see the curve flatten is 2030.

It just takes that long to get a fab up and running.

slicktuxMay 24, 2026, 5:16 PM

I bought 96GB of RAM a couple of years ago for ~$250. That same RAM now costs $1200!

mchusmaMay 24, 2026, 5:46 PM

Everything I read seems to suggest that RAM capacity is going to grow at 20-25% a year, which just doesn't seem good enough. Even in consumer use cases, phones and laptops would benefit greatly by double the amount of RAM. And then obviously, the AI need is gigantic.

I don't see it going away. I mean, it may not grow as fast as now, but I don't see it growing away either. I get why the memory makers do not want to bankrupt themselves, but it feels like there's got to be some way to push that risk off onto model providers and other people in the ecosystem to allow us to grow ram capacity more like 50% per year.

KronisLVMay 24, 2026, 5:34 PM

I'm not moving past my DDR4 build (and the 32 GB of DDR4 2133 MHz backup chips I still have around from way back, before I got the current 3200 MHz ones) until the prices go back to being at least partially sane. This also means that CPU manufacturers are not getting my money (since the 5800X is fine for now) and I have no reason to get a new GPU either (though admittedly the B580 isn't perfect).

johnvanommenMay 24, 2026, 6:04 PM

What if this is the lowest that prices will ever be?

mrandishMay 24, 2026, 7:29 PM

As Yogi Berra famously said, "It's tough to make predictions, especially about the future." But based on historical tech industry trends, a price increase in one component that's this rapid and extreme, is likely to eventually regress somewhat toward the long-term trend line - even if that trend line experiences a longer-term shift upward.

As always, some interpret certain recent events as reason to conclude "but this time it's different." Occasionally they are correct. But that doesn't change the fact that it's reasonable to assume some of the recent extreme, rapid price inflation is due to shorter term market distortion. It's also pretty clear that some of the recent increase in demand represents a stable increase in the long-term trendline. The question is how much is long-term stable and how much is short-term distortion.

KronisLVMay 24, 2026, 8:00 PM

Then I will make my build last as long as it can, in protest of that. I do expect at least a performative price drop in the coming years, though.

willis936May 25, 2026, 10:07 AM

Then I better divert all of my investment into memory maker stocks.

stringfoodMay 25, 2026, 11:21 PM

Memory manufactures don't want your money anymore, Micron just left consumer market 6 months ago and says we want to be B2B from now on, and who can blame him? https://investors.micron.com/news-releases/news-release-deta...

daringrain32781May 24, 2026, 11:15 PM

[dead]

oceanskyMay 24, 2026, 5:22 PM

Awful time for gamers and PC hobbyists not fully into AI.

Legend2440May 24, 2026, 5:39 PM

I wonder why the hyperscalers aren't vertically integrating more and building their own fabs. Sure, a fab costs a billion dollars, but they're currently spending hundreds of billions of dollars purchasing chips from NVidia and others.

elorantMay 24, 2026, 5:24 PM

Bought a second hand Dell server a week ago. The entire rig with a 12-core CPU and 32GB DDR4 ecc RAM cost as much as I'd pay to buy 64 GB of DDR RAM alone. I hope there's an end to this absurdity soon enough otherwise the pain will affect other markets too. I read the other day that PC case sales have collapsed by more than 40%.

proeeMay 24, 2026, 7:59 PM

Memory manufactures sit on a war chest of IP. So even if someone has excess fab capacity and wants to get into memory manufacturing, they will have to fight an uphill battle of about a zillion patents.

Most memory companies have backroom deals to exchange tit-for-tat patent violations against each other.

Not sure how a new memory manufacture comes into being without getting sunk from licensing costs?

byzantinegeneMay 25, 2026, 2:28 AM

china?

deadbabeMay 24, 2026, 5:22 PM

Here’s the thing, what if memory manufacturers take this opportunity to collude and basically never reduce the price of memory below the current levels since it’s too hard for a new competitor to just rise up and undercut them? Everything I hear about is how hard and risky it is to spin up a new fab.

And by doing this, they ensure local LLMs never become feasible for the vast majority of people and AI companies solidify subscriptions forever.

johnvanommenMay 24, 2026, 5:48 PM

I really don’t want to give anyone ideas, but doesn’t this make the Nvidia 5090 an unbelievably good deal right now?

The VRAM in the 5090 is only made by one country in the world.

The 50xx series is special, because its ram is so dependent on a single commodity. It’s not like a 4090 or a 3090; their VRAM chips have been around for years.

If there’s a shortage or interruption in DDR7 VRAM, it seems like every GPU that requires it would explode in value.

I hope I don’t regret posting this because I’d really like to buy one myself…

skiing_crawlingMay 24, 2026, 5:41 PM

I recently built a system at insane ddr4 prices ($2000 for 256gb). But that’s only after seeing how ddr5 prices were 3-4x that!

preisschildMay 24, 2026, 5:51 PM

Yeah I upgraded all of my systems to DDR5 last year, so now I have to buy for ddr5 memory upgrades.

Joel_MckayMay 24, 2026, 5:56 PM

Had to fork over almost $1k for a 64G DDR5 kit a few weeks back. At least AMD chips large L3 cache allows folks to get away with lower grade udimms.

Also had to do an Intel build, and there was no way we were going cudimm at current prices. =3

cineticdaffodilMay 24, 2026, 9:50 PM

I find it deeply ironic, that iran has blocked helium supply- while it relies on AI created slopaganda to subvert its advesary. Its one of those afterwits of history.

YlpertnodiMay 25, 2026, 2:33 PM

> iran...slopaganda

A US soldier i know commented that the iranian ai slop is "scary and powerful".

cloudengineer94May 24, 2026, 10:30 PM

With how things are going, I'm really wondering how we are gonna tackle the consumer market for things like gaming and machine learning.

No doubt Cloud Gaming is in the cards for the future, only purists like myself with an RTX 5090 will pay premium for offline gaming

weitendorfMay 24, 2026, 10:36 PM

In the long run cloud gaming is inevitable, it’s just more economically efficient for the cost of the hardware required to render graphics to be amortized across consumers and not sit idle when being unused by collocating them with game assets in POPs.

Once enough gaming compute runs at the edge it also allows for more technically advanced games than would currently be economically feasible (but aren’t made mostly for lack of a market/adoption of cloud gaming and the resulting lack of technical know-how). So I think it will stick and probably end up winning over the holdouts, once the cost of rendering the games they want to play with consumer hardware becomes too large to stomach.

MarsymarsMay 25, 2026, 2:41 AM

You could make the same economic argument for any SaaS, but the margins SaaS providers look for make it so that the only time it isn't cheaper to run your own software/hardware stack in place of SaaS is when the hardware requirements are very low, not high. SaaS makes sense economically when you take into account the admin, compliance, etc. costs... and the admin costs of a Nintendo Switch are pretty low.

willis936May 25, 2026, 10:15 AM

Economic efficiency does not win the day because the free market is a myth. Cloud gaming is a technically worse solution because the latency floor is higher. It's a microeconomic disaster (rent vs buy, buy wins). The only reason it would become a thing is if the multinationals succeed in concentrating more wealth and power, which consumers aren't interested in supporting. It's a bad deal and consumers know it. They would have to be forced into it by having the consumer hardware market taken off the table (which is happening and the only possible avenue for a technical regression like cloud gaming to have a market).

MrGilbertMay 24, 2026, 5:33 PM

I assume that memory manufacturers don’t really care where the money is coming from, as long as the "numbers go up" game is working.

NVIDIA in their recent quarterly report stopped categorizing "Geforce" as a single category, and merged it into "Edge-Computing".

If you are a PC Gamer or PC Enthusiast as I am, then we have some dark times ahead.

reactordevMay 24, 2026, 5:35 PM

Do we though? DLSS 5 changes that somewhat from a “we need powah” to “we need models”. I think the future consumer GPU market will be tuned for image and world inference while workstation cards will be tuned for image and video inference. The old way of thinking about this will come to an end when we stop looking at the render loop as the be-all-end-all…

Or, we could be fucked.

kgMay 24, 2026, 6:38 PM

If DLSS 5 becomes the norm it's possible that just makes things worse. The DLSS 5 demos required an entire separate card to run the model, though IIRC NVIDIA did claim it would eventually work on a single card. Given what the model is doing (yassifying the whole scene instead of just upscaling/reconstructing) it makes sense to me that it would increase compute demand instead of reduce it like previous versions of DLSS.

reactordevMay 25, 2026, 5:32 PM

The demos did, but look how far we have come in just two years? Running local LLMs, running local diffusion models, running local world models (albeit, barely a scene at this point). I do believe that in 10 years time, game will be producing latents and not events they way they do now. I also hope this means that VR can finally get the fidelity it needs to really take off.

MrGilbertMay 24, 2026, 7:23 PM

From my point of view, I suppose we will enter a "Let AI generate entertainment" era. In which you just might rent everything, including games. No need for a beefy computer at home, you just need a slim endpoint:

"Order yours now, for just $99.99 per month, hardware included! Order today, and you will get three months of 'Office Suite' for free, with a small additional cost of $49.99 after month 4. On a tight budget? Switch to the yearly subscription, and pay comfortably in 18 installments."

reactordevMay 25, 2026, 5:30 PM

On your Karna card…

TraubenfuchsMay 24, 2026, 5:47 PM

Why did this happen so suddenly?

Why were tech savy investors unable to figure this out when the datacenter craze had already started?

How to explain this lag between quickly rising demand for all datacenter components besides memory?

skybrianMay 24, 2026, 6:09 PM

RAM is a boom-and-bust industry, so memory manufacturers were reluctant to invest. Here's a good blog post on the economics:

https://davidoks.blog/p/ai-is-killing-the-cheap-smartphone

Maybe long-term purchase agreements from big buyers might have helped convince them it's okay to build, but apparently it didn't happen.

johnvanommenMay 24, 2026, 5:54 PM

Nine years after Google's seminal paper lit the fuse on AI, a total lack of manufacturing foresight has trapped over a trillion dollars of incoming capital in a hardware bottleneck.

The entire sector is now facing a critical RAM starvation crisis where memory manufacturers are actively slow-rolling supply just to keep prices high and avoid running out entirely.

This has created an unprecedented supply-and-demand distortion where desperate companies are getting rejected even at a 5x markup, and mission-critical SKUs are skyrocketing to 10x and 20x their baseline value.

It is a macroeconomic squeeze at a staggering scale, and the massive venture scale opportunity lies in capturing the value created by this memory gatekeeper.

From the perspective of an armchair economist, the winners will be the investors who invest in RAM wisely. The losers will likely be cash strapped SAAS companies. They’re almost completely dependent on a fleet of servers in the hyperscalers, and they’re leasing those servers and services. That leaves small SAAS companies exposed to incoming inflation in the cost of hosting.

irthomasthomasMay 24, 2026, 6:19 PM

A lot of words to say that Sam Altman bought up the worlds total supply of ram chips for the next few years.

AuracleMay 24, 2026, 7:23 PM

A dick move or just really prescient?

regularfryMay 24, 2026, 7:49 PM

It's only prescient if it works out. But it's a dick move either way.

chairmansteveMay 24, 2026, 6:51 PM

"That leaves small SAAS companies exposed to incoming inflation in the cost of hosting".

Which they will pass on to their customers. If their product provides enough value the customers will pay.....

vb-8448May 24, 2026, 6:25 PM

Capex expenditure start exploding after covid with the chart going hockey stick at the end of 23/start of 24, almost 2.5 years ago.

A lot of capex is supposed to go into the datacentres, didn't they know that datacentres need to be filled among other stuff with RAM? I wonder if at some point we will discover that there is a shortage of fibre optic cables of SFPs ...

PS: Obviously armchair economist here too ... but for it doesn't seem too difficult to foresee the increase of the demand.

LPisGoodMay 24, 2026, 6:16 PM

The same reason they didn’t all sell everything to buy NVIDIA the day chatGPT came out

zeristorMay 26, 2026, 12:42 AM

Or to put it another way, the prices will only come down the other side of an intense catastrophe.

AI growth is locked in now, only if it were to stop will demand be abated.

maxnevermindMay 25, 2026, 2:25 AM

I wonder if it is reasonable to assume the propagation of shortages further. At first it was GPUs, then RAM, then what?

aceazzameenMay 25, 2026, 2:27 AM

Fresh water?

DoctorOetkerMay 24, 2026, 5:39 PM

It's still unclear to me: the shortage is semiconductor boules / wafers? or the shortage is semiconductor fab process step availability?

As long as the discussion seems focused on memory, I'd suspect the latter, but if its really the semiconductor boules/wafers, then I'd expect the boule growers to profit, not the memory makers, who just pass on the cost.

So which is it?

flykespiceMay 24, 2026, 10:14 PM

Since memory is becoming an expensive commodity, I guess the old ways of being precious on the efficient memory usage of your program (like it running on the constrained 1mb memory back then) are making a comeback.

I only feel sorrow for the electron devs, they will have a hard time.

notnullorvoidMay 24, 2026, 8:21 PM

Good time to focus on more memory efficient means of training and inference.

SeedLM from Apple is an interesting approach for inference memory efficiency. I'd like to see someone try and build that into training so that it's not a post training compression step.

I_am_tiberiusMay 24, 2026, 6:30 PM

It seems to me the max memory you can buy in a laptop stagnated for the past 3 years or so.

superkuhMay 25, 2026, 2:40 AM

And the max storage in pre-built computers has stagnated at 2010 levels (~1TB). This was first due to the switch to the much more expensive and much faster charge trap flash. In the 2020s it finally started to approach 2010 sizes in pre-builts but then the corporate finance wars re: fab capacity happened.

giancarlostoroMay 24, 2026, 6:36 PM

I have always felt insulted that most laptops even offer a low 4 GB of RAM I rather take 16 GB in previous gen memory

ffaccount2May 24, 2026, 6:48 PM

My several years old laptop has 128GB of RAM, is that not enough? I admit that it's a pretty heavy one.

grapedangackleMay 24, 2026, 6:54 PM

[dead]

rldjbpinMay 25, 2026, 8:26 AM

for the most part, unless soldered down, it has been hard to find higher than dual channel (maybe quad for a massive odm gaming laptop). each stick and platform having set maximum memory capacity has put a glass ceiling for those machines.

doesn't matter anyway when things are not reasonably priced. i am stuck at the same memory capacity in my personal system for the better part of two decades, partially due to the above and the current pricing today.

shevy-javaMay 24, 2026, 7:19 PM

I think the companies that drive up the prices here, need to pay an extra-tax to all of us. I fail to see why I now have to pay more due to the AI monster companies ruining the economy.

blindriverMay 24, 2026, 8:30 PM

Since January, I've been lucky and picking up various used DDR4 memory sticks for cheap-ish. I got a total of 64 GB for $180. I feel like I hit the jackpot!

chvidMay 24, 2026, 5:46 PM

Time to let ASML sell to the Chinese memory producers … or not.

Escapade5160May 25, 2026, 1:08 AM

And four-fiths the cost of a consumer PC build.

ecommerceguyMay 24, 2026, 7:04 PM

As models gain efficiency, will the need for ram cool?

throwatdem12311May 24, 2026, 7:25 PM

They’ll just fill up the ram with bigger models. Demand will INCREASE, not decrease.

helterskelterMay 24, 2026, 8:40 PM

Every time we add capacity with almost anything, we find ways to saturate it.

robot_jesusMay 24, 2026, 11:25 PM

Braess's paradox for roads. When we add capacity to road networks, traffic increases even more than the capacity.

https://en.wikipedia.org/wiki/Braess%27s_paradox

kingstnapMay 24, 2026, 9:18 PM

Jevons paradox is at play. Right now frontier AI is very expensive which heavily suppresses demand.

If you made it 10x cheaper right now you would see a truly unimaginable wave of bot slop.

IAmGraydonMay 25, 2026, 3:40 AM

Built a new machine with 64GB DDR5 and 5TB SSD in January 2025. It's sheer luck that I dodged that bullet.

Jasonwang123May 25, 2026, 2:43 AM

The cost of memory should continue go up as we tend to have the AI to have context and remember lots more.

TheGrassyKnollMay 24, 2026, 5:44 PM

I wish I had figured that out a year ago. MU up ~10x, SNDK up ~37x. My crystal ball is woefully under performing.

inciampatiMay 24, 2026, 9:18 PM

Memory makes computation universal.

luxuryballsMay 24, 2026, 10:58 PM

it’s fun and ironic that “having a memory” is what AI appears to lack the most in practice while at the same time it demands more computer memory than anything to run

amazingamazingMay 24, 2026, 5:33 PM

A commodity rapidly increasing in price. What could go wrong?

abhaynayarMay 24, 2026, 11:59 PM

How can I use this information to MY advantage? Do I started going into something to do with AI chip memory-stuff? If so, how? But just on a software level cause hardware is hard.

ElenaDaibunnyMay 25, 2026, 3:29 AM

unified memory architectures are getting more interesting for inference workloads.

ck2May 24, 2026, 6:34 PM

if we survive the bubble bursting and there isn't a "too big to fail" bailout with public money manipulation by bought politicians

we are going to have amazing cheap used hardware for a decade

brcmthrowawayMay 24, 2026, 5:36 PM

Anyone invested in Micron stock?

lostloginMay 24, 2026, 6:51 PM

Up 700% in a year.

WallstreeetBets has been disturbingly accurate in its predictions - basically anything related to AI.

emsignMay 25, 2026, 8:44 AM

AI is choking the computing economy. Many companies will die. It's already a mass extinction event and will leave behind deserts.

positron26May 24, 2026, 5:18 PM

The algorithm advances are going to crash this so hard.

jalospinosoMay 24, 2026, 8:11 PM

[flagged]

hottrendsMay 24, 2026, 10:44 PM

[flagged]

Developer_HMay 25, 2026, 3:20 AM

[flagged]

danborn26May 24, 2026, 8:27 PM

[flagged]

thinklanceaiMay 24, 2026, 6:44 PM

[flagged]

LapsaMay 24, 2026, 6:13 PM

[dead]

SeattleAntifaMay 25, 2026, 7:59 AM

[dead]

Memory has grown to nearly two-thirds of AI chip component costs

Comments