More

fooster · 2026-04-18T21:23:33 1776547413

Sorry you are missing out. I use claude all day every day with max and what people are reporting here has not been my experience. My current usage is 16% and it resets Thursday.

fooster · 2026-04-16T19:58:50 1776369530

I didn't experience that at all. I know there are lots of rumblings around here about that, but I'm posting this to show this wasn't a universal experience.

fooster · 2026-04-12T14:07:25 1776002845

Where is your evidence of this "massive cost"? Inference is massively profitable for both anthropic and openai. Training is not.

kibwen · 2026-04-12T14:17:55 1776003475

The evidence is that quotas exist, as seen here, and are low enough that people are hitting them regularly. When was the last time you hit your quota of Google searches? When was the last time you hit your quota of StackOverflow questions? When was the last time you hit your quota of YouTube videos? Any service will rate limit abuse, but if abuse is indistinguishable from regular use from the provider's perspective, that's not a good sign.

jerf · 2026-04-12T14:50:05 1776005405

It's also kind of interesting that they don't think they can do what an economy would normally do in this situation, which is raise prices until supply matches. Shortages generally imply mispricing.

There's a lot of angles you take from that as a starting point and I'm not confident that I fully understand it, so I'll leave it to the reader.

8note · 2026-04-12T23:39:48 1776037188

the sales pitch is that you can keep throwing more and more tokens at a problem to solve it.

if the prices dont keep going down, the pitch falls apart, that you need a specialist to come in and make it work

caminante · 2026-04-12T14:54:55 1776005695

Great point.

The parent's argument is that the marginal cost of inference is minimal. However, the fundamental flaw is that he's separating inference from the high cost frontier models. It's a cross-subsidy that can't be ignored.

bachmeier · 2026-04-12T17:04:56 1776013496

Without any insider knowledge on the economics of these companies, I suspect it's that the amount of infrastructure you have to build is determined by peak usage rather than average usage. If peak usage is much higher for a small part of one day a week (say on Monday morning as software developers across the US get back to work) the cost of fulfilling demand at all times can be insane. That's why companies are implementing batch/standard/priority pricing for the API.

caminante · 2026-04-13T17:30:53 1776101453

Check out this article from today [0].

It sounds like it's more of a profit maximization function (and not just demand) with GPU rental prices increasing 48% since Feb.

> Renting one of Nvidia’s most-advanced Blackwell generation of chips for one hour costs $4.08, up 48% from the $2.75 it cost two months ago, according to the Ornn Compute Price Index.

[0] https://www.wsj.com/tech/ai/ai-is-using-so-much-energy-that-...

weakfish · 2026-04-12T14:43:19 1776004999

This article convinced me otherwise https://www.wheresyoured.at/the-subprime-ai-crisis-is-here/

Narciss · 2026-04-12T15:13:52 1776006832

This is a great article, thanks for sharing

byzantinegene · 2026-04-13T06:12:49 1776060769

good article!

ares623 · 2026-04-13T04:14:22 1776053662

You're assuming they can just stop training. For the entirety of these companies' existence, they have done training. It is part of their price. They must keep pushing out better and better models. That's like saying Nvidia can just stop making new GPUs, they're obviously making so much money with their current models now.

scrollop · 2026-04-12T14:17:53 1776003473

The majority of accounts are free - these are profitable?

IMO they need as many users before their IPO - then the changes will really begin.

reppap · 2026-04-13T11:49:27 1776080967

Inference cannot happen without training the model first, so the distinction is quite pointless.

quikoa · 2026-04-12T15:11:30 1776006690

Inference for API or subscriptions? There is a massive price difference between the two.

wesammikhail · 2026-04-12T14:11:59 1776003119

source?

KaoruAoiShiho · 2026-04-12T14:14:40 1776003280

After googling https://www.reddit.com/r/singularity/comments/1psesym/openai...

wesammikhail · 2026-04-12T15:12:33 1776006753

I've seen sources like this before. It's all hearsay and promo. I was asking for any publicly available verifiable information regarding the cost of inference at scale. I haven't seen any such info personally which is why I asked.

I'm dying to see S-1 filing for Anthropic or OpenAI. I don't actually think inference is as cheap as people say if you consider the total cost (hardware, energy, capex, etc)

KaoruAoiShiho · 2026-04-12T17:17:48 1776014268

Well they're not public yet so you'll have to put up with rumors. But the numbers are available for companies like DeepSeek say they have an 80% profit margin, so it stands to reason OAI etc would do similar numbers considering they charge much more.

caminante · 2026-04-12T19:37:06 1776022626

AFAIK,

1. the 80% margin from 2025 was theoretical,

2. they're relying on distillation/synthetic data for training,

3. and have been very opaque about cross-subsidization of R&D with their models.

The distillation alone adds a big asterisk for comparisons.

KaoruAoiShiho · 2026-04-12T20:02:35 1776024155

Talking nonsense.

caminante · 2026-04-13T00:24:53 1776039893

> you'll have to put up with rumors.

> But the numbers are available for companies like DeepSeek

You'd rather trust self-reported figures? LMAO

caminante · 2026-04-12T14:26:17 1776003977

>OpenAI's compute margin, referring to the share of revenue excluding the costs of running its AI models for paying users

Huh?

The reddit summary comment makes no sense. How are they getting revenues without ads or paying customers?

"After" makes more sense.

FTA:

>The company has yet to show a profit and is searching for ways to make money to cover its high computing costs and infrastructure plans.

fooster · 2026-04-08T23:23:42 1775690622

Why don't you do that? Instead of doing that you unfairly critisize the guy that gave away his time and effort on something which no-one is forcing you to use. Shame.

wesz · 2026-04-08T23:33:11 1775691191

Oh man, he must have gave away so much time prompting ai agent. Shame on me:)

latchkey · 2026-04-08T23:37:14 1775691434

I wrote code. But, the whole point of this project for me was to experiment with how far I could take AI as an assistant to help me with coding. So far, I've been extremely impressed.

fooster · 2026-03-28T02:41:30 1774665690

The Russians certainly did interfere in the 2016 election. It was not bullshit.

parineum · 2026-03-28T08:17:09 1774685829

Define "interfere". Be specific.

the_why_of_y · 2026-03-28T09:49:07 1774691347

https://www.justice.gov/archives/sco/file/1373816/dl

https://www.intelligence.senate.gov/sites/default/files/docu...

fooster · 2026-03-20T16:56:27 1774025787

I mean as if anthropic and openai did.

fooster · 2026-03-16T12:14:52 1773663292

“one PM will organize the work for half a dozen developer”

That isn’t the job of a PM.

fooster · 2026-03-08T04:00:50 1772942450

Jack booted thugs shot a women in the face for the crime of sitting in her car and the administration called her a terrorist. Nothing happened to the thug.

Jack booted thugs shot a man in the back for the crime of defending a woman and the administration called him a terrorist. Nothing happened to that thug either.

insane_dreamer · 2026-03-08T05:50:29 1772949029

Absolutely. But what people don't realize is this sort of thing happens in China too, it's just never reported or heard about, other than some whispers here and there, because of such tight control over the media, the internet, and public discourse. In the US, as much as the fascists are trying to take over, at least you can still protest and make your voice heard.

kelseyfrog · 2026-03-08T06:55:23 1772952923

Source?

insane_dreamer · 2026-03-09T15:02:43 1773068563

Me. I lived and worked there for a number of years. When you talk to the local people in confidence you find out about this stuff, and you sometimes catch glimpses of it before it's scrubbed from local social media. There's a very high level of control.

kelseyfrog · 2026-03-09T15:22:51 1773069771

How can I, as a Westerner, verify this?

insane_dreamer · 2026-03-09T20:02:46 1773086566

Spend a few years in China.

Most people don't understand this about China, and most reporters who go there are like "I spent 2 weeks in China and here's what it's like", or "I spent a semester studying at Beida (Peking U)".

10+ years ago there were places where social media posts were archived by some brave individuals before they were scrubbed so you could see what words were being targeted by censors. But that's long shut down to my knowledge.

kelseyfrog · 2026-03-09T21:14:31 1773090871

Sadly the forum posts were returned to angel Moroni. Really wishing I had a seer stone rn :(

rockskon · 2026-03-09T16:58:28 1773075508

There are news stories that come out occasionally from reputable sources.

kelseyfrog · 2026-03-09T21:11:57 1773090717

Anything I can read? So far to believe it, I either have to move to and live in China for several years or "trust me bro". Neither of those are epistemically viable. It should be clear why.

insane_dreamer · 2026-03-10T18:25:48 1773167148

Why can I say? Sometimes the only way to really learn about a country esp one with such tight control over information is to live there a decent amount of time. And since not many investigative journalists are doing that in China any more (China’s not likely to let them stay if they’re reporting the truth), it’s tough. If you really want to know, you may have to go find out for yourself.

fooster · 2026-03-01T01:02:19 1772326939

I don't find that to be the case.

fooster · 2026-02-28T03:16:16 1772248576

MIGHT be overreach to call this a supply chain risk?!? That is absolutely ludicrous.

timr · 2026-02-28T03:24:12 1772249052

To quote one of the greatest movies of all time: That’s just, like, your opinion, man.