Hacker Newsnew | past | comments | ask | show | jobs | submit | fxtentacle's commentslogin

Maybe this is a UK problem? Or the author just doesn't have kids? (and has no experience?)

German hospital beds for giving birth are at a 45 degree angle, which to me looked like a good compromise between "the mother can safely take a nap when she is tired" and "gravity will help you". Also, they have these thingys to put your legs up, so the overall posture is pretty close to squatting. (But with a back rest to prevent you from falling over if you're sleepy.) And modern German hospitals also have a bathtub with handrails to hang from above. And they have chairs with a hole in them. There's like a lot of options to choose from. But the nurses said that, statistically, most women choose the 45-degree-bed anyway. My guess would be because it looks the most comfortable.


Keep guessing. There are other reasons most women end up in the beds. (I’ve worked L&D and am a woman.)

If that tracker is using paid tokens, as opposed to the regular subscription, then there's no financial incentive for Antrophic to degrade their thinking, so their benchmark likely would not be affected by the cost-cutting measures that regular users face.

Also, it's probably very easy to spot such benchmarks and lock-in full thinking just for them. Some ISPs do the same where your internet speed magically resets to normal as soon as you open speedtest.net ...


.. which confirms all of my stereotypes. Looks like the AWS engineer who reported it used a m8g.24xlarge instance with 384 GB of RAM, but somehow didn't know or care to enable huge pages. And once enabling them, the performance regression disappears.

Because such settings aren’t obvious to those not familiar with them. LLMs should make discoverability easier though

Honest question: what's the value of running the benchmark and reporting a performance regression if the author is not familiar with basic operation of the software? I'd argue that not understanding those settings disqualifies you from making statements about it.

The performance was reduced without a settings change. That is still a regression even if huge pages mitigates the problem.

I'd be curious to know if there's still a regression with hugepages turned on in older kernels.

If you are benchmarking something and the only changed variable between benchmarks is the kernel, that is useful information. Even if your environment isn't correctly setup.


Some software clearly wants hugepages disabled, so it's not always the slam dunk people seem to be making it out to be.

ie Redis:

https://redis.io/docs/latest/operate/oss_and_stack/managemen...


It’s more of a hostage situation.

Those were transmitted offline so they didn't have authentic NVENC H264 compression artifacts. Never before have you seen it with 260 Mbps ;)

/s


To me, the whole thing sounds like cheating in benchmarks.

Intel built a tool that will only activate for a specific benchmark - but not for real-world software which accomplishes similar things - and then the tool will replace generic bytecode with a (most likely) handcrafted and optimized variant for running this specific benchmark on this specific CPU. That means BOT will only boost the benchmark score, but not help at all with the end-user workflows that the benchmark is trying to emulate. Thereby, Intel's BOT makes the benchmark score misleading, which is why Geekbench is flagging them.


This is what I thought, too. Watch them dig a hole for themselves while you take a vacation. And then when they fail to debug the mess, charge them by the day for your expertise, because it’ll be necessary for untangling that mess.

Fully agree. The low-res makes this unnecessarily hard to navigate. Which is a downside if your core gameplay is to teach players to navigate in a challenging environment.

I guess this implies high quality WebRips for years to come, because blocking Intel Laptops isn’t feasible for any streaming service. And from what I heard, control over SGX would let you extract the WideWine etc. key store without!!! the OS knowing about it. So this is like a secret debug access starting at OS&TPM level.

Widevine L1 has never been used on Windows machines anyways. PlayReady SL3000 works on Windows machines, but that’s broken already anyways.

Use AI! It'll make you a 10x engineer! (cost-wise) /s

I've recently had the displeasure of Opus 4.6 hallucinating an API. It would have been great if that API had existed, but it did not. Still, it then looped until I manually terminated it while trying to make tests pass. In my case, I used up about $12 of usage in 30 minutes. My guess would be mostly through the (pretty verbose) thinking tokens.

But it's not just Anthropic. I had the same issue with Gemini 3.1 Pro.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: