What a weird time for our industry. On one hand, small teams have never been able to move faster than right now.
On the other, the economy and market conditions are brutal for the little guys. Incumbent behemoths hoovering up value, talent and financing.
Instead of shaking things up as usual when a major paradigm shift hits, AI has mostly been a centralizing, consolidating force. Not that I was expecting it to be otherwise, but it's certainly dismaying to witness.
Or am I being too pessimistic / glorifying the past?
It's easier than ever to make your own furniture. IKEA is bigger than ever.
It's easier than ever to publish a video game. Steam is bigger than ever.
It's easier than ever to 3D-print tractor parts. John Deere is bigger than ever.
It's easier than ever to switch to solar power. The petroleum industry is bigger than ever.
One person reverse-engineered Coca-Cola, made an exact taste-alike and published the formula. You can make some at home. Coca-Cola is bigger than ever.
The hidden cost of competing in these industries is insane. It's so hard to build a physical product that can compete against a giant like IKEA. You have to build something with less R&D, less automation, and less infrastructure, and you're going to sell fewer units, and all of that needs to be price-competitive against something that's made on a production line by a team of experienced engineers and sold to millions at thin margins.
In a reductive sense, yeah it's a bit silly. But zooming out, I can understand. Sucks to have your hand forced. Sucks to be let down. Sucks to watch something that was great fall from grace.
Thanks for Ghostty, been my daily driver for a while now. Hope the rest of your day/week goes much better!
Perhaps you could generate a few tokens before the entire model is downloaded, but since every token takes a potentially different "path" through an MoE model, you'd still need to wait for the entire download before getting deeper than a handful of tokens... which is not really a UX improvement imo.
Even at its worst, it's a minor UX improvement compared to having to download everything prior to getting to the first token. Ultimately we will complete the download, but we can still pick the best priority so that the first handful of tokens goes through.
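To see why only "a handful of tokens" already touches most of an MoE model, here's a toy simulation. All the numbers (layer count, expert count, top-k) are made up for illustration, and experts are routed uniformly at random, which real routers are not:

```python
import random

# Hypothetical MoE shape -- illustrative numbers, not any real model's.
NUM_LAYERS = 32        # transformer layers, each with its own expert pool
EXPERTS_PER_LAYER = 64
TOP_K = 2              # experts activated per token per layer

random.seed(0)
downloaded = set()     # (layer, expert) pairs we'd be forced to fetch

for token in range(1, 21):
    for layer in range(NUM_LAYERS):
        # Pretend the router picks TOP_K experts uniformly at random.
        for expert in random.sample(range(EXPERTS_PER_LAYER), TOP_K):
            downloaded.add((layer, expert))
    coverage = len(downloaded) / (NUM_LAYERS * EXPERTS_PER_LAYER)
    if token in (1, 5, 10, 20):
        print(f"after {token:2d} tokens: {coverage:.0%} of experts touched")
```

Under these assumptions coverage climbs fast but doesn't hit 100% after 20 tokens, which is roughly the situation both of us are describing: a prioritized download helps for the first few tokens, but you still end up needing nearly everything.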
How well does it work with Godot? Engines like Unity and Godot are very focused on using the editor UI, so I've always wondered if there's any better workflow than generating code snippets. Unless you're going full .NET/GDExtension...
> I would also expect to see it taking exponentially longer to process a prompt. I don't believe LLMs work like that.
Try this out using a local LLM. You'll see that as the conversation grows, your prompts take longer to execute. It's not exponential but it's significant. This is in fact how all autoregressive LLMs work.
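Back-of-the-envelope sketch of why: the attention term of prompt processing grows quadratically with context length, since each token attends to everything before it. The formula below is deliberately crude (attention only, ignoring heads, MLPs, and constant factors), with made-up layer/width numbers:

```python
def attn_flops_prefill(n_tokens, n_layers=32, d_model=4096):
    # Rough attention-only FLOP count for prefilling an n-token prompt.
    # Each of n tokens attends to up to n positions: the n*n term.
    # Illustrative only; real cost models have more terms.
    return n_layers * 2 * n_tokens * n_tokens * d_model

for n in (1_000, 10_000, 100_000):
    print(f"{n:>7} tokens: {attn_flops_prefill(n):.2e} attention FLOPs")
```

So a 100x longer conversation costs ~10,000x more in that term: polynomial, not exponential, but very noticeable on local hardware.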
Yesterday I was playing around with Gemma4 26B A4B with a 3-bit quant, sizing it for my 16GB 9070XT:
Total VRAM: 16GB
Model: ~12GB
128k context size: ~3.9GB
At least I'm pretty sure I landed on 128k... might have been 64k. Regardless, you can see the massive weight (ha) of the meager context size (at least compared to frontier models).
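For anyone curious where a number like that comes from, the standard KV-cache estimate is keys + values per layer, per KV head, per token. The GQA shape below is hypothetical, picked only so the result lands near the ~3.9GB figure; it's not the actual architecture of the model above:

```python
def kv_cache_bytes(context_len, n_layers, n_kv_heads, head_dim, bytes_per_elem=2):
    # 2 = keys + values; bytes_per_elem=2 assumes fp16 cache.
    return 2 * n_layers * n_kv_heads * head_dim * bytes_per_elem * context_len

# Hypothetical GQA shape chosen to land near the figure above.
gb = kv_cache_bytes(131_072, n_layers=30, n_kv_heads=4, head_dim=64) / 1e9
print(f"KV cache at 128k context: ~{gb:.1f} GB")
```

Halving the context roughly halves the cache, which is why dropping from 128k to 64k frees so much VRAM.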
> As a user, I _expect_ the cost of resuming X hours/days later to be no different to resuming seconds or minutes later.
As an informed user who understands his tools, I of course expect large uncached conversations to massively eat into my token budget, since that's how all of the big LLM providers work. I also understand these providers are businesses trying to make money and they aren't going to hold every conversation in their caches indefinitely.
I'd hazard a guess that there's a large gulf between the proportion of users who know as much as you do and the total number using these tools. The fact that a message can perform wildly differently (in either cost, or behaviour if using one of the mitigations) based on whether I send it at t vs t+1 seems like a major UX issue, especially given t is very likely not exposed in the UI.
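To make the cost gap concrete, here's a toy pricing model. The per-million-token rates are placeholders I made up, not any provider's actual pricing, and real cache semantics (partial hits, write costs, TTLs) are more involved:

```python
def message_cost(context_tokens, new_tokens, cache_hit,
                 input_price=3.00, cached_price=0.30):
    # Prices are dollars per million tokens -- made-up placeholders.
    # On a cache hit, the resumed context is billed at the cached rate;
    # on a miss (cache expired), it's all billed as fresh input.
    context_rate = cached_price if cache_hit else input_price
    return (context_tokens * context_rate + new_tokens * input_price) / 1e6

ctx, new = 150_000, 500
print(f"sent at t   (cache warm): ${message_cost(ctx, new, True):.3f}")
print(f"sent at t+1 (cache cold): ${message_cost(ctx, new, False):.3f}")
```

Same message, same conversation, several times the cost depending purely on when you hit send, and nothing in the UI tells you which case you're in.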
Haven't had a chance to test 4.7 much, but one of my pet peeves with 4.6 is how eager it is to jump into implementation. Though maybe 4.7 is smarter about this now.