Hacker Newsnew | past | comments | ask | show | jobs | submit | numbers's commentslogin

I've stopped trusting these "trust me bro" benchmarks and just started going to LM Arena and looking for the actual benchmark comparisons.

https://arena.ai/leaderboard/code


I doubt this is representative of real world usage. There is a difference between a few turns on a web chatbot, vs many-turn cli usage on a real project.

This is not any better of a benchmark

it seems like Slate might be trying that but there's no real cars from them yet so they're just renders at this point. but yes, same concept but printers is my wish.

They have plenty of running/driving mules out there already:

https://www.youtube.com/watch?v=L6_9_HHLOSY

(Not for sale yet though.)



Yes but not a pickup please

Pickups are a fine place to start. If they’re successful they’ll add other kinds of cars over time. Building a whole new car company is extremely risky. Picking the first model correctly is extremely important. I hope they got it right. My gut says plenty of reasons to think they did.

pickup culture sucking the life out of our car industry. give me real cars

But they have merch! Hats, apparel!

Why are you mad that they're trying to build brand recognition?

I get there's been plenty of vaporware cars in the past but by all signs Slate is making real progress towards delivering actual vehicles.


does anyone have recommendations on replacing CC with something else for around $20-30 / month?

I left cursor and went back to VS Code b/c the editing experience is basically the same and cursor was adding more and more agentic features which don't appeal to me. I'm a happy Claude Code user and having my code separate from the planning/brainstorming part of the task makes implementing its own step with me driving/writing the code.


but you'd wait only long enough for a version that's good enough, not forever.


because they're brand fonts, none of them feel great to write more than a headline with


Is your phone connected to the router through a cable or wirelessly?


They can do both - cable or bluetooth. Don't think wifi


I want something like this for Plex, where I can just turn it on and have some of my favorite shows play random episodes, and I wish Plex made that easy to do.


I was using Wisper Flow and had a pretty bad experience with their support related to billing and so I started building my own version of a speech to text app, it's very doable with Parakeet and Whisper models available now. I've got the app working on mac and it's been so much easier to make progress on it with AI available now.

I'm not sure I'll be putting it out there because it feels like there's already 100s of these apps out there so I don't feel strongly about it.


interestingly, Claude has been doing this for me a lot but most often just saying this like "Looks like your coworker was misunderstanding this feature..." not really shifting blame but more like pointing out things


Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: