Hacker Newsnew | past | comments | ask | show | jobs | submit | aqme28's commentslogin

Modeling is hard. Some did, some didn’t. Generally we have historically underestimated climate change.

I tried to write a different "convince an AI" game about a year ago, but it was hard to work with, hard to figure out a business-model for, and more importantly-- it just wasn't very fun to play. Maybe there's a different scenario than the one I chose.

I tried prototyping a detective game also but like you said it ends up being a wrapper for a LLM and it’s a bit chaotic, not much more fun than just talking to the LLM in a web interface or whatever

You're misunderstanding. What this paper did-- Those three physicians set a ground truth to compare the AI response to.

What people in this thread are asking for-- Evaluate a set of doctors on those cases as well, and compare doctor vs AI accuracy.


If we're going to do this at all, it should be on the device, not the website/app. Parents flag their child's device or browser as under 18, and websites/apps follow suit. Parents get the control they're looking for, while service providers don't have to verify or store IDs. I guess it's just more difficult to pressure big dogs like google/apple/mozilla for this than pornhub and discord.

I’ve wondered if a age verification gig worker app could ever be viable: have people you can meet in person to prove your age without ever uploading any PII anywhere. Then issue a private key proving you are who you say you are.

Yeah vchip style with ratings, with a setting to hide unrated sites. A simple header. Done. Have all the browsers/os support it - easy peasy.

This sounds pretty reasonable to me. What am I missing?

It doesn’t come with a ton of PII you can sell to data brokers.

Ask your claude to make a cron to wake itself up. Done.

How do you enforce this? You have a system where the agent can email people, but cannot email "too many people" without a password?

It's not a perfect security model. Between the friction and all caps instructions the model sees, it's a balance between risk and simplicity, or maybe risk and sanity. There's ways I can imagine the concept can be hardened, e.g. with a server layer in between that checks for things like dangerous actions or enforces rate limiting

If all you're doing is telling an LLM to do something in all caps and hoping it follows your instructions then it's not a "security model" at all. What a bizarre thing to rely on. It's like people have literally forgotten how to program.

These people often never knew in the first place.

Thank you for saying this. I read this and was like: wtf?

Love agents, but the security risk is insane.


“AI changes everything!”

If I were the CEO of a place like Plaid, I'd be working night and day expanding my offerings to include a safe, policy-driven API layer between the client and financial services.

What if instead of allowing the agent to act directly, it writes a simple high-level recipe or script that you can accept (and run) or reject? It should be very high level and declarative, but with the ability to drill down on each of the steps to see what's going on under the covers?

Platforms could start to issue API tokens scoped for agents. They can read emails, write and modify drafts, but only with a full API token meant for humans it is possible to send out drafts. Or with confirmation via 2FA. Might be a sensible compromise.

A new The Terror? The one that came out some years ago was incredible, and very under-discussed I think.


The first one, the one based on the book, was great and did fly a good deal under the radar. But definitely one of those ones with a core fanbase that evangelized for it and good critical notices. Elsewhere in this discussion Jared Harris's role in Foundation has been mentioned; he's a major, consistent, and excellent fixture in The Terror.

Since they used the book's story already, they made a turn for the series to be an anthology of loosely thematically-similar stories (think American Horror Story). The basic setting of season 2 is Japanese internment during World War II in America, and it's from different writers than the first, and of course isn't adapting the novel anymore. It was much less popular both in terms of viewers and critics.

I'm a little surprised they think the brand still carries enough power to put another original story in there under its name for a season 3. It's also a bit of a double-edged sword: you do get name recognition and some built-in initial audience, but you're also taking on expectations and baggage from the original. This is a factor in season 2's tepid reception, and there have been other similar attempts to slide something unrelated in under an existing banner that backfired: True Detective Night Country comes to mind.


The saddest part to me is that their status update page and twitter are both out of date. I get a full 500 on github.com and yet all I see on their status page is an "incident with pull requests" and "copilot policy propagation delays."



Yes, the title is exaggerated. But I think a lot of you are underestimating the societal impact of roughly half a billion climate refugees. That kind of destabilization could easily lead to societal collapse, world war, etc...

The Syrian refugee crisis meant something like a million people fleeing into Europe and it caused massive political upheavals.


> But I think a lot of you are underestimating the societal impact of roughly half a billion climate refugees.

If North America and Europe enters an ice age, the preferred term would be "climate-expatriates"


And the company I worked for hired a full devops team to save us like 5 grand per month on Heroku, only to end up with a much worse developer experience.


This problem one doesn't have, if one pays attention to devops from the start, maybe keeping 1 or 2 capable devops people, who keep things lean. Problem is of course finding the capable ones with the right mindset to keep things as simple and lean as possible.

The result of suddenly needing to hire devops should be to get a convenient setup, but then do you really still need the whole devops team? And if you don't, then hiring them for limited time might come at a cost (hiring freelancers or consultants).


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: