This is most likely an inference serving problem, i.e. capacity and latency: Opus X has always responded quickly through the API, while the latest GPT models have always responded slowly.
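If you wanted to sanity-check that, a rough way is to time a few identical requests against each provider's endpoint and compare averages. A minimal sketch below, assuming hypothetical placeholder URLs and payload shape (not either provider's real API):

    # Rough comparison of round-trip latency between two model APIs.
    # The endpoint URLs and request body here are hypothetical placeholders.
    import time
    import requests

    ENDPOINTS = {
        "fast-model": "https://api.fast-provider.example/v1/chat",  # placeholder
        "slow-model": "https://api.slow-provider.example/v1/chat",  # placeholder
    }
    PAYLOAD = {"messages": [{"role": "user", "content": "Say hi."}]}

    def time_request(url: str, payload: dict) -> float:
        """Return wall-clock seconds for one request/response cycle."""
        start = time.perf_counter()
        requests.post(url, json=payload, timeout=60)
        return time.perf_counter() - start

    for name, url in ENDPOINTS.items():
        # Average a few calls to smooth over transient queueing delays.
        samples = [time_request(url, PAYLOAD) for _ in range(3)]
        print(f"{name}: {sum(samples) / len(samples):.2f}s avg over {len(samples)} calls")

A consistently large gap between the two, independent of prompt, would point at serving capacity rather than anything prompt-specific.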



