Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Hoping a 30-A3B runs circles around a 117-A5.1B is a bit hopeful thinking, especially when you’re testing embedded knowledge. From the numbers, I think this model excels at agent calls compared to GPT-20B. The rest are about the same in terms of performance


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: