Was the on-device local LLM stack that you tried llama.cpp or something like MLC? I've seen better performance with MLC than with llama.cpp in the past, but it has been at least a year since I tested iPhones and Androids for local inference.
I looked into using https://github.com/mybigday/llama.rn. Ultimately, it was too slow to be conversational. The added load of WebGL rendering likely wouldn't have helped either.