Compiling LLMs into a MegaKernel: A path to low-latency inference (zhihaojia.medium.com)
314 points by matt_d 9 months ago | 76 comments
