Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Or if you want a large model but don’t need high performance, get a Mac with 128GB UMA.


How many tokens/s would you get in such a setup?


This Reddit thread says an m3 max 128GB gets 23 tokens/sec with deepseek r1 32B, and 4 tokens / sec with 70b: https://www.reddit.com/r/LocalLLaMA/comments/1i69dhz/deepsee...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: