Nice! I'm interested in your cubecl-wgpu patches. I've been struggling to get lo... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		scronkfinkle 24 days ago \| parent \| context \| favorite \| on: Rust implementation of Mistral's Voxtral Mini 4B R... Nice! I'm interested in your cubecl-wgpu patches. I've been struggling to get lower than FP32 safetensor models working on burn, did you write the patches to cubecl-wgpu to get around this restriction, to add support for GGUF files, or both? I've been working on something similar, but for whisper and as a library for other projects: https://github.com/Scronkfinkle/quiet-crab

adefa 22 days ago [–]

The cubecl-wgpu were only needed to reduce the number of kernel workgroups, otherwise I was getting errors in WASM.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact