Hacker News

by zeropon 11/15/2024, 10:06 AMwith 0 comments

LLM inference with tensor parallelism on a CPU