Rich Washburn
Sep 23 · 5 min read

Improving Model Inference Beyond GPUs: AI’s Next Frontier

When we think of AI, especially large language models (LLMs) like ChatGPT, most of us picture massive networks of GPUs powering the...