Efficient LLM: Bandwidth, Compute, Synchronization, and Capacity are all you need (arxiv.org)
5 points by matt_d 14 hours ago