Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs (arxiv.org)
2 points by matt_d 15 hours ago