XIAOMAI NEWS
Shazeer et al. (2024): you are overpaying for inference by >13x — Mews