XIAOMAI NEWS
FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI