Hosted on MSN1mon
Microsoft says 'rStar-Math' demonstrates how small language models (SLMs) can rival or even surpass the math reasoning capability of OpenAI o1 by +4.5%According to the research paper published on arXiv.org: "rStar-Math achieves this by exercising ... which the policy SLM and PPM are built from scratch and iteratively evolved to improve reasoning ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results