HNNewShowAskJobs
Built with Tanstack Start
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking(arxiv.org)
39 points by roboboffin 6 months ago | 7 comments
  • s-macke6 months ago

    > Notably, no self-reflection training data or prompt was included, suggesting that advanced System 2 reasoning can foster intrinsic self-reflection.

    They suggest, that self-reflection is an emergent phenomena of reasoning. Impressive. Can't wait to see the code.

  • throwaway815236 months ago

    Abstract is impressive. I'm surprised this post hasn't gotten more attention.

    • roboboffin6 months ago |parent

      Yeah, that's what I thought.

  • helltone6 months ago

    Off topic but how is MCTS usually implemented efficiently? It has a branching structure that doesn't seem parallelizable (GPU).

  • fabmilo6 months ago

    I was just about to submit this link and redirected me to this page. I am shocked that it received only four comments. If you are working in the LLMs/Agent space ( you are, right?) and you don't understand the significance of this paper, you are set for failure.

  • dantodor6 months ago

    The repo gives 404?

    • funcDropShadow6 months ago |parent

      The abstract says the code will be available.