Self-Explore to Avoid the Pit: Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards
CoRR (2024)
Keywords
Language Modeling, Topic Modeling, Pretrained Models, Syntax-based Translation Models, Sequence-to-Sequence Learning