Thinking Preference OptimizationWang Yang,Hongye Jin,Jingfeng Yang, Vipin Chaudhary,Xiaotian HanCoRR(2025)Cited 0|Views1AI Read ScienceMust-Reading TreeExampleGenerate MRT to find the research sequence of this paperChat PaperSummary is being generated by the instructions you defined