AC
by Di Zhang, Weida Wang, Junxian Li, Xunzhi Wang, Jiatong Li, Jianbo Wu, Jingdi Lei, Haonan He, Peng Ye, Shufei Zhang, Wanli Ouyang, Yuqiang Li, Dongzhan Zhou • Published June 3, 2025 at 04:00 AM
Research

Control-R: Towards controllable test-time scaling

🔬 Research 🤖 AI-Enhanced

📖 Article Preview

🤖 AI Summary

The paper introduces Reasoning Control Fields (RCF), a novel test-time method that injects structured control signals to guide and adjust reasoning effort in Large Reasoning Models, addressing issues of underthinking and overthinking in long chain-of-thought reasoning. It also presents the Control-R-4K dataset with annotated reasoning processes and proposes a Conditional Distillation Finetuning (CDF) approach, achieving state-of-the-art results on benchmarks like AIME2024 and MATH500 with controllable reasoning capabilities

Read the Complete Article

Get the full story with in-depth analysis, expert insights, and comprehensive coverage from the original source.

Read Full Article
🔒 Secure Link
🌍 Original Source
📊 Verified Content
Fast Loading

Stay Informed

Get the latest AI insights and breakthroughs delivered to your inbox weekly.

Follow Our Updates

Join the conversation and stay connected with our AI community.

We respect your privacy. Unsubscribe at any time. Privacy Policy