Researchers automate the design of LLM reasoning strategies and cut token usage by 69.5%
Test-time scaling (TTS) has emerged as a proven method for improving the performance of large language models in real-world applications by giving them additional compute during inference. However, TTS strategies have historically been hand-crafted, relying heavily on human…




