AutoTTS: Automating Inference Strategies, Cutting LLM Token Costs by 69.5%
The new AutoTTS framework enables large language models to automatically search for optimal inference strategies, cutting token consumption by up to 69.5% while enhancing problem-solving performance.
Sources venturebeat.com