Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters (Paper)

2024-10-06 00:55:47 on Yannic Kilcher



