Test Time Compute

Test Time Compute (also known as Inference Time Compute) refers to the amount of computational resources used by a model during the inference stage (when it is generating an answer).

Key Concept

Example

In the second case, the model uses significantly more resources (7-8 times more) to generate the answer, which corresponds to “thinking more” to arrive at a better result.

    Mike 3.0

    Send a message to start the chat!

    You can ask the bot anything about me and it will help to find the relevant information!

    Try asking: