Supervised Fine-Tuning (SFT)

Supervised Fine-Tuning (SFT), often called Instruction Tuning, is the process of training a pre-trained Large Language Model on a smaller, high-quality dataset of labeled examples (typically pairs of Instruction \rightarrow Response) to adapted it for a specific task or behavior.

Purpose

Process

  1. Input: A pre-trained Foundational Model (bases model).
  2. Data: A dataset of prompts (inputs) and desired completions (labels).
    • Example: {"prompt": "Summarize this article...", "completion": "The article discusses..."}
  3. Training: The model parameters are updated to minimize the difference between its generated output and the labeled completion.

    Mike 3.0

    Send a message to start the chat!

    You can ask the bot anything about me and it will help to find the relevant information!

    Try asking: