Tag: LRM

All the articles with the tag "LRM".

Projects

DEV WIP
Build Large Reasoning Model (LRM) from scratch
Personal

Build Large Reasoning Model (LRM) from scratch

Build a Large Reasoning Model from scratch and turn non-reasoning LLMs into reasoning LLMs.

PythonPyTorchLRMRLGRPOFine-TuningMLX

PKM Notes

LLM

Aha Moment (Deep Seek R1)

#LRM
DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper LLM

DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

#LRM
LLM

Inference Time Compute Scaling

#LRM
LLM

Reasoning Model Blueprint (SFT + RL)

#LRM
LLM

System 1 Thinking

#LRM
LLM

System 2 Thinking

#LRM
LLM

Test Time Compute

#LRM

    Mike 3.0

    Send a message to start the chat!

    You can ask the bot anything about me and it will help to find the relevant information!

    Try asking: