Tag: LRM

All the articles with the tag "LRM".

Projects

Build Large Reasoning Model (LRM) from scratch

Build Large Reasoning Model (LRM) from scratch

Build a Large Reasoning Model from scratch and turn non-reasoning LLMs into reasoning LLMs.

PythonPyTorchLRMRLGRPOFine-TuningMLX

PKM Notes

Aha Moment (Deep Seek R1)

DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Inference Time Compute Scaling

Reasoning Model Blueprint (SFT + RL)

System 1 Thinking

System 2 Thinking

Test Time Compute

Mike 3.0

Send a message to start the chat!

You can ask the bot anything about me and it will help to find the relevant information!

Try asking: