Tag: Python

All the articles with the tag "Python".

Projects

Education

BERT Fine-tuning

Improving a Japanese Text-Generation Model through Fine-Tuning BERT.

PythonPyTorchBERTTransformer

Education

Reinforcement Learning with OpenAI Gym

Implementation of Reinforcement Learning Algorithms with OpenAI Gym.

PythonPyTorchGymRLPPODQN

Personal

Build a GPT-like LLM from scratch

An attempt to build a GPT-style LLM from scratch with PyTorch. Covers the full architecture, pre-training loop, decoding strategies, and loading OpenAI GPT-2 weights.

PythonPyTorchTransformerGPT

DEV WIP

Personal

Build Large Reasoning Model (LRM) from scratch

Build a Large Reasoning Model from scratch and turn non-reasoning LLMs into reasoning LLMs.

PythonPyTorchLRMRLGRPOFine-TuningMLX

POST WIP

Personal

Mike 3.0: RAG Powered LLM Model for Chatbot Backend

RAG backend chatbot service built with FastAPI and pgvector. It features real-time document ingestion and LLM-powered responses.

RAGFastAPIPythonuvPostgreSQLpgvectorPytest+6

Personal

Pong (Reinforcement Learning) with WebSocket Backend

Training PPO agents to master Atari Pong using PyTorch and PettingZoo, served to browser via WebSockets.

RLPPOPyTorchFastAPIuvWebSocketPettingZooGymnasium+5

Personal

RAG vs LoRA: LLM Fine-Tuning Comparison for Mike 3.0

An experimental comparison between RAG and LoRA for building a personal portfolio chatbot.

LoRARAGMLXOllamaGemmaLangflowSupervised Fine-Tuning+4

Work

Rehabilitation Platform Backend

A multi-tenant B2B2C rehabilitation platform built with FastAPI and Python, implementing Clean Architecture.

PythonFastAPISQLAlchemyPydanticClean ArchitectureAlembicStripeTerraform+4

PKM Notes

Deep Learning

Neural Network Backbone

#Activation Function #Backpropagation #Cross-Entropy Loss #Gradient Descent +8

Tag: Python

Projects

BERT Fine-tuning

Reinforcement Learning with OpenAI Gym

Build a GPT-like LLM from scratch

Build Large Reasoning Model (LRM) from scratch

Mike 3.0: RAG Powered LLM Model for Chatbot Backend

Pong (Reinforcement Learning) with WebSocket Backend

RAG vs LoRA: LLM Fine-Tuning Comparison for Mike 3.0

Rehabilitation Platform Backend

PKM Notes

Neural Network Backbone

Chat with Mike 3.0