2025  8

March  2

Large Language Model Agents

2025-03-27 · 32 min · 6788 words · Yue Shui

Parallelism and Memory Optimization Techniques for Training Large Models

2025-03-01 · 60 min · 12755 words · Yue Shui

February  2

LLMs Alignment: DPO

2025-02-08 · 13 min · 2577 words · Yue Shui

Normalization in Deep Learning

2025-02-01 · 13 min · 2576 words · Yue Shui

January  4

Deep Reinforcement Learning (Ongoing Updates)

2025-01-31 · 34 min · 7096 words · Yue Shui

OpenAI o1 Replication Progress: DeepSeek-R1

2025-01-27 · 48 min · 10156 words · Yue Shui

Attention Mechanisms in Transformers: Comparing MHA, MQA, and GQA

2025-01-16 · 29 min · 6139 words · Yue Shui

Building Domain-Specific LLMs

2025-01-05 · 21 min · 4340 words · Yue Shui

2024  1

December  1

Building a Home Deep Learning Rig with Dual RTX 4090 GPUs

2024-12-21 · 10 min · 1988 words · Yue Shui

2021  1

April  1

Stock Price Prediction and Quantitative Strategy Based on Deep Learning

2021-04-21 · 65 min · 13702 words · Yue Shui