Deep Reinforcement Learning (Ongoing Updates)
2025-01-31 · 34 min · 7096 words · Yue Shui
OpenAI o1 Replication Progress: DeepSeek-R1
2025-01-27 · 48 min · 10156 words · Yue Shui
Attention Mechanisms in Transformers: Comparing MHA, MQA, and GQA
2025-01-16 · 29 min · 6139 words · Yue Shui
Building Domain-Specific LLMs
2025-01-05 · 21 min · 4340 words · Yue Shui