Deep Reinforcement Learning (Ongoing Updates)
Created: 2025-01-31 · Updated: 2025-01-31 · 34 min · 7230 words · Yue Shui
OpenAI o1 Replication Progress: DeepSeek-R1
Created: 2025-01-27 · Updated: 2025-01-27 · 48 min · 10182 words · Yue Shui
Attention Mechanisms in Transformers: Comparing MHA, MQA, and GQA
Created: 2025-01-16 · Updated: 2025-01-16 · 29 min · 6141 words · Yue Shui
Building Domain-Specific LLMs
Created: 2025-01-05 · Updated: 2025-01-05 · 21 min · 4340 words · Yue Shui