LLM from scratch, part 28 – training a base model from scratch on an RTX 3090

📅 2025-12-02