逐行对照 MiniMind 源码精读、并延伸到大模型技术体系的中文学习笔记 —— 预训练 / SFT / DPO / PPO / GRPO、训练机制、MiniMind2→3 版本对照、真实实验证据。
Related
veryCoolTimo/imagegen-skills: Turn a one-line idea into a production-grade, copy-paste-ready AI image prompt for posters, landing and UI mockups, ads, game art, and logos. A Claude Code skill optimized for gpt-image-2. MIT.
Turn a one-line idea into a production-grade, copy-paste-ready AI image prompt for posters, landing and UI mockups, ads, game art, and logos. A Claude Code skill optimized for gpt-...
cheerupzhu/PPM_VLA: This work presents an efficient physics-based Vision-Language-Action (VLA) approach that integrates Vision-Language Models (VLMs) with diffusion models to generate trajectory predictions with enhanced physical realism.
This work presents an efficient physics-based Vision-Language-Action (VLA) approach that integrates Vision-Language Models (VLMs) with diffusion models to generate trajectory predi...
jongwonkim987/League_of_Legends_Skillshot_Dodger: AI-powered skillshot dodger, last-hit assistant, and map hack for LoL. Dodge skillshots perfectly, never miss a CS, and see invisible enemies.
AI-powered skillshot dodger, last-hit assistant, and map hack for LoL. Dodge skillshots perfectly, never miss a CS, and see invisible enemies.