从全能医院到智慧分诊:用医院作比喻理解 DeepSeek 的模型进化
用医院的分诊系统作比喻,通俗解释DeepSeek v2和v3的混合专家架构原理,包括专家分工、路由机制、负载均衡挑战和跨节点通信问题。
Computing Life · An engineering notebook
Long-form notes on agentic systems, engineering judgment, astrophotography, hardware, coffee, and the tools that make a life easier to inspect and improve.
用医院的分诊系统作比喻,通俗解释DeepSeek v2和v3的混合专家架构原理,包括专家分工、路由机制、负载均衡挑战和跨节点通信问题。
Explains DeepSeek v2 and v3's Mixture of Experts architecture using a hospital analogy, covering expert specialization, routing, load balancing challenges, and cross-node communication.
以一次debug经历说明三个AI管理技巧:克制抢键盘的冲动、提供可视化上下文而非模糊抱怨、授之以渔而非授之以鱼——即从IC到Manager的思维蜕变。
Uses a debugging session to illustrate three key AI management skills: resisting the urge to take over, providing visual context instead of vague complaints, and teaching methodology rather than giving answers—the shift from IC to manager mindset.
通过分离Planner和Executor、强制文档沟通、用o1当Planner三项改造,解决Cursor鬼打墙问题,实现质量显著提升的多智能体系统。
How separating Planner and Executor roles, enforcing document-based communication, and using o1 as Planner transformed Cursor from a simple assistant into a multi-agent system.
AI课程运营一年反思:打破学员对AI的刻板印象、手把手辅导带来可复制解决方案、长期更新机制保持技术前沿,以及认知升级到应用落地的完整学习路径。
以ZWO产品策略为例详解漏斗分析思维:如何识别用户流失瓶颈、降低天文摄影门槛,以及分布式设备在科研中的潜在价值。
Learn funnel analysis through ZWO's product strategy: how ASI Air and Seestar break down barriers in astrophotography, and how distributed devices could transform scientific research.
Agentic AI为什么常常只能做到七八成:自我迭代的反馈环被打破,缺乏感知能力和主观评价标准,以及如何补足这些能力缺口。