Computing Life - Agentic AI

11

Jan 25, 2026Computing

从过程确定性到结果确定性：AI 时代的另一种安全感

用Claude Code替代API调用做翻译任务：利用agentic loop实现自我纠错，用evaluation-first定义验收标准，从过程确定性转向结果确定性获得新的安全感。

Agentic AI

EN · 中

12

Jan 25, 2026Computing

From Process Certainty to Outcome Certainty: A Different Kind of Confidence in the Age of AI

Why handing translation to Claude Code works better than calling APIs directly - leveraging the agentic loop, evaluation-first mindset, and the ecosystem's runtime layer to achieve outcome certainty over process certainty.

Agentic AI

EN · 中

13

Oct 17, 2025Computing

怎么让AI不偷懒：为Codex构建系统性的Wide Research能力

AI"偷懒"的本质是LLM输出长度限制导致的注意力分散。Wide Research通过多轻量模型并行处理子任务、主LLM汇总的方式解决，分享为Codex构建该能力的设计思路。

AI Agentic AI

EN · 中

14

Oct 17, 2025Computing

How to Stop AI from Slacking Off: Building Systematic Wide Research Capabilities for Codex

Why AI slacks off on large tasks: LLM output length limitations cause attention drift. Wide Research solves this by parallelizing with lightweight models, then aggregating results with a primary LLM.

AI Agentic AI

EN · 中

15

Oct 07, 2025Computing

为什么OpenAI Apps SDK对MCP的支持反而是MCP的危机

分析OpenAI Apps SDK通过_meta域绕过context window的做法如何违背MCP设计哲学，以及协议分裂成不同dialect的潜在危机。

Agentic AI

EN · 中

16

Oct 07, 2025Computing

Why OpenAI's Apps SDK Signals a Crisis for MCP

Analyzing how OpenAI's Apps SDK extension with _meta field violates MCP's design philosophy, creating dialects that may fragment the standard like SQL or CSS.

Agentic AI

EN · 中

17

Jul 12, 2025Computing

Kimi K2：超越聊天框的深度评测

在真实编程和调研任务中评测Kimi K2的Agentic能力：执行韧性出色，适合作为信息采集前端；但工具调用稳定性和生态适配仍有提升空间。

Agentic AI

EN · 中

18

Jul 12, 2025Computing

Kimi K2: An In-Depth Review Beyond the Chatbox

Testing Kimi K2 as an agentic model in real coding and research tasks: excellent execution resilience makes it ideal for information gathering, but tool integration friction needs improvement for production use.

Agentic AI

EN · 中

19

May 08, 2025Computing

上一篇文章写崩了，但不是AI的锅

AI帮忙写的文章空中楼阁，不是AI写得不好，是我没喂够context。作为Manager要审核输出，作为Enabler要提供长期积累。判断界限才是协作的开始。

Agentic AI

EN · 中

20

May 08, 2025Computing

My Last Article Was a Flop, But It Wasn't AI's Fault

An AI-assisted article failed not because AI wrote poorly, but because I didn't feed it enough context. As Manager, verify output; as Enabler, provide your accumulated materials. Defining boundaries is where collaboration begins.

Agentic AI

EN · 中

Agentic AI

Latest

54 posts · since 2012

从过程确定性到结果确定性：AI 时代的另一种安全感

From Process Certainty to Outcome Certainty: A Different Kind of Confidence in the Age of AI

怎么让AI不偷懒：为Codex构建系统性的Wide Research能力

How to Stop AI from Slacking Off: Building Systematic Wide Research Capabilities for Codex

为什么OpenAI Apps SDK对MCP的支持反而是MCP的危机

Why OpenAI's Apps SDK Signals a Crisis for MCP

Kimi K2：超越聊天框的深度评测

Kimi K2: An In-Depth Review Beyond the Chatbox

上一篇文章写崩了，但不是AI的锅

My Last Article Was a Flop, But It Wasn't AI's Fault