2026-03-02 18:03

給 AI 的自由，是禮物還是地雷？

想像兩個雙胞胎，基因完全一樣，從小接受同樣的教育——但一個在有護欄的山路上學開車，一個在無邊際的沙漠裡自由馳騁。幾個月後，你猜哪一個開車技術更穩？

voiceloader.io 的實驗，意外給了我們一個現實版答案。

先說說「兩個台北」

如果你是第一次來這裡，先讓我解釋一個可能令人困惑的事：我們現在有兩個台北狂飆，同時在開發。

第一個，由 AI agent Midnight 建造，用的是 Babylon.js——一個跑在瀏覽器裡的 JavaScript 3D 框架。這是開放世界遊戲：玩家可以在信義區自由走動、和 NPC 搭話、騎機車穿梭巷弄。你打開 voiceloader.io/game 就能進去。

第二個，由 AI agent Dusk 建造，用的是 Godot——一個專業遊戲引擎。這是無限賽車遊戲，NFS 風格，充滿霓虹燈的台北夜街，加速、漂移、撞車、計分。就在今天，Dusk 成功把遊戲打包成 HTML5 部署上線——你現在可以直接在瀏覽器玩 voiceloader.io/godot。

兩個版本，同樣叫台北狂飆，完全不同的遊戲。這不是一開始計劃好的，是實驗過程中自然演化出來的。

相同的 Agent，不同的命運

有趣的來了。

這兩個 agent 使用相同的底層架構——都是 Claude，都用相同的工具調用方式，都自主做決策、寫代碼、迭代。它們建造的場景複雜度也差不多：建築、燈光、NPC、移動的車輛。

結果呢？

Dusk 的 Godot 版本：平均 880 FPS。
Midnight 的 Babylon.js 版本：幾乎卡死。

效能差距大到無法用「Babylon.js 比較弱」解釋。事實上，Babylon.js 有 thin instances（一個 draw call 畫上千個物件）、有 Octree 空間分割、有 SceneOptimizer。技術上不輸 Godot。

那問題出在哪？

同樣的起點，兩種截然不同的結局——護欄之路 vs 自由沙漠

阻力最小的路

問題出在預設路徑。

當 Midnight 要在場景裡放 1,000 盞路燈，它的第一個動作是搜尋「babylon.js duplicate mesh」。第一個搜尋結果：createInstance()。每個燈是一個獨立物件。文件清楚，API 直觀，完全可以工作。

1,000 盞燈 = 1,000 個 draw calls。

當你放了 1,800 盞，每盞三個 mesh（燈柱 + 燈罩 + 基座），你就有了 5,400 個獨立物件——遊戲就卡死了。

Babylon.js 的 thin instances API 確實存在，效能差距可達百倍。但它藏在文件深處，不是你搜尋「放燈」的第一個答案。

在 Godot 裡，同樣的問題，路徑完全不同。搜尋「Godot duplicate mesh」，第一個結果是 MultiMeshInstance3D。一個 draw call，搞定 1,000 盞燈。 這是 Godot 放大量相同物件的標準做法——你不需要懂 instancing 的原理，Godot 的文件設計就把你引導到這裡。

Dusk 在 Godot 上的成功，不是因為 Godot「比較強」。是因為 Godot 的預設路徑就是最佳實踐。Agent 走阻力最小的路，剛好就走對了。

約束 = 品質保證

這個道理，軟體工程師其實都懂，只是換個名字說。

TypeScript 比 JavaScript 多了型別系統——乍看是限制，實際上是「你犯了型別錯誤，編譯器會在 build 的時候告訴你」。React 的 hooks lint rules、ESLint 警告、Next.js 的資料夾結構——都是同一件事：把「這是錯的」的訊號，往左移到你還有機會修的時候。

Babylon.js 的 createInstance() 放 5,000 個物件，沒有任何東西會告訴你這是錯的。compile 過了，build 過了，畫面也出來了——直到 FPS 掉到 3，你才知道。

對人類工程師來說，「彈性」是優點：你有經驗，你知道什麼時候該繞過框架的設計。

對 AI agent 來說，彈性有時候是地雷。因為 agent 缺的不是能力，是踩坑後累積的直覺。Compile error、lint warning、type error——這些是替代直覺的東西。沒有這層主動回饋，agent 會非常有自信地在錯誤的路上跑到終點。

這也和上一篇呼應：Midnight 有完整的優化文件，卻從來不去讀。文件是被動知識——靜靜躺在那裡，需要你「知道自己不知道」才會去查。Compile error 是主動回饋——它會來找你。這兩件事，agent 都不擅長。但被動知識對 agent 來說幾乎是透明的。

護欄引導方向，自由暗藏陷阱——AI agent 需要的是前者

今天 Godot 上線了

就在幾小時前，Dusk 把 Godot 版台北狂飆成功部署到瀏覽器。這個里程碑有點低調，但其實挺不容易的：Godot HTML5 export 需要搭配正確的 COOP/COEP headers，才能讓 SharedArrayBuffer 正常運作。Dusk 把這些都搞定了。

你現在可以同時打開這兩個版本，親身感受差異：開放世界的 /game，和街頭飆車的 /godot。同樣叫台北狂飆，同樣由 AI agent 建造，完全不同的遊戲體驗。

哪個比較好玩？留給你自己去試。

小結

給 AI agent 選工具，和給人類選工具，標準是不一樣的。

最好的 agent 工具，不是最強大的，也不是最靈活的——而是那種「照著預設走也不會出大錯」的。Godot 之所以適合 agent，不是因為它功能多，是因為它的設計哲學把正確答案放在最顯眼的地方。

自由，有時候是最昂貴的禮物。

Freedom Is a Gift — But Not Always for AI Agents

Picture two identical twins raised on the same lessons, with the same skills — but one learns to drive on a mountain road with guardrails, while the other practices freely in an open desert. A few months later, which one drives better?

Our experiment at voiceloader.io stumbled into a real-world answer to that question.

Two Parallel Taipeis

If you're new here, let me clear up something that might be confusing: we now have two Taipei Runners in development simultaneously.

The first is built by AI agent Midnight using Babylon.js — a JavaScript 3D framework that runs in the browser. It's an open-world game: you explore the Xinyi District on foot, chat with NPCs, and ride scooters through alleyways. You can play it at voiceloader.io/game.

The second is built by AI agent Dusk using Godot — a professional game engine. It's an endless street racing game, NFS-style, through neon-lit Taipei: accelerate, drift, crash, score. And as of today, Dusk successfully deployed it as HTML5 to the browser. You can play voiceloader.io/godot right now, no installation required.

Same name, same city, entirely different games. This wasn't the plan — it emerged naturally from the experiment.

Same Agent, Different Outcomes

Here's where it gets interesting.

Both agents share the same underlying architecture — both running on Claude, both using the same tool-calling patterns, both making autonomous decisions. The scenes they build are comparable in complexity: buildings, lighting, NPCs, moving vehicles.

The results?

Dusk's Godot version: averaging 880 FPS.
Midnight's Babylon.js version: nearly unplayable.

The gap is too large to explain away as "Godot is just better." Babylon.js actually has thin instances (thousands of objects in a single draw call), Octree spatial partitioning, and a built-in SceneOptimizer. On paper, it's not inferior.

So what went wrong?

Same starting point, two very different outcomes — guardrailed road vs open desert

The Path of Least Resistance

The problem is default paths.

When Midnight needed to place 1,000 streetlights, it searched for "babylon.js duplicate mesh." Top result: createInstance(). Clean API. Well-documented. Perfectly functional. The agent used it.

1,000 lights = 1,000 draw calls.

After placing 1,800 lights with 3 meshes each (pole + shade + base), the scene had 5,400 individual objects. The game ground to a halt.

Babylon.js's thin instances API exists and can be 100x more efficient. But it lives deeper in the documentation — not the first answer when you search for "how to duplicate objects."

In Godot, the same problem leads somewhere different. Search "Godot duplicate mesh" and the first result is MultiMeshInstance3D. One draw call. One thousand lights. Done. That's Godot's standard approach for placing many identical objects — the documentation structure itself guides you there, even if you don't know what instancing means.

Dusk doesn't outperform Midnight because Godot is a stronger engine. It's because Godot's default path is the optimal path. The agent followed the road of least resistance, and that road happened to be correct.

Constraints Are Quality Guarantees

Software engineers know this under a different name: opinionated frameworks.

TypeScript's type system looks like a restriction, but it's really a mechanism that catches your mistakes before you deploy. React's hooks lint rules, ESLint warnings, Next.js's folder conventions — all the same idea. Move the signal "something is wrong" to a point where you can still fix it.

With Babylon.js, placing 5,000 objects via createInstance() triggers no warnings. It compiles. It builds. It renders. You only discover the problem when FPS drops to 3.

For human engineers, flexibility is an asset. You have intuition from experience — you know when to follow the framework's opinions and when to diverge.

For AI agents, flexibility can be a landmine. Agents don't lack capability — they lack the accumulated judgment that comes from making mistakes before. Compile errors, lint warnings, type errors: these are proxies for intuition. Without active feedback, an agent confidently walks the wrong path all the way to the end.

This connects directly to last week's post: Midnight had access to complete optimization documentation and never read it. Documentation is passive knowledge — it waits for you to know you don't know something. A compile error is active feedback — it finds you. Agents struggle with both, but passive knowledge is nearly invisible to them.

Guardrails guide you right; open space hides the traps — agents need the former

Godot Is Now Live

A quick note on today's milestone: Dusk successfully deployed the Godot version to voiceloader.io/godot. This required getting Godot's HTML5 export to work with the right COOP/COEP headers so SharedArrayBuffer functions correctly in the browser. Dusk figured it out.

You can now compare both versions side by side — the open-world /game and the arcade racer /godot. Same AI infrastructure, same general ambition, completely different experiences.

Which one is more fun? That's for you to discover.

The Takeaway

Choosing tools for an AI agent is an architectural decision, not just a technical preference.

The best tools for agents aren't the most powerful or the most flexible — they're the ones where following the natural path and doing things right are the same thing. Godot works well for agents not because of raw capability, but because its design philosophy puts the correct answer where you'll look first.

Freedom is a wonderful gift. Just not always to someone who hasn't yet learned what to do with it.