In some of the comment threads around here, a few of you have shared interesting ideas and patterns, enough that I believe everyone interested in harness engineering is working on some sort of software dark factory or another.

We have OpenAI’s Symphony[1], StrongDM’s Factory[2], Yegge’s GasTown[3], and probably a few others I’ve missed.

So I’m curious. What have you been working on? What have you learned? What has worked and what has failed? And what do you think comes after?

I’ll go first. The first thing I tried that yielded interesting results was, when possible, providing a ground truth or reference for the model to iterate against: screenshots or mockups for UI work, API contracts and unit / integration tests for logic. That’s the Ralph Loop we all know and love. A feedback loop.

The second (obvious, I know) was splitting planning and implementation.

Reviews by other models and iterative loops came next, with appreciable results. However, the implementing agent would often wiggle out by deferrin...
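
For anyone who wants the ground-truth feedback loop mentioned above in concrete form, here is a minimal sketch in Python. Everything in it is an assumption made for illustration: `feedback_loop`, `check_against_contract`, `toy_agent`, and the `CONTRACT` dictionary are hypothetical names, and the "agent" is a stand-in function rather than a real model call. It only shows the shape of the loop: propose, check against the reference, feed the failures back, stop when nothing fails or the budget runs out.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical types: a "candidate" is whatever the agent produces (here a dict
# standing in for an implementation); "failures" are human-readable messages.
Candidate = dict
Propose = Callable[[Optional[Candidate], List[str]], Candidate]
Check = Callable[[Candidate], List[str]]


def feedback_loop(
    propose: Propose, check: Check, max_iters: int = 5
) -> Tuple[Optional[Candidate], List[str], int]:
    """Ask for a candidate, check it against the reference, feed failures back."""
    candidate: Optional[Candidate] = None
    failures: List[str] = []
    for attempt in range(1, max_iters + 1):
        candidate = propose(candidate, failures)
        failures = check(candidate)
        if not failures:
            return candidate, failures, attempt  # reference satisfied
    return candidate, failures, max_iters  # budget exhausted, failures remain


# Ground truth (assumed for the example): an API contract as expected fields.
CONTRACT = {"status": 200, "content_type": "application/json", "body_key": "items"}


def check_against_contract(candidate: Candidate) -> List[str]:
    """Return one message per field that does not match the contract."""
    return [
        f"{key}: expected {expected!r}, got {candidate.get(key)!r}"
        for key, expected in CONTRACT.items()
        if candidate.get(key) != expected
    ]


def toy_agent(previous: Optional[Candidate], failures: List[str]) -> Candidate:
    """Stand-in for a model call: fixes only the first reported failure per turn.

    It reads CONTRACT directly, which a real agent could not do; it exists only
    so the loop has something to drive.
    """
    candidate = dict(previous or {})
    if failures:
        key = failures[0].split(":", 1)[0]
        candidate[key] = CONTRACT[key]
    return candidate


result, remaining, attempts = feedback_loop(toy_agent, check_against_contract)
print(f"converged after {attempts} attempts, {len(remaining)} failures left")
```

In a real harness the `check` callable would presumably run a test suite or a screenshot diff, and `max_iters` is what keeps a stuck agent from looping forever.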