📊 Full opportunity report: The Model Is Only 10%: The Real Lesson of the New SDLC on ThorstenMeyerAI.com — validation score, market gap, and execution plan.
TL;DR
A recent Google whitepaper emphasizes that in AI-assisted coding, the model itself is only 10% of the system. The focus should be on harness design and context engineering, which have a much larger impact on performance and costs.
A new Google whitepaper published in early 2026 states that the model used in AI coding agents accounts for only about 10% of the overall system behavior. The primary lesson is that harness design and context engineering are the real determinants of performance, cost, and reliability in AI-assisted development, not the size or sophistication of the model itself.
The whitepaper, authored by Addy Osmani, Shubham Saboo, and Sokratis Kartakis, argues that the prevailing focus on acquiring the latest AI models is misplaced. Instead, the paper emphasizes that 90% of the behavior of an AI agent depends on the harness—the prompts, rules, tools, and observability layers surrounding the model. This is supported by experiments showing that a single team improved their agent’s performance by only changing the harness, with the model remaining constant.
The authors introduce the concept of agentic engineering, where AI is embedded within a structured framework of verification, testing, and guardrails, contrasting with vibe coding, which relies on minimal prompts and quick fixes. They also highlight that costs associated with AI development are driven more by how the harness is built and maintained than by the model’s complexity, with ad-hoc prompting becoming more expensive over time.
The model is only 10%
A Google whitepaper argues software’s biggest shift is from writing code to expressing intent. Its sharpest claim: the model you obsess over is the smallest part of the system — the scaffolding around it does the real work.
The clearest map yet of how serious AI development works — and mostly tool-agnostic. But it’s a Google funnel: the concepts are neutral, the on-ramps point to Gemini, Jules & the ADK. If the harness is 90% and it’s yours, your moat and your costs both live there — so own your scaffolding, route across models, and remember: AI amplifies whatever engineering culture it lands in.
Why Harness and Context Engineering Are Game Changers
This shift in focus matters because it redefines where organizations should invest resources for AI development. Instead of chasing newer, larger models, companies should prioritize building robust harnesses and context management systems. This approach can reduce costs, improve reliability, and give organizations a durable competitive advantage, as their custom configurations and frameworks are less likely to be overtaken by model upgrades.
Furthermore, understanding that verification and judgment are the new craft in AI development underscores the importance of human oversight and structured testing, which are critical for deploying AI safely and effectively at scale.
AI development harness design tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
The Evolution of AI Development Practices
Since early 2026, the AI development landscape has been shifting from a focus on model size and raw performance towards system design and configuration. The whitepaper builds on earlier observations that AI adoption is widespread, with 85% of developers using AI coding agents regularly and 41% generating code primarily with AI. Prior to this, emphasis was placed on acquiring the latest models; now, the emphasis is on how those models are integrated and controlled.
This development aligns with broader trends in software engineering, where the emphasis on verification, testing, and structured workflows has increased, especially in safety-critical applications.
“The model is only 10% of what determines behavior; the harness is 90%. Focus on configuration, tools, and context.”
— Addy Osmani
AI prompt engineering tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
What Aspects of Harness Design Remain Unclear
It is not yet clear how different industries will adopt these principles at scale or how quickly organizations will shift their investment from models to harnesses. The long-term impact on AI development costs and safety protocols is still being evaluated, and practical implementation guidance is evolving.AI observability and testing software
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Next Steps for AI Development and Adoption
Organizations are expected to begin prioritizing harness development and context engineering in their AI workflows. Future research and industry practices will likely focus on creating standardized frameworks, tools, and best practices for building durable, cost-effective AI systems. Additionally, further empirical studies are anticipated to quantify the benefits of this approach across different sectors.
Monitoring how AI vendors and enterprise teams adapt to this paradigm shift will be crucial in understanding the full impact of the new SDLC framework.
AI model verification tools
As an affiliate, we earn on qualifying purchases.
As an affiliate, we earn on qualifying purchases.
Key Questions
Why is the model only 10% of the system behavior?
The whitepaper argues that the model itself provides only the core generation capability, while the surrounding harness—including prompts, rules, tools, and oversight—controls the actual behavior and performance.
How does focusing on harness design reduce costs?
Building a robust harness minimizes unnecessary token usage, reduces maintenance costs, and improves reliability, making AI deployment more cost-effective over time.
What is agentic engineering?
It is an approach that embeds AI within structured workflows, verification, and guardrails, emphasizing systematic configuration over minimal prompting.
Does this mean larger models are obsolete?
Not necessarily. The whitepaper suggests that while larger models offer capabilities, their advantage diminishes unless paired with well-designed harnesses and context management.
What should organizations do now?
They should evaluate and improve their harnesses—including prompts, tools, and verification processes—and shift focus from model size to system configuration and oversight.
Source: ThorstenMeyerAI.com