Etched - Building AI Hardware to Make Inference Faster and Cheaper - [Invest Like the Best, EP.480] | Invest Like the Best with Patrick O'Shaughnessy

Etched - Building AI Hardware to Make Inference Faster and Cheaper - [Invest Like the Best, EP.480]

2 hours ago•Invest Like the Best with Patrick O'Shaughnessy•Colossus | Investing & Business Podcasts

Podcast1 hr 27 min

Listen to Episode

Note: AI-generated summary based on third-party content. Not financial advice. Read more.

Quick Insights

The shift from AI training to inference (running models) is creating a massive opportunity for specialized hardware that outperforms general-purpose GPUs. Investors should look toward TSMC (TSM), which remains the essential manufacturing partner for next-generation AI chips like those from Etched and big-tech hyperscalers. While NVIDIA (NVDA) remains the industry benchmark, its high margins are under threat from application-specific chips (ASICs) designed to run "Transformer" architectures more efficiently. Consider exposure to the "Token Economy" by focusing on companies that maximize "tokens per watt," as energy availability is now the primary bottleneck for AI scaling. By 2027, the market is expected to pivot from "per-seat" software pricing to "per-inference" costs, benefiting high-growth AI agent platforms like Perplexity and Anthropic.

Detailed Analysis

Etched (Private Company)

Etched is a hardware startup founded in 2023 by Gavin Uberti and Rob Locken. The company is building specialized chips and full-rack systems specifically designed for AI inference (running models) rather than training. They have raised over $800 million and report over $1 billion in customer demand.

The "Sohu" Chip: Their first product is a specialized chip designed to run AI models faster and cheaper than general-purpose GPUs.
Vertical Integration: Unlike many competitors, Etched is building the entire stack: the chip, the boards, power delivery, interconnects, and the full server rack.
Technical Bets:
- Low Voltage Inference: They run chips at under half the voltage of traditional GPUs (similar to Bitcoin miners), which prevents "thermal throttling" and allows for much higher performance density.
- Cluster Scale Memory: They built a custom interconnect stack to allow multiple chips to communicate with 5x lower latency than current industry standards (like NVIDIA’s Blackwell).
- Architecture Specificity: While GPUs are general-purpose, Etched chips are hard-coded for the "Transformer" architecture (the math behind ChatGPT), allowing for massive efficiency gains.

Takeaways

Inference is the Future: The founders argue that while training was the first wave, inference will become the largest market in the world as billions of users begin using AI agents concurrently.
Efficiency over Versatility: For investors, the lesson is that "general purpose" hardware (like NVIDIA GPUs) may eventually lose market share to "application-specific" hardware (ASICs) like Etched as AI workloads become standardized.
Speed as a Moat: Etched emphasizes "wall clock time." If a complex AI task takes a month on current hardware, Etched aims to compress that into days, which could accelerate scientific and medical breakthroughs.

NVIDIA (NVDA)

The discussion frequently uses NVIDIA as the benchmark for current industry standards, while highlighting the limitations of general-purpose GPUs for specific AI tasks.

The "NVIDIA Tax": Hyperscalers (Google, Meta, Microsoft) are building their own chips (TPUs, Maya, etc.) primarily to avoid the high costs of buying NVIDIA hardware.
Hardware Retrofitting: The founders argue that current GPUs were designed before the ChatGPT era and are being "retrofitted" for modern AI, leaving room for new architectures to outperform them.
Interconnect Latency: The transcript notes that NVIDIA's Blackwell chips have a point-to-point latency of about 4,000 nanoseconds, which Etched claims to significantly reduce.

Takeaways

Dominance but Vulnerability: While NVIDIA remains the gold standard, its "general purpose" nature is its potential weakness. Specialized startups and internal big-tech chips are targeting NVIDIA’s high margins in the inference market.
Talent Migration: Etched has successfully recruited "legends" from NVIDIA (including the creator of the HGX/DGX systems), suggesting a shift in where top-tier hardware talent is moving.

Investment Theme: The "Token" Economy

The podcast introduces a shift in how productivity and economic value should be measured in the AI era.

Tokens as a Commodity: The founders view "tokens" (units of AI text/thought) as a commodity that will eventually have economies of scale similar to iPhones or cars.
GDP per Megawatt: A future metric for national success may be "agents per megawatt"—how much AI work a country can generate from its energy grid.
The $100 Billion Data Center: The founders predict the rise of "mega token factories"—single data centers costing $100 billion—designed to serve a handful of massive models to billions of people.

Takeaways

Energy is the Bottleneck: Power availability is the primary constraint on AI growth. Companies that can provide more "tokens per watt" will have a significant competitive advantage.
Agentic Workforce: By 2027, the founders predict there will be more AI agents in the workforce than humans, shifting the focus of software companies from "per seat" pricing to "per token" or "per inference" costs.

Mentioned Companies & Platforms

The following companies were mentioned as key players in the AI infrastructure and software ecosystem:

TSMC (TSM): Highlighted as an essential partner with superior customer service and yield optimization. Etched uses TSMC's 4nm process.
Hyperscaler Chips: Mentioned as competitors to NVIDIA/Etched, including Google’s TPU, Meta’s MTIA, Microsoft’s Maya, and OpenAI’s rumored Jalapeno project.
AI Software/Agents: Cursor, Anysphere, Perplexity, and Anthropic were noted as high-growth teams driving the demand for cheaper and faster inference.
Infrastructure Tools: WorkOS (enterprise readiness), Vanta (compliance), and Ridgeline (investment management software) were highlighted as essential tools for the modern AI/Finance stack.

Takeaways

Supply Chain Diversification: Etched intentionally built its first-gen product on a different supply chain (4nm) than NVIDIA’s next-gen Rubens (3nm) to avoid zero-sum competition for manufacturing capacity.
The "Existential" Advantage: The founders argue that specialized startups (like Etched) may build better chips than Big Tech (Google/Meta) because the chip is the startup's only product, whereas for Big Tech, it is just a cost-saving measure.

Ask about this postAnswers are grounded in this post's content.

Episode Description

My guests today are Gavin Uberti and Rob Wachen, the founders of Etched. A few years ago, when they set out to build a better AI chip than the largest companies in the world, almost everyone I called told me it could not be done. They have since done it, taping out a working chip on their first attempt and becoming the first hardware company founded after ChatGPT to do so. They already have more than a billion dollars of customer demand for their first product, and have raised eight hundred million dollars to build it. Etched builds chips and systems designed to run AI models faster and at lower cost. They started the company in 2023, and that product is a complete rack for inference, the chip along with the boards, the power delivery, the interconnects, and the manufacturing to produce it all. We talk about the technical bets behind their architecture, how they hired industry legends and paired them with elite 22 year-olds, and why they believe inference will become one of the largest markets in the world. I think you will find the story of what they have built hard to forget. Please enjoy my conversation with Gavin and Rob. For the full show notes, transcript, and links to mentioned content, check out the episode page ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠here⁠⁠⁠⁠⁠. ----- Become a Colossus member to get our quarterly print magazine and private audio experience, including exclusive profiles and early access to select episodes. Subscribe at ⁠colossus.com/subscribe⁠. ----- ⁠Ramp’s⁠ mission is to help companies manage their spend in a way that reduces expenses and frees up time for teams to work on more valuable projects. Go to⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ ⁠ramp.com/invest⁠⁠ to sign up for free and get a $250 welcome bonus. ----- Trusted by thousands of businesses, ⁠Vanta⁠ continuously monitors your security posture and streamlines audits so you can win enterprise deals and build customer trust without the traditional overhead. Invest Like the Best listeners get a special offer of $1,000 off Vanta when you go to ⁠vanta.com/invest⁠. ----- WorkOS⁠ is the infrastructure B2B and AI-native companies use to sell to enterprise. It covers everything enterprise security requires: SSO, SCIM, RBAC, Audit Logs, AI governance, and more. Trusted by 2,000+ fast-growing companies, including OpenAI, Anthropic, Cursor, and Vercel. ----- Rogo is the AI platform for finance. They're building agents for Wall Street that are trained to understand how bankers and investors actually do work: from diligence and modeling, to turning analysis into deliverables. To learn more, visit rogo.ai/invest. ----- ⁠Ridgeline⁠ has built a complete, real-time, modern operating system for investment managers. It handles trading, portfolio management, compliance, customer reporting, and much more through an all-in-one real-time cloud platform. Visit⁠ ⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ridgelineapps.com⁠. ----- Editing and post-production work for this episode was provided by The Podcast Consultant. Timestamps: (00:00:00) Welcome to Invest Like The Best (00:02:07) Gavin Uberti and Rob Wachen (00:03:54) Two 21-Year-Olds Taking on NVIDIA (00:07:52) The Two Technical Bets Behind Their Architecture (00:14:15) Why Inference Becomes the Biggest Market (00:20:23) Rob and Gavin's Origins Stories (00:28:38) How They Recruit Industry Legends (00:36:30) Moving a Dozen Engineers to Bangalore for Six Months (00:38:01) Speed Wins (00:43:58) Getting More Concurrency Out of Every Megawatt (00:52:44) Vertical Integration (00:57:43) Hardest Obstacles to Overcome (01:01:09) Raising The Largest AI Chip Series A Ever (01:06:29) TSMC (01:13:20) Designing Gen 2 for Gigawatt-Scale Production (01:16:42) Why Machines Don't Think Like People (01:20:03) A Year of Compute Compressed Into a Month (01:23:44) The Trillion-Dollar Data Center (01:26:19) The Kindest Thing

About Invest Like the Best with Patrick O'Shaughnessy

Invest Like the Best with Patrick O'Shaughnessy

By Colossus | Investing & Business Podcasts

Conversations with the best investors and business leaders in the world. We explore their ideas, methods, and stories to help you better invest your time and money. Hear stock market and boardroom insights you can't find anywhere else. If you're a professional investor, CEO, entrepreneur, or business strategist, this is for you. Explore all our episodes and learn more at https://www.joincolossus.com