VCF 9.1: From AI Hype to Production Reality

Virtually Speaking Podcast21mMay 5, 2026

Get the full intelligence

Search transcripts, export clips, track mentions, and explore all topics from “VCF 9.1: From AI Hype to Production Reality” inside PodZeus.

AI-Generated Summary

In this episode of the Virtually Speaking Podcast, hosts Pete Fletcher and John Nicholson welcome Chris Wolfe, Global Head of AI at Broadcom, to discuss the real-world evolution of AI deployment with VMware Cloud Foundation (VCF) 9.1. The conversation centers on the growing momentum of private AI, where organizations are moving beyond hype to implement production-grade AI solutions on-premises or at the edge. Chris highlights how customers are leveraging VCF for specific, measurable use cases—like government chatbots and contact center automation—while balancing cloud-based frontier models with local inference for governance, cost efficiency, and data control. A key theme is the shift from theoretical AI adoption to operational reality, with VCF acting as an intelligent orchestrator that enables smart routing of workloads between on-prem and cloud environments. The episode also explores technical advancements in VCF 9.1, including GPU pass-through support, CPU inference via Lama CPP, enhanced observability through detailed metrics, and the new MCP tools registry for governance. Chris emphasizes that production AI is far more complex than just generating tokens, and VCF’s role as a 'production enabler' is gaining traction across enterprises and partner ecosystems. The episode concludes with Chris expressing excitement about VCF’s strategic alignment with the industry’s shift toward private AI inference, the growing partner enthusiasm from model creators, and the increasing recognition that AI deployment requires more than just models—it demands infrastructure, governance, and operational maturity. The hosts underscore that the AI journey is no longer about experimentation but about building scalable, secure, and measurable AI systems, with VCF emerging as a critical foundation for this transition.

Key Takeaways
1

Private AI is moving from concept to production, with real-world use cases like government chatbots and contact center automation driving adoption.

2

Smart workload routing—using local models for simple tasks and cloud models for complex queries—can reduce costs by up to 70% and improve data governance.

3

VCF 9.1 enhances production readiness with GPU pass-through, CPU inference support, and improved observability for better resource management.

4

The rise of MCP tools and governance layers is critical to prevent security risks from uncontrolled AI integrations.

5

Partnerships are key: VCF’s open architecture allows third parties to extend the stack without competition, accelerating deployment at scale.

…and 2 more takeaways available in PodZeus

Chapters
0:00
2 min

Welcome to VCF 9.1: From Hype to Production

Hosts Pete and John welcome Chris Wolfe to discuss the shift from AI hype to real-world deployment, setting the stage for a deep dive into VCF 9.1’s production-ready capabilities.

2:00
4 min

The Rise of Private AI and Real-World Use Cases

The prime minister of this government is using a chatbot that's running on VCF to answer questions and get data from the government.

Highlight
6:00
5 min

Smart AI Routing: Balancing On-Prem and Cloud

If I can be smart about what I'm sending to the cloud and what I'm not sending, the price savings could be around 70%.

Highlight
11:00
5 min

Why 95% of AI Projects Fail—and How VCF Solves It

Production inference is hard. I can't buy an appliance that was just designed to generate tokens like this is the problem I have with the whole AI factory narrative.

Highlight
16:00
5 min

VCF 9.1: Technical Advances and the Future of AI Infrastructure

The episode wraps with a deep dive into VCF 9.1’s new features: GPU pass-through, CPU inference, MCP governance, and enhanced observability, all designed to make AI deployment scalable and secure.

High-Impact Quotes
Production inference is hard. I can't buy an appliance that was just designed to generate tokens like this is the problem I have with the whole AI factory narrative.
Chris Wolfe19:46
Viral: 95.0
If I can be smart about what I'm sending to the cloud and what I'm not sending, the price savings could be around 70%.
Chris Wolfe5:07
Viral: 90.0
The prime minister of this government is using a chatbot that's running on VCF to answer questions and get data from the government.
Chris Wolfe2:12
Viral: 85.0
Speakers

Hosts

Pete FletcherJohn Nicholson

Guest

Chris Wolfe
Topics Discussed
Private AI Deployment95%AI Cost Optimization90%Production AI Challenges85%Hybrid AI Architecture85%AI Governance and Security80%GPU and Hardware Support75%MCP Tools and Integration70%AI Observability and Metrics65%
People & Brands

VMware Cloud Foundation

product

25xPositive

Chris Wolfe

person

18xPositive

VCF 9.1

product

15xPositive

Pete Fletcher

person

12xNeutral

John Nicholson

person

10xNeutral

NVIDIA Blackwell

product

6xPositive

MCP Tools

other

6xPositive

Lama CPP

product

3xPositive

Triton Inference Server

product

2xPositive

Cohere

organization

2xPositive

Get the full intelligence

Search transcripts, export clips, track mentions, and explore all topics from “VCF 9.1: From AI Hype to Production Reality” inside PodZeus.

Start discovering podcast insights today

Start with a 7-day trial and explore a growing catalog of popular podcasts. No credit card required.

No credit card required • 7-day trial • Cancel anytime