Why everyone HATES GPT-5 (and how to fix it) | Matt Wolfe | Kazuha

313 days ago • Matt Wolfe • @mreflow

YouTube • 39 min 1 sec

Note: AI-generated summary based on third-party content. Not financial advice.

Quick Insights

The AI landscape is shifting as OpenAI's new model faces performance issues, creating opportunities for competitors. Anthropic's Claude is emerging as a strong challenger, which presents a bullish case for its major public investors Google (GOOGL) and Amazon (AMZN). Investors should be cautious as the negative reception for OpenAI's product could pose a risk to the AI narrative supporting Microsoft (MSFT). The intense competition suggests a diversified investment strategy across the AI ecosystem may be more prudent than betting on a single company. The market is now rewarding practical business applications and cost-effectiveness over just raw model power.

Detailed Analysis

OpenAI (and its primary public investor, Microsoft - MSFT)

The video presents a largely negative view of the GPT-5 launch, citing widespread user frustration and disappointment.
The launch was perceived as being massively overhyped by CEO Sam Altman, which led to the product feeling underwhelming.
A major point of user anger was the initial removal of popular legacy models like GPT-4o. While OpenAI later reversed this decision due to backlash, the move was seen as a "bait and switch" that damaged user trust.
There is a strong narrative that the new model is primarily a cost-saving measure for OpenAI. The new "model router" is believed to automatically send user prompts to the cheapest, least capable model possible, resulting in lower quality and "fast slop answers."
Performance issues were highlighted in several areas:
- Coding: In a direct comparison to create a browser game, GPT-5 was outperformed by competitor Claude Opus 4.1 and even by OpenAI's own older model, o3-pro.
- Accuracy: The host concludes that accuracy is a significant concern, stating there's a "50-50 chance" of getting a correct answer.
  - It failed a simple math problem: 5.9 = X + 5.11.
  - It failed a logic riddle about how to drink from a cup that is sealed on top and open on the bottom (the answer is to turn it over).
- Speed & Reliability: The host noted that the model is "annoyingly slow sometimes" and occasionally freezes, requiring a new chat to be started.
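
For reference, the math problem cited under Accuracy has a single exact solution: rearranging 5.9 = X + 5.11 gives X = 5.9 - 5.11 = 0.79. The short Python sketch below checks that arithmetic. The function name `solve_for_x` is an illustrative assumption and does not come from the video; the sketch uses the standard-library `decimal` module so the subtraction is done in exact decimal arithmetic.

```python
from decimal import Decimal


def solve_for_x(total: str, addend: str) -> Decimal:
    """Solve total = x + addend for x using exact decimal arithmetic.

    Hypothetical helper for illustration only; values are passed as
    strings so Decimal parses them exactly.
    """
    return Decimal(total) - Decimal(addend)


x = solve_for_x("5.9", "5.11")
print(x)  # 0.79
```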

Takeaways

OpenAI, a market leader in AI, is facing significant reputational and product-related challenges with its flagship GPT-5 model.
The perception that quality has been sacrificed for cost-cutting could lead to users switching to competing platforms, representing a threat to OpenAI's market share.
For public market investors, these issues represent a potential risk for Microsoft (MSFT), which is the primary financial backer of OpenAI. A decline in OpenAI's technological leadership or public perception could negatively impact Microsoft's broader AI strategy and stock narrative.
On a positive note, OpenAI did listen to user feedback by bringing back legacy models. The long-term vision of creating highly personalized AI models for each user could be a powerful future catalyst, but execution is key.

Anthropic's Claude (and its public investors, Google - GOOGL & Amazon - AMZN)

The video positions Claude, an AI model from the company Anthropic, as a very strong and capable competitor to OpenAI.
The sentiment towards Claude is highly positive, with one user being quoted as saying, "After thorough evaluation of ChatGPT-5... Claude is pretty freaking awesome."
In a head-to-head coding challenge to build a game, Claude Opus 4.1 was judged to have created a more "aesthetically pleasing" and functional version than GPT-5.
The host also recalls a previous test where Claude was superior to GPT in creating a productivity app, reinforcing the idea of its strong coding capabilities.

Takeaways

Anthropic's Claude is emerging as a formidable competitor that is potentially capturing market share from dissatisfied OpenAI users. Its strong performance suggests the "AI race" is far from over.
Claude's perceived superiority in key areas like coding could give it a significant technical and competitive edge, particularly with developers and enterprise customers.
This is a bullish signal for Anthropic's major public investors, which include Google (GOOGL) and Amazon (AMZN). Strong performance from Claude validates their significant investments and strengthens their competitive position in the AI market.

Globant (GLOB)

This company was mentioned in a sponsored advertisement within the video.
Globant is presented as a company focused on enterprise AI solutions.
Its platform, Globant Enterprise AI, allows businesses to build AI agents tailored to specific needs, such as a research analyst.
The company offers a subscription service called AI Pods, described as an "enterprise-grade AI engineering" service to build functional products for businesses.

Takeaways

Globant represents a "picks and shovels" investment opportunity in the AI sector. It focuses on the practical B2B (business-to-business) application and integration of AI, which is a major growth area.
Instead of building foundational models, Globant helps other companies use them, a potentially less risky and highly valuable part of the AI ecosystem.
Important: Investors should note this information came from a paid sponsorship and is not an independent analysis from the video's host. Further personal research and due diligence are essential.

General AI Sector & Competition

The video highlights that the AI market is highly competitive and dynamic, with no single clear winner. The "best" model often depends on the specific task.
The host's tests included models from several key players: OpenAI (GPT-5, o3-pro), Anthropic (Claude Opus 4.1), and xAI (Grok 4).
The flawed launch of GPT-5 has tempered expectations for a near-term "singularity" or AGI (Artificial General Intelligence), with one user noting they are "a lot less concerned about... doomy scenarios."
In the coding test, Grok 4 from Elon Musk's xAI was ranked last, described as the slowest and producing the least functional code of the four models tested.

Takeaways

The AI investment landscape is volatile. The leader today may not be the leader tomorrow. This suggests a diversified investment strategy across the ecosystem (e.g., model makers, cloud providers, enterprise integrators) may be more prudent than betting on a single company.
The poor performance of Grok 4 in this specific test could indicate it is lagging its competitors in certain capabilities, which may be a point of concern for the valuation and prospects of its parent company, xAI.
The market appears to be maturing, with a growing focus on cost-effectiveness and practical business applications (like OpenAI's router and Globant's services) rather than just raw model power.

Video Description

Start creating AI-first services from the ground up today with Globant's Enterprise AI and AI Pods. Learn more: https://globant.link/4lFGs1f

There was a ton of hype surrounding the launch of ChatGPT-5, and a lot of people feel that the newest offering from OpenAI fell short. In this video we dive deep into the gripes people have with ChatGPT-5 and test a lot of issues to see if there's any validity to the complaints.

Discover More:
- 🛠️ Explore AI Tools & News: https://futuretools.io/
- 📰 Weekly Newsletter: https://futuretools.io/newsletter
- 🎙️ The Next Wave Podcast: https://youtube.com/@TheNextWavePod

Socials:
- 🖼️ Instagram: https://instagram.com/mr.eflow
- ❌ Personal Twitter/X: https://x.com/mreflow
- ❌ Future Tools Twitter/X: https://x.com/futuretoolsio
- 🧵 Threads: https://www.threads.net/@mr.eflow
- 🟦 LinkedIn: https://www.linkedin.com/in/matt-wolfe-30841712/

Let's work together!
- Brand, sponsorship & business inquiries: mattwolfe@smoothmedia.co

#AINews #AITools #ArtificialIntelligence

Time Stamps:
- 00:57 - All Gripe Overview
- 04:36 - Gripe 1: Where'd All The Models Go???
- 09:32 - Gripe 2: Personality
- 16:09 - Gripe 3: Not Better at Coding
- 24:19 - Gripe 4: (In)Accuracy
- 33:09 - Final Gripe-nalysis

About Matt Wolfe

Matt Wolfe

By @mreflow

AI News Breakdowns every Saturday and other cool nerdy tech and AI stuff in between. Let's work together! - For brand ...