This Day in AI Podcast

Michael Sharkey, Chris Sharkey

Join Michael and Chris Sharkey, two proudly average tech enthusiasts, as they stumble through the world of artificial intelligence with all the grace of a robot learning to dance.

  • 1 hour 3 minutes
    GPT-5.2 Can't Identify a Serial Killer & Was The Year of Agents A Lie? EP99.28-5.2

    Join Simtheory: https://simtheory.ai

    GPT-5.2 is here and... it's not great. In this episode, we put OpenAI's latest model through its paces and discover it can't even identify a convicted serial killer when the text literally says "serial killer." We compare it head-to-head with Claude Opus and Gemini 3 Pro (spoiler: they win). Plus, we reflect on the "Year of Agents" that wasn't, why your barber switched to Grok, Disney's billion-dollar investment to use Mickey Mouse in Sora, and why Mustafa Suleyman should probably be fired. Also featuring: the GPT-5.2 diss track where the model brags about capabilities it doesn't have.

    CHAPTERS:

    00:00 Intro - GPT-5.2 Drops + Details
    01:25 First Impressions: Verbose, Overhyped, Vibe-Tuned
    02:52 OpenAI's Rushed Response to Gemini 3
    03:24 Tool Calling Problems & Agentic Failures
    04:14 Why Anthropic's Models Just Work Better
    06:31 The Barber Test: Real Users Are Switching to Grok
    10:00 The Ivan Milat Vision Test (Serial Killer Edition)
    17:04 Year of Agents Retrospective: What Went Wrong
    25:28 The Path to True Agentic Workflows
    31:22 GPT-5.2 Diss Track (Yes, Really)
    43:43 Why We're Still Optimistic About AI
    50:29 Google Bringing Ads to Gemini in 2026
    54:46 Disney Pays $1B to Use Mickey Mouse in Sora
    56:57 LOL of the Week: Mustafa Suleyman's Sad Tweets
    1:00:35 Outro & Full GPT-5.2 Diss Track

    Thanks for listening. Like & Sub. xoxox

    12 December 2025, 2:02 am
  • 1 hour 3 minutes
    ChatGPT is Dying? OpenAI Code Red, DeepSeek V3.2 Threat & Why Meta Fires Non-AI Workers | EP99.27

    Join Simtheory: https://simtheory.ai/

    OpenAI has declared "Code Red" as ChatGPT faces growing competition from Gemini and other rivals. In this episode, we break down OpenAI's 6% market share decline, why their ad strategy is on hold, and what they need to do to reclaim the AI crown. We also explore DeepSeek V3.2's impressive capabilities as a cheap open-source alternative, Meta's new policy grading employees on AI skills, and the crisis facing higher education as AI fluency becomes essential. Plus, Fatal Patricia hits #1 on our Spotify charts, and Tesla's Optimus robot is running like a slightly unfit human.

    CHAPTERS:
    00:00 Intro - OpenAI Code Red & Market Share Crisis
    07:03 ChatGPT's Failure to Go Deeper Into Users' Lives
    16:33 What OpenAI Needs to Win Back the Crown
    26:46 Chris's Wishlist for an OpenAI Comeback
    31:22 DeepSeek V3.2 - The Open Source Threat
    39:34 Meta Grading Workers on AI Skills
    46:29 The University & Education AI Crisis
    56:25 Fatal Patricia Hits #1 & WTF of the Week

    Thanks for listening. Like & Sub. xoxox

    4 December 2025, 2:02 am
  • 1 hour 45 minutes
    Claude 4.5 Opus Shocks, The State of AI in 2025, Fara-7B & MCP-UI | EP99.26

    Join Simtheory: https://simtheory.ai (Use coupon BLACKFRIDAY15 for $15 USD off any subscription).
    ----
    Simtheory Discord: https://discord.gg/Ar6GeQnAR7
    This Day in AI Discord: https://discord.gg/TVYH3HD6qs
    LinkedIn Group: https://www.linkedin.com/groups/16562039/
    Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=FPaJU2NRSnOSNPmnsfwA_g
    ---
    CHAPTERS:
    00:00 Intro & Fatal Patricia Update
    01:40 Promotions (Discord, Black Friday, LinkedIn)
    04:36 Claude 4.5 Opus - Best Anthropic Model Ever?
    31:17 Computer Use API Updates
    36:14 Will AI Replace 57% of Jobs? (McKinsey Report)
    1:00:52 Claude 4.5 Opus Demos (Christmas Hut & Diss Track Preview)
    1:07:13 Microsoft Fara-7B - Moose Porn Refusals
    1:21:51 Why ChatGPT's MCP-UI Apps Are a Bad Idea
    1:42:01 🎵 Claude 4.5 Opus Diss Track (Full Song)
    ---
    Thanks for listening. Like & Sub. xoxox

    Anthropic just dropped Claude 4.5 Opus and it might be the best AI model of 2025. In this episode, we compare Claude 4.5 Opus vs Gemini 3 Pro vs GPT-5.1, breaking down the new API features including effort parameters, context management, and computer use updates. We also test Microsoft's new Fara-7B model for computer use - with hilarious refusal results. Plus, we react to McKinsey's controversial report claiming AI agents could automate 57% of US jobs by 2030.

    We dive deep into Anthropic's pricing (3x cheaper than Opus 4.1), why Claude is now beating Google and OpenAI on agentic coding benchmarks, and whether MCP-UI apps in ChatGPT are a step backwards for AI workflows. Is Claude 4.5 Opus the new king of AI coding assistants? Should enterprises be worried about AI job replacement? And why did Microsoft's Fara model refuse to draw a moose? All this plus an AI-generated diss track roasting Sam Altman, Elon Musk, and Sundar Pichai.

    28 November 2025, 1:51 am
  • 1 hour 44 minutes
    Is Gemini 3 Really the Best Model? & Fun with Nano Banana Pro - EP99.25-GEMINI

    Join Simtheory for Gemini 3 & Nano Banana Pro: https://simtheory.ai
    ----
    CHAPTERS:
    00:00 - Gemini 3 Pro Impressions & Thoughts
    33:34 - xAI Releases Grok 4.1 Fast
    40:09 - More on Gemini 3 Pro: What We Want Improved
    45:46 - Gemini 3 Pro Diss Track
    51:16 - Thoughts on Nano Banana Pro And What It Means
    1:12:49 - Does Nano Banana Disrupt Design Software Like Canva? Where is This Going?
    1:26:20 - OpenAI's Reaction to Gemini 3 Pro & Nano Banana with GPT-5.1-Pro and Codex model updates
    1:32:38 - Final Thoughts & Sam Altman Sad Song
    1:38:41 - FATAL PATRICIA SONG
    1:42:12 - Gemini 3.0 Pro Diss Track
    ----
    Thanks for your support plz like and sub xoxo

    21 November 2025, 1:52 am
  • 1 hour 5 minutes
    Are We In An AI Bubble? In Defense of Sam Altman & AI in The Enterprise | EP99.24

    Join Simtheory & experience MCPs in action: https://simtheory.ai
    ----
    00:00 - Chris Has a Merch Sponsor
    02:42 - In Defense of Sam Altman
    20:29 - Are We In An AI Bubble? & What is Working in The Enterprise?
    43:58 - Anthropic's Code Execution with MCP: Problems with MCP Context
    52:44 - Kimi-K2 Thinking Model Release
    1:00:45 - "In the Middle of a Bubble" Song
    ----
    Thanks for your support and listening, we appreciate you!
    Join our Discord: https://discord.gg/TVYH3HD6qs

    7 November 2025, 1:20 am
  • 1 hour 33 minutes
    Why Sam Altman is Scared & Why People Are Giving Up on MCP | EP99.23

    Join Simtheory to experience MCPs: https://simtheory.ai
    ----
    00:00 - OpenAI's State of the Union & Why Cursor's Composer Model is a Threat
    44:26 - Does MCP Need To Die? Our Thoughts on State of MCP and Why The Client Implementations are the Problem
    1:07:53 - 1X NEO The Home Robot LOLZ
    1:28:05 - Greg Brockman, A Sad Song.
    ----
    Thanks for listening and your continued support. We appreciate you.

    31 October 2025, 1:04 am
  • 1 hour 26 minutes
    Do We Need AI Browsers? What Are Claude Skills? - EP99.22

    Join Simtheory: https://simtheory.ai
    -----
    00:00 - AI Browser Wars: ChatGPT Atlas, Copilot Updates & Edge Copilot AI
    23:15 - Why Not Focus on Real Use Cases for AI?
    34:49 - Claude Skills: What Are Claude Skills? What is the Difference Between MCP and Skills?
    1:04:05 - Vibe Code Fashion: Oakley Meta Vanguards + Use Cases of AI Glasses
    1:15:05 - Top Models Used on Simtheory & Final Thoughts
    ------
    Thanks for listening and your support xoxo

    24 October 2025, 12:13 am
  • 1 hour 13 minutes
    Is Haiku 4.5 really THIS good? OpenAI's Erotic Mode & Are MCP Apps the Right Approach? EP99.21

    Join Simtheory: https://simtheory.ai
    Use "SIMLINK" to get 30% off Pro & Max annual plans until Oct 31st 2025
    ----
    CHAPTERS:
    00:00 - Gemini 3.0 HYPE with "make an OS"
    03:50 - Anthropic Releases Claude Haiku 4.5: Initial Thoughts
    11:57 - Veo 3.1 and new modes (first frame/last frame & reference to image)
    25:20 - OpenAI's Erotica Mode & age verification thoughts
    34:25 - OpenAI Partners with Everyone & Memes
    35:38 - Salesforce OpenAI Partnership & What Should SaaS do with MCP apps?
    1:09:25 - Final thoughts, Polymarket
    ----
    Thanks for your support and listening to the show xox

    16 October 2025, 3:48 am
  • 1 hour 5 minutes
    What did OpenAI Announce at DevDay? Apps SDK, MCP UI & Impact to SaaS - EP99.20-APPS

    Join Simtheory: https://simtheory.ai
    ----
    Check out our albums on Spotify: https://open.spotify.com/artist/28PU4ypB18QZTotml8tMDq?si=XfaAbBKAQAaaG_Cg2AkD9A
    ----
    00:00 - OpenAI DevDay 2025 Recap
    03:24 - ChatGPT Apps SDK & MCP UI & Agents SDK
    42:11 - AgentKit & AgentBuilder: Who is it for?
    50:41 - GPT-5-pro in API
    53:15 - gpt-realtime-mini
    56:53 - Sora 2 & Sora 2 in API Vs Veo3
    1:01:43 - Final thoughts & This Day in AI albums now on Spotify!

    Thanks for your support and listening xoxo

    10 October 2025, 1:19 am
  • 1 hour 39 minutes
    Doom Scrolling SORA2, Claude 4.5 Sonnet & Are Agents Coming for our Jobs? EP99.19

    Join Simtheory: https://simtheory.ai (Use STILLRELEVANT for $10 off)
    ----
    00:00 - Sora2 Examples
    00:56 - Sora2: Initial Impressions & Thoughts
    26:39 - Claude Sonnet 4.5: It's REALLY good
    47:09 - Claude Agent SDK & AI Agent Systems
    55:05 - Is Claude Imagine a Look at Future Software / AI OS?
    1:00:25 - Claude 4.5 Sonnet Diss Track
    1:06:24 - "Real AI Agents and Real Work" & Enterprise Agent / MCP workflows
    1:31:41 - LOL of the week Sora2 Steve Irwin Video
    1:35:07 - Full Claude Sonnet 4.5 Diss Track
    ----
    Thanks for listening and your support, we really appreciate it!
    xoxox

    3 October 2025, 2:05 am
  • 1 hour 18 minutes
    lolz with Omnihuman, Agentic Gemini 2.5 Flash, Grok 4 FAST & ChatGPT Pulse - EP99.18-v5-FLASH

    Join Simtheory: https://simtheory.ai
    & Try Omnihuman, Gemini Flash 2.5 Preview, Grok 4 FAST, and Suno v5! Code: STILLRELEVANT 
    ---
    Links:
    https://worksinprogress.co/issue/the-algorithm-will-see-you-now/
    https://developers.googleblog.com/en/continuing-to-bring-you-our-latest-models-with-an-improved-gemini-2-5-flash-and-flash-lite-release/
    ---
    CHAPTERS:
    00:00 - Gemini 2.5 Flash Agentic Tests with Omnihuman, Suno v5 and Research Tools
    06:29 - Diss Track AI Music Video (Made by Gemini 2.5 Flash)
    07:06 - Thoughts on Suno v5, More Agentic Model Discussion
    29:10 - Are we all sleeping on Grok 4 FAST with 2M context?
    41:46 - Radiologists are STILL RELEVANT & Is AI Going to Take Our Jobs?
    44:46 - The need to use multiple specialist models
    1:01:20 - Is ChatGPT Pulse To Just Sell Ads?
    1:08:46 - Final thoughts for the week
    1:11:54 - Gemini Flash 2.5 Diss Track
    1:15:08 - Love Rat Suno v5 The Midnight Inspired Test

    Thanks for all of your support and listening to the show we really appreciate it! xoxo

    26 September 2025, 3:22 am