Best Carina Devi Podcasts (2025)

The Mythical Journey of Mothers: Making Meaning on the Spiritual Path

19d ago · 1:17:24

In this final episode of the season, we explore motherhood as a profound spiritual and mythic journey - one that cracks us open, reshapes our identity, and calls us into deeper presence - not unlike the Hero’s Journey and ancient rites of passage. From the unraveling of identity to the depths of our shadows and the emergence of a wiser, more embodi…

Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757

1d ago · 48:44

In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss heterogeneous AI inference across diverse hardware. Zain argues that the current industry standard of running all AI workloads on high-end GPUs is unsustainable for agents, which consume significantly more tokens than traditional LLM applications. We explore Gim…

Proactive Agents for the Web with Devi Parikh - #756

15d ago · 56:04

Today, we're joined by Devi Parikh, co-founder and co-CEO of Yutori, to discuss browser use models and a future where we interact with the web through proactive, autonomous agents. We explore the technical challenges of creating reliable web agents, the advantages of visually-grounded models that operate on screenshots rather than the browser’s mor…

AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755

22d ago · 54:46

Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss how AI systems can be used to automate complex workflows and unlock value from legacy enterprise data. Robin and Luke detail high-impact use cases from HPE and Kamiwaza’s collaboration on an “Agentic…

Building an AI Mathematician with Carina Hong - #754

30d ago · 55:52

In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code …

High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

1M ago · 52:23

In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling proces…

Overstimulated, Touched Out, On Edge: Tools for the Breaking Point and Beyond

1M ago · 1:34:28

In this episode of The Inner Work of Motherhood, we explore what happens when a mother’s nervous system becomes overloaded: the overstimulation, anger, irritability, and wanting to disappear. You’ll learn how sensory overwhelm shows up in the body, why anger and reactivity are actually protective responses, and how to gently guide your body and min…

Vibe Coding's Uncanny Valley with Alexandre Pesant - #752

1M ago · 1:12:36

Today, we're joined by Alexandre Pesant, AI lead at Lovable, to discuss the evolution and practice of vibe coding. Alex shares his take on how AI is enabling a shift in software development from typing characters to expressing intent, creating a new layer of abstraction similar to how high-level code compiles to machine code. We explor…

Dataflow Computing for AI Inference with Kunle Olukotun - #751

2M ago · 57:37

In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at SambaNova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match th…

Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750

2M ago · 57:23

Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI, to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the…

The Decentralized Future of Private AI with Illia Polosukhin - #749

2M ago · 1:05:03

In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve glo…

Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748

2M ago · 1:03:39

Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose m…

Reclaim the Magic of Motherhood with Presence and Mindfulness

2M ago · 41:07

Motherhood holds some of the brightest, most magical moments of our lives - ecstatic joy, oceanic love, and the kind of wonder that makes time stand still. But in the busyness of our day-to-day lives, it’s all too easy to lose sight of these moments. Between the endless tasks of feeding, cleaning, and planning, joy can feel like a distant memory ra…

Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747

3M ago · 58:26

Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build more adaptable and creative models. We dig into her ICML 2025 Outstanding Paper Award winner, “Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction,” which ex…

Overwhelm in Motherhood Part 2: How to Break Free from the Freeze Response

3M ago · 54:06

In Part 2 of Overwhelm In Motherhood, we explore the freeze response - the nervous system state where overwhelm lives, and one that can leave mothers feeling shut down, numb, or paralyzed. While freeze can feel like a place of shame or failure, it’s also a clear message from our body - we just have to learn to listen to what she’s saying. This episode…

Building an Immune System for AI Generated Software with Animesh Koratana - #746

3M ago · 1:05:11

Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero, to discuss his team’s approach to making agentic and AI-assisted coding tools production-ready at scale. Animesh explains how rapid advances in AI-assisted coding have created an “asymmetry” where the speed of code output outpaces the maturity of processes for maintenance and su…

Overwhelm in Motherhood Part 1: The Invisible Weight We Carry

3M ago · 24:16

In this episode, we explore the hidden factors that contribute to overwhelm: chronic fatigue and sleep deprivation, the mental load of remembering and managing countless details, the role that estrogen plays, sensitivity to the world around us, and the unique challenges of high-needs children. We’ll also reflect on the realities of doing this work …

Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745

3M ago · 1:11:48

In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the application of formal mathematics and reasoning enables the creation of more robust and safer AI systems. A pioneer behind concepts like the Inception architecture and adversarial examples, Christian now focuses on autoformalization—the AI-driven process …

Creativity in Motherhood: Returning to the Root of Who You Are

3M ago · 54:02

In this episode, we explore what it means to stay connected to your creativity in the midst of motherhood. Whether you feel creatively blocked, disconnected from your sense of self, or you’re simply longing to feel inspired again, this conversation is here to remind you that creativity is not separate from mothering - it is a vital part of it. You’…

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

3M ago · 1:10:20

Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to Apple’s MLX ecosystem, having published over 1,000 models and libraries that make open, multimodal AI accessible and performant on …

Why Modern Motherhood Feels So Hard: Raising Children Without A Village and the Need for Cultural Midwifery

3M ago · 1:11:18

Why does motherhood so often feel overwhelming, lonely, and heavier than we ever imagined? In this opening episode of The Inner Work of Motherhood, we explore the roots of the modern maternal struggle: the loss of the village, the unrealistic expectations of the “Good Mother,” and the cultural pressures that leave women doing the work of many, ofte…

Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743

4M ago · 1:01:01

Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to discuss the recent release of Genie 3, a model capable of generating “playable” virtual worlds. We dig into the evolution of the Genie project and review the current model’s scaled-up capabilities, including creating real-time, interactive, and high-re…

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

4M ago · 1:01:11

In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She explains why aligning training and inference systems is essential for creating a seamless, fast-moving production pipeline, preventing…

Introducing Season 1: The Inner Work of Motherhood

4M ago · 8:01

This first season is a deep dive into the emotional, spiritual, and somatic terrain of mothering. The identity shifts, nervous system regulation (and all that disrupts it), the healing, grief, joy, rage, and bliss of this beautiful, wild journey that each mother must navigate in her own way. This is for the mothers who are called to do the deep wor…

Context Engineering for Productive AI Agents with Filip Kozera - #741

4M ago · 46:01

In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agentic workflows where natural language serves as the new programming interface. Filip breaks down the architecture of these "background agents," explaining how they use a reflection loop and tool-calling to execute complex tasks. He discusses the current…

Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740

4M ago · 1:13:02

In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound AI systems," which allows users to create powerful, efficient applications by composing multiple, often diverse, AI models and services. We discuss how these "networks of networks" can push the Pareto frontier, delivering results that are simultaneo…

Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739

5M ago · 1:13:02

In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture and challenges of building real-time, production-ready conversational voice AI. Kwin breaks down the full stack for voice agents—from the models and APIs to the critical orchestration layer that manages…

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

5M ago · 1:00:29

Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research, for an in-depth look at several of Qualcomm's accepted papers and demos featured at this year’s CVPR conference. We start with “DiMA: Distilling Multi-modal Large Language Models for Autonomous Driving,” an end-to-end autonomous driving system that incorpora…

Building the Internet of Agents with Vijoy Pandey - #737

5M ago · 56:13

Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco, to discuss a foundational challenge for the enterprise: how do we make specialized agents from different vendors collaborate effectively? As companies like Salesforce, Workday, and Microsoft all develop their own agentic systems, integrating them creates a complex, pr…

LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736

6M ago · 59:31

Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI in equities feature forecasting, covering how they identify and create features, collect and quantify historical data, and build predictive models to forecast market behavior and asset prices for tradin…

Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735

6M ago · 56:45

Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer vision. Jason introduces FiftyOne, an open-source platform for visualizing datasets, analyzing models, and improving data quality. We focus on Voxel51’s recent research report, “Zero-shot auto-labeling riv…

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734

6M ago · 1:25:21

Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss WeightWatcher, an open-source tool for analyzing and improving Deep Neural Networks (DNNs) based on principles from theoretical physics. We explore the foundations of the Heavy-Tailed Self-Regularization (HTSR) theory that underpins it, which combines random matri…

Google I/O 2025 Special Edition - #733

6M ago · 26:21

Today, I’m excited to share a special crossover edition of the podcast recorded live from Google I/O 2025! In this episode, I join Shawn Wang, aka Swyx, from the Latent Space Podcast to interview Logan Kilpatrick and Shrestha Basu Mallick, PMs at Google DeepMind working on AI Studio and the Gemini API, along with Kwindla Kramer, CEO of Daily and cre…

RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732

7M ago · 57:09

Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomberg, to discuss AI safety in retrieval-augmented generation (RAG) systems and generative AI in high-stakes domains like financial services. We explore how RAG, contrary to some expectations, can inadvertently degrade model safety. We cover examples o…

From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731

7M ago · 1:01:25

Today, we're joined by Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, to discuss how reinforcement learning (RL) is reshaping the way we build custom agents on top of foundation models. Mahesh highlights the crucial role of data curation, evaluation, and error analysis in model performance, and explains why RL offers a more robust altern…

How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730

7M ago · 1:07:27

Today, we're joined by Josh Tobin, member of technical staff at OpenAI, to discuss the company’s approach to building AI agents. We cover OpenAI's three agentic offerings—Deep Research for comprehensive web research, Operator for website navigation, and Codex CLI for local code execution. We explore OpenAI’s shift from simple LLM workflows to reaso…

CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729

7M ago · 56:18

Today, we're joined by Nidhi Rastogi, assistant professor at Rochester Institute of Technology, to discuss Cyber Threat Intelligence (CTI), focusing on her recent project CTIBench—a benchmark for evaluating LLMs on real-world CTI tasks. Nidhi explains the evolution of AI in cybersecurity, from rule-based systems to LLMs that accelerate analysis by p…

Generative Benchmarking with Kelly Hong - #728

7M ago · 54:17

In this episode, Kelly Hong, a researcher at Chroma, joins us to discuss "Generative Benchmarking," a novel approach to evaluating retrieval systems, like RAG applications, using synthetic data. Kelly explains how traditional benchmarks like MTEB fail to represent real-world query patterns and how embedding models that perform well on public benchm…

When The Earth Sings Again: Remembering The Ancient Feminine Origins of Easter

8M ago · 59:12

In this soul-stirring episode, we journey beyond the Easter we know today and into the ancient, feminine roots. Long before chocolate eggs and Sunday services, cultures around the world honored the Earth's reawakening in alignment with Goddess traditions - Eostre, Ishtar, Inanna, and hundreds of others - who embodied life itself, renewal, fertility…

Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727

8M ago · 1:34:06

In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model Computational Graphs" and "On the Biology of a Large Language Model." Emmanuel explains how his team developed mechanistic interpretability methods to understand the internal workings of Claude by rep…

Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726

8M ago · 51:45

Today, we're joined by Maohao Shen, PhD student at MIT, to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverages reinforcement learning to improve language model reasoning—enabling model self-reflection, self-correction, and exploration of a…

Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725

8M ago · 1:09:07

Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drago shares how Waymo is leveraging large-scale machine learning, including vision-language models and generative AI techniques, to improve perception, planning, and simulation for its self-driving vehicl…

Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724

8M ago · 50:32

Today, we're joined by Julie Kallini, PhD student at Stanford University, to discuss her recent papers, “MrT5: Dynamic Token Merging for Efficient Byte-level Language Models” and “Mission: Impossible Language Models.” For the MrT5 paper, we explore the importance and failings of tokenization in large language models—including inefficient compression…

Meditation: Closing The Winter Door

9M ago · 15:29

Welcome the Spring season and true New Year with this short guided meditation (pulled from our most recent Spring Equinox episode). We’ll take a few minutes to ground and settle, connect with our breath and sense of proprioception. Then we’ll turn our attention toward Winter. Are there any loose ends to tie up? Anything that arose during the season …

Spring Equinox: Stepping Through The Portal

9M ago · 47:46

As the wheel of the year turns, the Spring Equinox arrives as a powerful portal—inviting us to step out of winter’s reflection, dreaming, and fortification, and into early Spring’s integration, release, and ultimately, inspired action. In this episode, we explore the profound energetic shift of this sacred threshold. •Closing the door of Winter and…

Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723

9M ago · 58:38

Today, we're joined by Jonas Geiping, research group leader at the ELLIS Institute and the Max Planck Institute for Intelligent Systems, to discuss his recent paper, “Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach.” This paper proposes a novel language model architecture which uses recurrent depth to enable “thinking in l…

Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722

9M ago · 42:11

Today, we're joined by Chengzu Li, PhD student at the University of Cambridge, to discuss his recent paper, “Imagine while Reasoning in Space: Multimodal Visualization-of-Thought.” We explore the motivations behind MVoT, its connection to prior work like TopViewRS, and its relation to cognitive science principles such as dual coding theory. We dig i…

Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721

9M ago · 49:29

Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “s1: Simple Test-Time Scaling.” We explore the motivations behind s1, as well as how it compares to OpenAI's o1 and DeepSeek's R1 models. We dig into the different approaches to test-time scaling, including parallel and sequential scaling, as well…

Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720

9M ago · 1:07:05

Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the architectural differences between Trainium and GPUs, highlighting its systolic array-based compute design, and how it balances per…

π0: A Foundation Model for Robotics with Sergey Levine - #719

10M ago · 52:30

Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the ro…
