Higher Self is a nurturing podcast dedicated to guiding awakening souls on their journey toward embodying their higher selves. Each episode provides practical tools, transformative stories, and intuitive wisdom designed to support listeners in navigating the complexities of being both human beings and spiritual beings. Topics range from navigating an uncertain world and growing through challenges, to conscious mothering and holistic wellbeing. Hosted by Carina Devi—a teacher of mindfulness, ...
…
continue reading
1
The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)
Sam Charrington
Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, de ...
…
continue reading
1
The Mythical Journey of Mothers: Making Meaning on the Spiritual Path
1:17:24
1:17:24
Play later
Play later
Lists
Like
Liked
1:17:24In this final episode of the season, we explore motherhood as a profound spiritual and mythic journey - one that cracks us open, reshapes our identity, and calls us into deeper presence - not unlike the Hero’s Journey and ancient rites of passage. From the unraveling of identity to the depths of our shadows and the emergence of a wiser, more embodi…
…
continue reading
1
Scaling Agentic Inference Across Heterogeneous Compute with Zain Asgar - #757
48:44
48:44
Play later
Play later
Lists
Like
Liked
48:44In this episode, Zain Asgar, co-founder and CEO of Gimlet Labs, joins us to discuss the heterogeneous AI inference across diverse hardware. Zain argues that the current industry standard of running all AI workloads on high-end GPUs is unsustainable for agents, which consume significantly more tokens than traditional LLM applications. We explore Gim…
…
continue reading
1
Proactive Agents for the Web with Devi Parikh - #756
56:04
56:04
Play later
Play later
Lists
Like
Liked
56:04Today, we're joined by Devi Parikh, co-founder and co-CEO of Yutori, to discuss browser use models and a future where we interact with the web through proactive, autonomous agents. We explore the technical challenges of creating reliable web agents, the advantages of visually-grounded models that operate on screenshots rather than the browser’s mor…
…
continue reading
1
AI Orchestration for Smart Cities and the Enterprise with Robin Braun and Luke Norris - #755
54:46
54:46
Play later
Play later
Lists
Like
Liked
54:46Today, we're joined by Robin Braun, VP of AI business development for hybrid cloud at HPE, and Luke Norris, co-founder and CEO of Kamiwaza, to discuss how AI systems can be used to automate complex workflows and unlock value from legacy enterprise data. Robin and Luke detail high-impact use cases from HPE and Kamiwaza’s collaboration on an “Agentic…
…
continue reading
1
Building an AI Mathematician with Carina Hong - #754
55:52
55:52
Play later
Play later
Lists
Like
Liked
55:52In this episode, Carina Hong, founder and CEO of Axiom, joins us to discuss her work building an "AI Mathematician." Carina explains why this is a pivotal moment for AI in mathematics, citing a convergence of three key areas: the advanced reasoning capabilities of modern LLMs, the rise of formal proof languages like Lean, and breakthroughs in code …
…
continue reading
1
High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753
52:23
52:23
Play later
Play later
Lists
Like
Liked
52:23In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling proces…
…
continue reading
1
Overstimulated, Touched Out, On Edge: Tools for the Breaking Point and Beyond
1:34:28
1:34:28
Play later
Play later
Lists
Like
Liked
1:34:28In this episode of The Inner Work of Motherhood, we explore what happens when a mother’s nervous system becomes overloaded: the overstimulation, anger, irritability, and wanting to disappear. You’ll learn how sensory overwhelm shows up in the body, why anger and reactivity are actually protective responses, and how to gently guide your body and min…
…
continue reading
1
Vibe Coding's Uncanny Valley with Alexandre Pesant - #752
1:12:36
1:12:36
Play later
Play later
Lists
Like
Liked
1:12:36Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution and practice of vibe coding. Alex shares his take on how AI is enabling a shift in software development from typing characters to expressing intent, creating a new layer of abstraction similar to how high-level code compiles to machine code. We explor…
…
continue reading
1
Dataflow Computing for AI Inference with Kunle Olukotun - #751
57:37
57:37
Play later
Play later
Lists
Like
Liked
57:37In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match th…
…
continue reading
1
Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750
57:23
57:23
Play later
Play later
Lists
Like
Liked
57:23Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the…
…
continue reading
1
The Decentralized Future of Private AI with Illia Polosukhin - #749
1:05:03
1:05:03
Play later
Play later
Lists
Like
Liked
1:05:03In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve glo…
…
continue reading
1
Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748
1:03:39
1:03:39
Play later
Play later
Lists
Like
Liked
1:03:39Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose m…
…
continue reading
1
Reclaim the Magic of Motherhood with Presence and Mindfulness
41:07
41:07
Play later
Play later
Lists
Like
Liked
41:07Motherhood holds some of the brightest, most magical moments of our lives - ecstatic joy, oceanic love, and the kind of wonder that makes time stand still. But in the busyness of our day to day lives, it’s all too easy to lose sight of these moments. Between the endless tasks of feeding, cleaning, and planning, joy can feel like a distant memory ra…
…
continue reading
1
Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747
58:26
58:26
Play later
Play later
Lists
Like
Liked
58:26Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build more adaptable and creative models. We dig into her ICML 2025 Outstanding Paper Award winner, “Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction,” which ex…
…
continue reading
1
Overwhelm in Motherhood Part 2: How to Break Free from the Freeze Response
54:06
54:06
Play later
Play later
Lists
Like
Liked
54:06In Part 2 of Overwhelm In Motherhood, we explore the freeze response - the nervous system state where overwhelm lives - and that can leave mothers feeling shut down, numb, or paralyzed. While freeze can feel like a place of shame or failure, it’s also a clear message from our body - we just have to learn to listen to what she’s saying. This episode…
…
continue reading
1
Building an Immune System for AI Generated Software with Animesh Koratana - #746
1:05:11
1:05:11
Play later
Play later
Lists
Like
Liked
1:05:11Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s approach to making agentic and AI-assisted coding tools production-ready at scale. Animesh explains how rapid advances in AI-assisted coding have created an “asymmetry” where the speed of code output outpaces the maturity of processes for maintenance and su…
…
continue reading
1
Overwhelm in Motherhood Part 1: The Invisible Weight We Carry
24:16
24:16
Play later
Play later
Lists
Like
Liked
24:16In this episode, we explore the hidden factors that contribute to overwhelm: chronic fatigue and sleep deprivation, the mental load of remembering and managing countless details, the role that estrogen plays, sensitivity to the world around us, and the unique challenges of high needs children. We’ll also reflect on the realities of doing this work …
…
continue reading
1
Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745
1:11:48
1:11:48
Play later
Play later
Lists
Like
Liked
1:11:48In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the application of formal mathematics and reasoning enables the creation of more robust and safer AI systems. A pioneer behind concepts like the Inception architecture and adversarial examples, Christian now focuses on autoformalization—the AI-driven process …
…
continue reading
1
Creativity in Motherhood: Returning to the Root of Who You Are
54:02
54:02
Play later
Play later
Lists
Like
Liked
54:02In this episode, we explore what it means to stay connected to your creativity in the midst of motherhood. Whether you feel creatively blocked, disconnected from your sense of self, or you’re simply longing to feel inspired again, this conversation is here to remind you that creativity is not separate from mothering - it is a vital part of it. You’…
…
continue reading
1
Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744
1:10:20
1:10:20
Play later
Play later
Lists
Like
Liked
1:10:20Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to Apple’s MLX ecosystem, having published over 1,000 models and libraries that make open, multimodal AI accessible and performant on …
…
continue reading
1
Why Modern Motherhood Feels So Hard: Raising Children Without A Village and the Need for Cultural Midwifery
1:11:18
1:11:18
Play later
Play later
Lists
Like
Liked
1:11:18Why does motherhood so often feel overwhelming, lonely, and heavier than we ever imagined? In this opening episode of The Inner Work of Motherhood, we explore the roots of the modern maternal struggle: the loss of the village, the unrealistic expectations of the “Good Mother,” and the cultural pressures that leave women doing the work of many, ofte…
…
continue reading
1
Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743
1:01:01
1:01:01
Play later
Play later
Lists
Like
Liked
1:01:01Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to discuss the recent release of Genie 3, a model capable of generating “playable” virtual worlds. We dig into the evolution of the Genie project and review the current model’s scaled-up capabilities, including creating real-time, interactive, and high-re…
…
continue reading
1
Closing the Loop Between AI Training and Inference with Lin Qiao - #742
1:01:11
1:01:11
Play later
Play later
Lists
Like
Liked
1:01:11In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She explains why aligning training and inference systems is essential for creating a seamless, fast-moving production pipeline, preventing…
…
continue reading
1
Introducing Season 1: The Inner Work of Motherhood
8:01
8:01
Play later
Play later
Lists
Like
Liked
8:01This first season is a deep dive into the emotional, spiritual, and somatic terrain of mothering. The identity shifts, nervous system regulation (and all that disrupts it), the healing, grief, joy, rage, and bliss of this beautiful, wild journey that each mother must navigate in her own way. This is for the mothers who are called to do the deep wor…
…
continue reading
1
Context Engineering for Productive AI Agents with Filip Kozera - #741
46:01
46:01
Play later
Play later
Lists
Like
Liked
46:01In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agentic workflows where natural language serves as the new programming interface. Filip breaks down the architecture of these "background agents," explaining how they use a reflection loop and tool-calling to execute complex tasks. He discusses the current…
…
continue reading
1
Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740
1:13:02
1:13:02
Play later
Play later
Lists
Like
Liked
1:13:02In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound AI systems," which allows users to create powerful, efficient applications by composing multiple, often diverse, AI models and services. We discuss how these "networks of networks" can push the Pareto frontier, delivering results that are simultaneo…
…
continue reading
1
Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739
1:13:02
1:13:02
Play later
Play later
Lists
Like
Liked
1:13:02In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture and challenges of building real-time, production-ready conversational voice AI. Kwin breaks down the full stack for voice agents—from the models and APIs to the critical orchestration layer that manages…
…
continue reading
1
Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738
1:00:29
1:00:29
Play later
Play later
Lists
Like
Liked
1:00:29Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research for an in-depth look at several of Qualcomm's accepted papers and demos featured at this year’s CVPR conference. We start with “DiMA: Distilling Multi-modal Large Language Models for Autonomous Driving,” an end-to-end autonomous driving system that incorpora…
…
continue reading
1
Building the Internet of Agents with Vijoy Pandey - #737
56:13
56:13
Play later
Play later
Lists
Like
Liked
56:13Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco to discuss a foundational challenge for the enterprise: how do we make specialized agents from different vendors collaborate effectively? As companies like Salesforce, Workday, and Microsoft all develop their own agentic systems, integrating them creates a complex, pr…
…
continue reading
1
LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736
59:31
59:31
Play later
Play later
Lists
Like
Liked
59:31Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI in equities feature forecasting, covering how they identify and create features, collect and quantify historical data, and build predictive models to forecast market behavior and asset prices for tradin…
…
continue reading
1
Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735
56:45
56:45
Play later
Play later
Lists
Like
Liked
56:45Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer vision. Jason introduces FiftyOne, an open-source platform for visualizing datasets, analyzing models, and improving data quality. We focus on Voxel51’s recent research report, “Zero-shot auto-labeling riv…
…
continue reading
1
Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734
1:25:21
1:25:21
Play later
Play later
Lists
Like
Liked
1:25:21Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss Weight Watcher, an open-source tool for analyzing and improving Deep Neural Networks (DNNs) based on principles from theoretical physics. We explore the foundations of the Heavy-Tailed Self-Regularization (HTSR) theory that underpins it, which combines random matri…
…
continue reading
Today, I’m excited to share a special crossover edition of the podcast recorded live from Google I/O 2025! In this episode, I join Shawn Wang aka Swyx from the Latent Space Podcast, to interview Logan Kilpatrick and Shrestha Basu Mallick, PMs at Google DeepMind working on AI Studio and the Gemini API, along with Kwindla Kramer, CEO of Daily and cre…
…
continue reading
1
RAG Risks: Why Retrieval-Augmented LLMs are Not Safer with Sebastian Gehrmann - #732
57:09
57:09
Play later
Play later
Lists
Like
Liked
57:09Today, we're joined by Sebastian Gehrmann, head of responsible AI in the Office of the CTO at Bloomberg, to discuss AI safety in retrieval-augmented generation (RAG) systems and generative AI in high-stakes domains like financial services. We explore how RAG, contrary to some expectations, can inadvertently degrade model safety. We cover examples o…
…
continue reading
1
From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731
1:01:25
1:01:25
Play later
Play later
Lists
Like
Liked
1:01:25Today, we're joined by Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, to discuss how reinforcement learning (RL) is reshaping the way we build custom agents on top of foundation models. Mahesh highlights the crucial role of data curation, evaluation, and error analysis in model performance, and explains why RL offers a more robust altern…
…
continue reading
1
How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730
1:07:27
1:07:27
Play later
Play later
Lists
Like
Liked
1:07:27Today, we're joined by Josh Tobin, member of technical staff at OpenAI, to discuss the company’s approach to building AI agents. We cover OpenAI's three agentic offerings—Deep Research for comprehensive web research, Operator for website navigation, and Codex CLI for local code execution. We explore OpenAI’s shift from simple LLM workflows to reaso…
…
continue reading
1
CTIBench: Evaluating LLMs in Cyber Threat Intelligence with Nidhi Rastogi - #729
56:18
56:18
Play later
Play later
Lists
Like
Liked
56:18Today, we're joined by Nidhi Rastogi, assistant professor at Rochester Institute of Technology to discuss Cyber Threat Intelligence (CTI), focusing on her recent project CTIBench—a benchmark for evaluating LLMs on real-world CTI tasks. Nidhi explains the evolution of AI in cybersecurity, from rule-based systems to LLMs that accelerate analysis by p…
…
continue reading
1
Generative Benchmarking with Kelly Hong - #728
54:17
54:17
Play later
Play later
Lists
Like
Liked
54:17In this episode, Kelly Hong, a researcher at Chroma, joins us to discuss "Generative Benchmarking," a novel approach to evaluating retrieval systems, like RAG applications, using synthetic data. Kelly explains how traditional benchmarks like MTEB fail to represent real-world query patterns and how embedding models that perform well on public benchm…
…
continue reading
1
When The Earth Sings Again: Remembering The Ancient Feminine Origins of Easter
59:12
59:12
Play later
Play later
Lists
Like
Liked
59:12In this soul-stirring episode, we journey beyond the Easter we know today and into the ancient, feminine roots. Long before chocolate eggs and Sunday services, cultures around the world honored the Earth's reawakening in alignment with Goddess traditions - Eostre, Ishtar, Inanna, and hundreds of others - who embodied life itself, renewal, fertility…
…
continue reading
1
Exploring the Biology of LLMs with Circuit Tracing with Emmanuel Ameisen - #727
1:34:06
1:34:06
Play later
Play later
Lists
Like
Liked
1:34:06In this episode, Emmanuel Ameisen, a research engineer at Anthropic, returns to discuss two recent papers: "Circuit Tracing: Revealing Language Model Computational Graphs" and "On the Biology of a Large Language Model." Emmanuel explains how his team developed mechanistic interpretability methods to understand the internal workings of Claude by rep…
…
continue reading
1
Teaching LLMs to Self-Reflect with Reinforcement Learning with Maohao Shen - #726
51:45
51:45
Play later
Play later
Lists
Like
Liked
51:45Today, we're joined by Maohao Shen, PhD student at MIT to discuss his paper, “Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search.” We dig into how Satori leverages reinforcement learning to improve language model reasoning—enabling model self-reflection, self-correction, and exploration of a…
…
continue reading
1
Waymo's Foundation Model for Autonomous Driving with Drago Anguelov - #725
1:09:07
1:09:07
Play later
Play later
Lists
Like
Liked
1:09:07Today, we're joined by Drago Anguelov, head of AI foundations at Waymo, for a deep dive into the role of foundation models in autonomous driving. Drago shares how Waymo is leveraging large-scale machine learning, including vision-language models and generative AI techniques to improve perception, planning, and simulation for its self-driving vehicl…
…
continue reading
1
Dynamic Token Merging for Efficient Byte-level Language Models with Julie Kallini - #724
50:32
50:32
Play later
Play later
Lists
Like
Liked
50:32Today, we're joined by Julie Kallini, PhD student at Stanford University to discuss her recent papers, “MrT5: Dynamic Token Merging for Efficient Byte-level Language Models” and “Mission: Impossible Language Models.” For the MrT5 paper, we explore the importance and failings of tokenization in large language models—including inefficient compression…
…
continue reading
Welcome the Spring season and true New Year with this short guided meditation (pulled from our most recent Spring Equinox episode). We’ll take a few minutes to ground and settle, connect without breath and sense of proprioception. Then we’ll turn our attention toward Winter. Are there any loose ends to tie up? Anything that arose during the season …
…
continue reading
1
Spring Equinox: Stepping Through The Portal
47:46
47:46
Play later
Play later
Lists
Like
Liked
47:46As the wheel of the year turns, the Spring Equinox arrives as a powerful portal—inviting us to step out of winter’s reflection, dreaming, and fortification, and into early Spring’s integration, release, and ultimately, inspired action. In this episode, we explore the profound energetic shift of this sacred threshold. •Closing the door of Winter and…
…
continue reading
1
Scaling Up Test-Time Compute with Latent Reasoning with Jonas Geiping - #723
58:38
58:38
Play later
Play later
Lists
Like
Liked
58:38Today, we're joined by Jonas Geiping, research group leader at Ellis Institute and the Max Planck Institute for Intelligent Systems to discuss his recent paper, “Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach.” This paper proposes a novel language model architecture which uses recurrent depth to enable “thinking in l…
…
continue reading
1
Imagine while Reasoning in Space: Multimodal Visualization-of-Thought with Chengzu Li - #722
42:11
42:11
Play later
Play later
Lists
Like
Liked
42:11Today, we're joined by Chengzu Li, PhD student at the University of Cambridge to discuss his recent paper, “Imagine while Reasoning in Space: Multimodal Visualization-of-Thought.” We explore the motivations behind MVoT, its connection to prior work like TopViewRS, and its relation to cognitive science principles such as dual coding theory. We dig i…
…
continue reading
1
Inside s1: An o1-Style Reasoning Model That Cost Under $50 to Train with Niklas Muennighoff - #721
49:29
49:29
Play later
Play later
Lists
Like
Liked
49:29Today, we're joined by Niklas Muennighoff, a PhD student at Stanford University, to discuss his paper, “S1: Simple Test-Time Scaling.” We explore the motivations behind S1, as well as how it compares to OpenAI's O1 and DeepSeek's R1 models. We dig into the different approaches to test-time scaling, including parallel and sequential scaling, as well…
…
continue reading
1
Accelerating AI Training and Inference with AWS Trainium2 with Ron Diamant - #720
1:07:05
1:07:05
Play later
Play later
Lists
Like
Liked
1:07:05Today, we're joined by Ron Diamant, chief architect for Trainium at Amazon Web Services, to discuss hardware acceleration for generative AI and the design and role of the recently released Trainium2 chip. We explore the architectural differences between Trainium and GPUs, highlighting its systolic array-based compute design, and how it balances per…
…
continue reading
1
π0: A Foundation Model for Robotics with Sergey Levine - #719
52:30
52:30
Play later
Play later
Lists
Like
Liked
52:30Today, we're joined by Sergey Levine, associate professor at UC Berkeley and co-founder of Physical Intelligence, to discuss π0 (pi-zero), a general-purpose robotic foundation model. We dig into the model architecture, which pairs a vision language model (VLM) with a diffusion-based action expert, and the model training "recipe," emphasizing the ro…
…
continue reading