Episodes

EP21: Privacy in the Age of Agents with Niloofar Mireshghallah

Jan 7 2026

Guest: Niloofar Mireshghallah (Incoming Assistant Professor at CMU, Member of Technical Staff at Humans and AI)
In this episode, we dive into AI privacy, frontier model capabilities, and why academia still matters.
We kick off by discussing GPT-5.2 and whether models rely more on parametric knowledge or context. Niloofar shares how reasoning models actually defer to context, even accepting obviously false information to "roll with it."
On privacy, Niloofar challenges conventional wisdom: memorization isn't the problem anymore. The real threats are aggregation attacks (finding someone's pet name in HTML metadata), inference attacks (models are expert geoguessers), and input-output leakage in agentic workflows.
We also explore linguistic colonialism in AI, or how models fail for non-English languages, sometimes inventing cultural traditions.
The episode wraps with a call for researchers to tackle problems industry ignores: AI for science, education tools that preserve the struggle of learning, and privacy-preserving collaboration between small local models and large commercial ones.
Timeline
[0:00] Intro
[1:03] GPT-5.2 first impressions and skepticism about the data cutoff claims
[4:17] Parametric vs. context memory—when do models trust training vs. the prompt?
[9:28] The messy problem of memory, weights, and online learning
[16:12] Tool use changes model behavior in unexpected ways
[17:15] OpenAI's "Advances in Sciences" paper and human-AI collaboration
[24:17] Why deep research is getting less useful
[28:17] Pre-training vs. post-training—which matters more?
[30:35] Non-English languages and AI failures
[33:23] Hilarious Farsi bugs: "I'll get back to you in a few days" and invented traditions
[37:56] Linguistic colonialism—ChatGPT changed how we write
[41:20] Why memorization isn't the real privacy threat
[47:14] The three actual privacy problems: inference, aggregation, input-output leakage
[54:33] Deep research stalking experiment—finding a cat's name in HTML
[1:01:13] Privacy solutions for agentic systems
[1:03:23] What Niloofar's excited about: AI for scientists, small models, niche problems
[1:08:31] AI for education without killing the learning process
[1:09:15] Closing: underrated life advice on health and sustainable habits
Music:
"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
About
The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.

Show More Show Less

1 hr and 12 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP20: Yann LeCun

Dec 15 2025

Yann LeCun – Why LLMs Will Never Get Us to AGI
"The path to superintelligence - just train up the LLMs, train on more synthetic data, hire thousands of people to school your system in post-training, invent new tweaks on RL-I think is complete bullshit. It's just never going to work."
After 12 years at Meta, Turing Award winner Yann LeCun is betting his legacy on a radically different vision of AI. In this conversation, he explains why Silicon Valley's obsession with scaling language models is a dead end, why the hardest problem in AI is reaching dog-level intelligence (not human-level), and why his new company AMI is building world models that predict in abstract representation space rather than generating pixels.
Timestamps
(00:00:14) – Intro and welcome
(00:01:12) – AMI: Why start a company now?
(00:04:46) – Will AMI do research in the open?
(00:06:44) – World models vs LLMs
(00:09:44) – History of self-supervised learning
(00:16:55) – Siamese networks and contrastive learning
(00:25:14) – JEPA and learning in representation space
(00:30:14) – Abstraction hierarchies in physics and AI
(00:34:01) – World models as abstract simulators
(00:38:14) – Object permanence and learning basic physics
(00:40:35) – Game AI: Why NetHack is still impossible
(00:44:22) – Moravec's Paradox and chess
(00:55:14) – AI safety by construction, not fine-tuning
(01:02:52) – Constrained generation techniques
(01:04:20) – Meta's reorganization and FAIR's future
(01:07:31) – SSI, Physical Intelligence, and Wayve
(01:10:14) – Silicon Valley's "LLM-pilled" monoculture
(01:15:56) – China vs US: The open source paradox
(01:18:14) – Why start a company at 65?
(01:25:14) – The AGI hype cycle has happened 6 times before
(01:33:18) – Family and personal background
(01:36:13) – Career advice: Learn things with a long shelf life
(01:40:14) – Neuroscience and machine learning connections
(01:48:17) – Continual learning: Is catastrophic forgetting solved?
Music:
"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
About
The Information Bottleneck is hosted by Ravid Shwartz-Ziv and Allen Roush, featuring in-depth conversations with leading AI researchers about the ideas shaping the future of machine learning.

Show More Show Less

1 hr and 50 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP19: AI in Finance and Symbolic AI with Atlas Wang

Dec 10 2025
Atlas Wang (UT Austin faculty, XTX Research Director) joins us to explore two fascinating frontiers: the foundations of symbolic AI and the practical challenges of building AI systems for quantitative finance.
On the symbolic AI side, Atlas shares his recent work proving that neural networks can learn symbolic equations through gradient descent, a surprising result given that gradient descent is continuous while symbolic structures are discrete. We talked about why neural nets learn clean, compositional mathematical structures at all, what the mathematical tools involved are, and the broader implications for understanding reasoning in AI systems.
The conversation then turns to neuro-symbolic approaches in practice: agents that discover rules through continued learning, propose them symbolically, verify them against domain-specific checkers, and refine their understanding.
On the finance side, Atlas pulls back the curtain on what AI research looks like at a high-frequency trading firm. The core problem sounds simple (predict future prices from past data). Still, the challenge is extreme: markets are dominated by noise, predictions hover near zero correlation, and success means eking out tiny margins across astronomical numbers of trades. He explains why synthetic data techniques that work elsewhere don't translate easily, and why XTX is building time series foundation models rather than adapting language models.
We also discuss the convergence of hiring between frontier AI labs and quantitative finance, and why this is an exceptional moment for ML researchers to consider the finance industry.
Links:
Why Neural Network Can Discover Symbolic Structures with Gradient-based Training: An Algebraic and Geometric Foundation for Neurosymbolic Reasoning - arxiv.org/abs/2506.21797
Atlas website - https://www.vita-group.space/
Guest: Atlas Wang (UT Austin / XTX)
Hosts: Ravid Shwartz-Ziv & Allen Roush
Music: “Kid Kodi” — Blue Dot Sessions. Source: Free Music Archive. Licensed CC BY-NC 4.0.
Show More Show Less
1 hr and 11 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP18: AI Robotics

Dec 1 2025
In this episode, we hosted Judah Goldfeder, a PhD candidate at Columbia University and student researcher at Google, to discuss robotics, reproducibility in ML, and smart buildings.
Key topics covered:
Robotics challenges: We discussed why robotics remains harder than many expected, compared to LLMs. The real world is unpredictable and unforgiving, and mistakes have physical consequences. Sim-to-real transfer remains a major bottleneck because simulators are tedious to configure accurately for each robot and environment. Unlike text, robotics lacks foundation models, partly due to limited clean, annotated datasets and the difficulty of collecting diverse real-world data.
Reproducibility crisis: We discussed how self-reported benchmarks can lead to p-hacking and irreproducible results. Centralized evaluation systems (such as Kaggle or ImageNet challenges), where researchers submit algorithms for testing on hidden test sets, seem to drive faster progress.
Smart buildings: Judah's work at Google focuses on using ML to optimize HVAC systems, potentially reducing energy costs and carbon emissions significantly. The challenge is that every building is different. It makes the simulation configuration extremely labor-intensive. Generative AI could help by automating the process of converting floor plans or images into accurate building simulations.
Links:
Judah website - https://judahgoldfeder.com/
Music:
"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
Show More Show Less
1 hr and 45 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP17: RL with Will Brown

Nov 24 2025

In this episode, we talk with Will Brown, a research lead at Prime Intellect, about his journey into reinforcement learning (RL) and multi-agent systems, exploring their theoretical foundations and practical applications. We discuss the importance of RL in the current LLMs pipeline and the challenges it faces. We also discuss applying agentic workflows to real-world applications and the ongoing evolution of AI development.
Chapters
00:00 Introduction to Reinforcement Learning and Will's Journey
03:10 Theoretical Foundations of Multi-Agent Systems
06:09 Transitioning from Theory to Practical Applications
09:01 The Role of Game Theory in AI
11:55 Exploring the Complexity of Games and AI
14:56 Optimization Techniques in Reinforcement Learning
17:58 The Evolution of RL in LLMs
21:04 Challenges and Opportunities in RL for LLMs
23:56 Key Components for Successful RL Implementation
27:00 Future Directions in Reinforcement Learning
36:29 Exploring Agentic Reinforcement Learning Paradigms
38:45 The Role of Intermediate Results in RL
41:16 Multi-Agent Systems: Challenges and Opportunities
45:08 Distributed Environments and Decentralized RL
49:31 Prompt Optimization Techniques in RL
52:25 Statistical Rigor in Evaluations
55:49 Future Directions in Reinforcement Learning
59:50 Task-Specific Models vs. General Models
01:02:04 Insights on Random Verifiers and Learning Dynamics
01:04:39 Real-World Applications of RL and Evaluation Challenges
01:05:58 Prime RL Framework: Goals and Trade-offs
01:10:38 Open Source vs. Closed Source Models
01:13:08 Continuous Learning and Knowledge Improvement
Music:
"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed

Show More Show Less

1 hr and 6 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP16: AI News and Papers

Nov 17 2025
In this episode, we discuss various topics in AI, including the challenges of the conference review process, the capabilities of Kimi K2 thinking, the advancements in TPU technology, the significance of real-world data in robotics, and recent innovations in AI research. We also talk about the cool "Chain of Thought Hijacking" paper, how to use simple ideas to scale RL, and the implications of the Cosmos project, which aims to enable autonomous scientific discovery through AI.
Papers and links:
Chain-of-Thought Hijacking - https://arxiv.org/pdf/2510.26418
Kosmos: An AI Scientist for Autonomous Discovery - https://t.co/9pCr6AUXAe
JustRL: Scaling a 1.5B LLM with a Simple RL Recipe - https://relieved-cafe-fe1.notion.site/JustRL-Scaling-a-1-5B-LLM-with-a-Simple-RL-Recipe-24f6198b0b6b80e48e74f519bfdaf0a8
Chapters
00:00 Navigating the Peer Review Process
04:17 Kimi K2 Thinking: A New Era in AI
12:27 The Future of Tool Calls in AI
17:12 Exploring Google's New TPUs
22:04 The Importance of Real-World Data in Robotics
28:10 World Models: The Next Frontier in AI
31:36 Nvidia's Dominance in AI Partnerships
32:08 Exploring Recent AI Research Papers
37:46 Chain of Thought Hijacking: A New Threat
43:05 Simplifying Reinforcement Learning Training
54:03 Cosmos: AI for Autonomous Scientific Discovery
Music:
"Kid Kodi" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
"Palms Down" — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
Show More Show Less
59 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP15: The Information Bottleneck and Scaling Laws with Alex Alemi

Nov 13 2025
In this episode, we sit down with Alex Alemi, an AI researcher at Anthropic (previously at Google Brain and Disney), to explore the powerful framework of the information bottleneck and its profound implications for modern machine learning.
We break down what the information bottleneck really means, a principled approach to retaining only the most informative parts of data while compressing away the irrelevant. We discuss why compression is still important in our era of big data, how it prevents overfitting, and why it's essential for building models that generalize well.
We also dive into scaling laws: why they matter, what we can learn from them, and what they tell us about the future of AI research.
Papers and links:
Alex's website - https://www.alexalemi.com/
Scaling exponents across parameterizations and optimizers - https://arxiv.org/abs/2407.05872
Deep Variational Information Bottleneck - https://arxiv.org/abs/1612.00410
Layer by Layer: Uncovering Hidden Representations in Language Models - https://arxiv.org/abs/2502.02013
Information in Infinite Ensembles of Infinitely-Wide Neural Networks - https://proceedings.mlr.press/v118/shwartz-ziv20a.html
Music:
“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
Show More Show Less
1 hr and 23 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free
EP14: AI News and Papers

Nov 10 2025
In this episode, we talked about AI news and recent papers. We explored the complexities of using AI models in healthcare (the Nature Medicine paper on GPT-5's fragile intelligence in medical contexts). We discussed the delicate balance between leveraging LLMs as powerful research tools and the risks of over-reliance, touching on issues such as hallucinations, medical disagreements among practitioners, and the need for better education on responsible AI use in healthcare.
We also talked about Stanford's "Cartridges" paper, which presents an innovative approach to long-context language models. The paper tackles the expensive computational costs of billion-token context windows by compressing KV caches through a clever "self-study" method using synthetic question-answer pairs and context distillation. We discussed the implications for personalization, composability, and making long-context models more practical.
Additionally, we explored the "Continuous Autoregressive Language Models" paper and touched on insights from the Smol Training Playbook.
Papers discussed:
The fragile intelligence of GPT-5 in medicine: https://www.nature.com/articles/s41591-025-04008-8
Cartridges: Lightweight and general-purpose long context representations via self-study: https://arxiv.org/abs/2506.06266
Continuous Autoregressive Language Models: https://arxiv.org/abs/2510.27688
The Smol Training Playbook: https://huggingface.co/spaces/HuggingFaceTB/smol-training-playbook
Music:
“Kid Kodi” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
“Palms Down” — Blue Dot Sessions — via Free Music Archive — CC BY-NC 4.0.
Changes: trimmed
This is an experimental format for us, just news and papers without a guest interview. Let us know what you think!
Show More Show Less
57 mins

Failed to add items

Sorry, we are unable to add the item because your shopping basket is already at capacity.

Add to cart failed.

Please try again later

Add to wishlist failed.

Please try again later

Remove from wishlist failed.

Please try again later

Follow podcast failed

Unfollow podcast failed

Listen for free

Episodes

EP21: Privacy in the Age of Agents with Niloofar Mireshghallah

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP20: Yann LeCun

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP19: AI in Finance and Symbolic AI with Atlas Wang

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP18: AI Robotics

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP17: RL with Will Brown

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP16: AI News and Papers

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP15: The Information Bottleneck and Scaling Laws with Alex Alemi

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed

EP14: AI News and Papers

Failed to add items

Add to cart failed.

Add to wishlist failed.

Remove from wishlist failed.

Follow podcast failed

Unfollow podcast failed