Podcast Banner

Podcasts

Paul, Weiss Waking Up With AI

Agents Provocateurs: Secrets, Silence, and the Social Issues of AI

In this episode, Katherine Forrest and Scott Caravello dive into new research exploring what happens when helpful AI agents end up helping the wrong people. Our hosts break down two recent papers and discuss what each may mean for the future: "Agents of Chaos," which examines security vulnerabilities and unexpected behaviors in AI agents under social pressure, and "H-Neurons," which presents groundbreaking findings on specific neurons correlated with hallucinations in large language models.

Stream here or subscribe on your
preferred podcast app:

Episode Transcript

Katherine Forrest: Hey folks, welcome to another episode of Paul Weiss Waking Up with AI. I'm Katherine Forrest.

Scott Caravello: And I'm Scott Caravello. Katherine, I am going to start with my favorite introduction question: Where were you in the world this week?

Katherine Forrest: Wait, no, no, no, Scott, that's my favorite introduction question because we know that you go to like New Orleans, you go to Hawaii, you know.

Scott Caravello: I'm a domestic traveler though, it's still pretty tame.

Katherine Forrest: Yeah, well, OK, so, this week, just this week, on Monday, I flew to Korea. I got there Tuesday night. Let's just put aside the fact that the days are like different, right? Wednesday, I had meetings in Korea and Thursday morning I got up and I flew back.

Scott Caravello: Oh my gosh…

Katherine Forrest: So, I am back in the saddle in the United States and I have no idea what's up or what's down. And I'm actually sitting in the office because I had so many meetings today. So, and, by the way, can I just say you, one of my favorite things about Korea, and I may have said this already on an episode a year ago, because I also had gone to Korea a year ago for a different purpose, but I love having pork dumplings for breakfast.