Secure LLMs & Prompt Defense 2025

Keynote: Opening Keynote Track: Keynotes Room: Main Hall Sep 15, 2025 9:00 AM – 9:45 AM Featured

Keynote: Jailbreaks & Defenses: Lessons from LLM Red-Teaming

Isolation patterns, layered filters, continuous evaluation.

Focus on jailbreak defense and eval harnesses.

Twitter/X

Keynote: Day 2 Keynote Track: Keynotes Room: Main Hall Sep 16, 2025 9:00 AM – 9:45 AM Featured

Keynote: Beyond Prompt Injection: Supply-Chain Threats

Weights, datasets, tool plugs and provenance verification.

Security of language models and aligned systems.

Session_chair: Track A Chair Track: Track A Room: Room A Featured

Chair: Session

Runs incident response and abuse prevention for LLM apps.

Twitter/X

Session_chair: Track B Chair Track: Track B Room: Room B Featured

Chair: Session

Designs adversarial evals for LLM safety.

Twitter/X

Workshop_instructor: Hands-on Workshop Track: Workshops Room: Room W1 Sep 16, 2025 10:30 AM – 12:00 PM Featured

Workshop: Writing Effective Guardrail Policies

Templates, exception queues, and staged rollout.

Policy rollouts and exception handling.

Speaker Track: Track A Room: Room A Sep 15, 2025 11:00 AM – 11:30 AM

Talk: Multi-Agent Attack Surfaces

Coordination exploits and role confusion attacks.

Red-teaming pipelines for multi-agent systems.

Twitter/X

Speaker Track: Track A Room: Room A Sep 15, 2025 11:35 AM – 12:05 PM

Talk: Tool Use Isolation: Sandboxes that Scale

Syscall filtering, broker patterns, and escape prevention.

Runtime isolation and sandbox patterns for LLM tools.

Twitter/X

Speaker Track: Track B Room: Room B Sep 15, 2025 2:00 PM – 2:30 PM

Talk: Provenance-First Training Data

Hash chains, attestations, and human-in-the-loop QA.

Dataset lineage and tamper-evident curation.

Speaker Track: Track B Room: Room B Sep 15, 2025 2:35 PM – 3:05 PM

Talk: A Practical Taxonomy of Jailbreaks

Injection family tree with concrete mitigations.

Alignment evals and jailbreak taxonomy.

Panelist: Panelist Track: Panels Room: Room P Sep 15, 2025 4:00 PM – 4:45 PM

Panel: Incident Postmortems that Actually Fix Things

Runbooks, decision logs, and accountability loops.

Runs post-mortems for LLM incidents.

Twitter/X

Moderator: Panel Moderator Track: Panels Room: Room P Sep 15, 2025 4:00 PM – 4:45 PM

Panel: Incident Postmortems that Actually Fix Things

Moderating panel on disciplined remediation.

Moderator for security/AI panels.

Speaker Track: Track C Room: Room C Sep 16, 2025 1:30 PM – 2:00 PM

Talk: From Static Benchmarks to Continuous Evals

Shift from one-off scores to live signals.

Builds eval harnesses and dashboards.

Twitter/X

Speaker Track: Track C Room: Room C Sep 16, 2025 2:05 PM – 2:35 PM

Talk: Risk Registers that Matter

Severity scales and near-miss logging for AI incidents.

Risk registers and governance boards.

Organizer: Program Operations

Ops: Session

Program operations and speaker success.

Organizer: Logistics Lead

Ops: Session

Venue logistics and A/V.

Judge: Poster Awards Judge Sep 16, 2025 3:30 PM – 4:30 PM

Judging: Session

External evaluator for poster awards.

Speaker Track: Track A Room: Room A Sep 16, 2025 3:00 PM – 3:30 PM

Talk: Broker Patterns for Toolformer-Style Agents

Decoupling tools with capability bounding.

Interfaces for safe tool use.

Panelist: Panelist Track: Panels Room: Room P Sep 16, 2025 4:45 PM – 5:30 PM

Panel: Measuring Safety: Where Benchmarks Fail

Panel on eval blind spots and proxy metrics.

Safety taxonomies and test corpora.

Moderator: Panel Moderator Track: Panels Room: Room P Sep 16, 2025 4:45 PM – 5:30 PM

Panel: Measuring Safety: Where Benchmarks Fail

Moderates panel discussion on evals.

Moderator for policy/safety panels.

Workshop_instructor: Hands-on Lab Track: Workshops Room: Room W2 Sep 15, 2025 1:30 PM – 3:00 PM

Workshop: Implementing Policy as Code

OPA examples and deployment pipelines.

Policy templates and rollout strategies.

Secure LLMs & Prompt Defense 2025

Event Details

Participants

Dr. Dr. Nina Volkov 🇺🇸

Keynote: Jailbreaks & Defenses: Lessons from LLM Red-Teaming

Prof. Prof. Mateo Alvarez 🇺🇸

Keynote: Beyond Prompt Injection: Supply-Chain Threats

Ava Jensen 🇺🇸

Chair: Session

Dr. Dr. Raghav Raman 🇺🇸

Chair: Session

Noah Patel 🇺🇸

Workshop: Writing Effective Guardrail Policies

Eleanor Park 🇺🇸

Talk: Multi-Agent Attack Surfaces

Yusuf Karim 🇺🇸

Talk: Tool Use Isolation: Sandboxes that Scale

Mira Kowalski 🇵🇱

Talk: Provenance-First Training Data

Dr. Dr. Wei Chen 🇸🇬

Talk: A Practical Taxonomy of Jailbreaks

Sara M�ller 🇩🇪

Panel: Incident Postmortems that Actually Fix Things

Jacob Reed 🇺🇸

Panel: Incident Postmortems that Actually Fix Things

Isabella Rossi 🇮🇹

Talk: From Static Benchmarks to Continuous Evals

Dr. Dr. Omar Saad 🇦🇪

Talk: Risk Registers that Matter

Emily Ward 🇺🇸

Ops: Session

Liam O'Connor 🇺🇸

Ops: Session

Judith Park 🇺🇸

Judging: Session

Rafael Santos 🇧🇷

Talk: Broker Patterns for Toolformer-Style Agents

Chen Li 🇨🇳

Panel: Measuring Safety: Where Benchmarks Fail

Sofia Dimitriou 🇬🇷

Panel: Measuring Safety: Where Benchmarks Fail

Diego Herrera 🇲🇽

Workshop: Implementing Policy as Code

Professional Impact & Significance