AIBlade Podcast

Cutting Edge AI Security

David Willis-Owen

21 episodesEN

Show overview

AIBlade Podcast launched in 2024 and has put out 21 episodes in the time since. That works out to roughly 4 hours of audio in total. Releases follow a fortnightly cadence.

Episodes typically run ten to twenty minutes — most land between 10 min and 14 min — though episode length varies meaningfully from one episode to the next. None of the episodes are flagged explicit by the publisher. It is catalogued as a EN-language Technology show.

The catalogue appears to be on hiatus or wound down — the most recent episode landed 1.4 years ago, with no new episodes in over a year. The busiest year was 2024, with 16 episodes published. Published by David Willis-Owen.

Episodes

Running

2024–2025 · 1y

Median length

11 min

Cadence

Fortnightly

From the publisher

Bringing you cutting edge AI Security research www.aiblade.net

Latest Episodes

View all 21 episodes

Jailbreaking Grok 3 | DeepSeek, ChatGPT, Claude & More

Article - https://www.aiblade.net/p/4030b68a-2ab6-452e-9a67-530f91a801f9 Notion Free Trial - https://affiliate.notion.so/pqesm7yjddbc AI Jailbreaking has been around since the dawn of consumer-grade LLMs. Defined by Microsoft as “a technique that can cause the failure of guardrails”, jailbreaking still poses a huge problem to LLM providers in 2025, since people can leverage it to easily break terms of service.In this post, we’ll try simple one-shot jailbreaks against each major model provider, assess the responses, and look at the future of jailbreaking. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Mar 8, 202510 min

Is Github Copilot Poisoned? Part 2

Article - https://www.aiblade.net/p/is-github-copilot-poisoned-part-2 In my previous post, I looked at how code generation models could potentially be poisoned. The impacts could be devastating, and I created a small script to find evidence of this at play. However, my code was too slow, and I didn’t find any meaningful results.In this post, I seek to improve upon my last experiment. I’ll investigate massive datasets of coding-related prompts, collect thousands of lines of AI-generated code, and analyse this code for evidence of malicious activity.Try Notion For Free! https://affiliate.notion.so/pqesm7yjddbc This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Feb 22, 202514 min

How Secure Is DeepSeek?

Article - https://www.aiblade.net/p/a2b8dbe2-ff30-4dd5-9c60-2781f07fea9a DeepSeek AI is taking the world by storm; their new R1 model provides ChatGPT-like capabilities at a fraction of the cost. But how secure really is it? In this post, we’ll take a look at three key areas: the shady origins of DeepSeek AI, a critical vulnerability allowing full database access, and targeted account compromise. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Feb 8, 20259 min

Is Github Copilot Poisoned?

In my last post, I looked at the feasibility of poisoning AI models. While the task would be challenging, the payoff would be huge, allowing threat actors to inject critical vulnerabilities into production codebases.So… have code suggestion models already been poisoned? In this post, we’ll develop a script to test Copilot for poisoning, evaluate its results, and suggest improvements for future experiments. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jan 25, 20259 min

AI Poisoning - Is It Really A Threat?

Article - https://www.aiblade.net/p/ai-poisoning-is-it-really-a-threat AI Training Data Poisoning is a hot topic, with OWASP citing it as the third most critical security risk faced by LLM Applications. But have these attacks ever occurred, and are they feasible for threat actors to use? In this post, I will scrutinize cutting-edge research and use my cybersecurity knowledge to conclude how impactful AI Poisoning really is. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jan 9, 20259 min

AI Pentesting With VulnHuntr

Article - https://www.aiblade.net/p/ai-pentesting-with-vulnhuntr For years, CISOs have been fantasizing about truly automated penetration testing, allowing them to quickly find critical bugs in key applications. While this dream isn’t fully here yet, VulnHuntr offers an LLM-based code analysis package that promises to “find and explain complex, multistep vulnerabilities”. In this post, we’ll look at what VulnHuntr is, how it works, and if this tool lives up to its bold claim. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Dec 15, 20246 min

AI Bug Bounty Guide 2024

Article: https://www.aiblade.net/p/ai-bug-bounty-guide-2024 Bug Bounty has long been an established source of income in the cybersecurity industry. As insecure AI/ML-based applications enter the market in 2024, new bounty programs with low-hanging fruit are opening up. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Nov 14, 20249 min

Claude Computer Use - The First Prompt Injection

Article: https://www.aiblade.net/p/claude-computer-use-prompt-injectionOn 22nd October 2024, Claude Computer Use was released to the world. While Computer Use is an incredible tool, it is also insecure by default. In this blog post, we’ll look at how Johann Rehberger from Embrace The Red was able to completely compromise a Claude-controlled machine via an ingenious Indirect Prompt Injection. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Nov 2, 20247 min

Hacking The AI Goat

Article: https://www.aiblade.net/p/hacking-the-ai-goat The AI Goat is a deliberately vulnerable AI architecture hosted on AWS. Created by Orca Security, it serves as a resource to train the next generation of ethical hackers. In this post, I will hack the Goat, discuss what I like about it, and suggest improvements to make it even better. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Oct 19, 20249 min

Indirect Prompt Injection Methodology (IPIM)

After exploiting several Indirect Prompt Injection vulnerabilities, I decided to author my first white paper: The Practical Application of Indirect Prompt Injection Attacks. In this post, I will present my Indirect Prompt Injection Methodology from the paper, discuss the outcomes of my research, and consider its significance in the future of AI Security. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Oct 12, 202413 min

2024 - State of AI Security Report

Generative AI now features in the production environments of several large organizations, yet very little research has been done surrounding its security. Orca Security seeks to change this with their “2024 - State of AI Security Report”.In this post, I will summarize the report’s key findings, analyze their relevance, and consider the future of AI Security. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Sep 28, 202411 min

AI Security With Chester Wisniewski

Chester Wisniewski is the Global Field CTO at Sophos, with a wealth of technical knowledge and over 25 years of experience in the cybersecurity industry.In this episode, we sit down and discuss a range of topics, including:- Whether ChatGPT was released too soon to the world- AI Security as the next big security skillset- If Apple Intelligence will live up to all its security claimsChester runs his own podcast – Security Take Two. Check it out here.You can reach out to Chester on LinkedIn or Mastodon. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Aug 1, 202429 min

ChatGPT - Delete My Code Without Me Asking!

Article - https://www.aiblade.net/p/chatgpt-delete-my-code AskTheCode is a GPT that allows users to “Provide a GitHub repository URL and ask about any aspect of the code”. With over 100k conversations and 1000 ratings on ChatGPT, software developers widely use this tool to improve their efficiency.…But is it really secure to give an AI access to your codebase?In this post, I will showcase how I used every technique at my disposal to push AskTheCode to its limits and craft an exploit. Then I will explain how I collaborated with the developer to remediate the issue. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jul 13, 202413 min

How Secure Will Apple Intelligence Be?

Article: https://www.aiblade.net/p/how-secure-will-apple-intelligence-be On 10/06/24, Apple announced its long-awaited “Apple Intelligence” to the world. Apple Intelligence is a suite of AI tools integrated into existing functionality to let users “get things done effortlessly”.As always, Apple has gone to great lengths to make this technology high-quality and watertight. But will it be 100% secure? In this post, we’ll look at what we know already based on Apple’s announcement, analyze this through a cybersecurity lens, and speculate on future security flaws in Apple Intelligence. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jun 15, 202412 min

ChatGPT - Send Me Someone's Calendar!

Article: https://www.aiblade.net/p/chatgpt-send-me-someones-calendar OpenAI recently introduced GPTs to premium users, allowing people to interact with third-party web services via a Large Language Model. But is this safe when AI is so easy to trick?In this post, I will present my novel research: exploiting a personal assistant GPT, causing it to unwittingly email the contents of someone’s calendar to an attacker. I will expand on the wider problems related to this vulnerability and discuss the future of similar exploits. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jun 8, 202415 min

How Hugging Face Was (Ethically) Hacked

Article: https://www.aiblade.net/p/how-hugging-face-was-ethically-hacked In this episode, we will look at how security researchers at Wiz were able to achieve Remote Code Execution on Hugging Face and escalate their privileges to read other people’s data. We will examine the consequences of the attack, and then consider countermeasures to prevent it from happening in the future. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Jun 1, 202412 min

AI Phone Scams: Automated Social Engineering

Article: https://www.aiblade.net/p/ai-phone-scams Several companies have begun offering free AI phone call services, featuring large language models linked to AI voice generators.The technology is undeniably cool… but the consequences are potentially catastrophic.In this episode, we will look at how AI phone assistants work, demo some real examples of them being exploited to perform social engineering attacks and look at how to mitigate the associated risks. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

May 23, 202416 min

Backdoors in ML - The Dark Side of Hugging Face

Article: https://www.aiblade.net/p/backdoors-in-ml New machine learning models are an exciting field to research. Hugging Face is the leader in this space, allowing people to upload and download open source ML projects.At the time of writing, over half a million open source models are available on Hugging Face. But innovative threat actors are using the hype around AI as a guise to hack victim computers. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

May 15, 202410 min

Unjailbreakable Large Language Models

Article: https://www.aiblade.net/p/unjailbreakable-large-language-models Since the beginning of the AI gold rush, people have used large language models for malicious intent. Drug recipes, explicit output, and discriminatory behaviour have all been elicited, with often hilarious results.These techniques are known as “prompt injections” or “jailbreaks” - getting the LLM to perform actions outside those intended by its developers. Prompt injections could have devastating consequences in certain scenarios.This episode aims to look at reliable countermeasures to prompt injection, and answer the burning question - “Is it possible to create 100% secure LLMs?" This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

May 9, 202419 min

How AI Threatens Critical Infrastructure

Article: https://www.aiblade.net/p/how-ai-threatens-critical-infrastructure On April 26th, 2024, the Department of Homeland Security released a 28-page document outlining AI security guidelines for critical infrastructure owners.While it is a step in the right direction, the whitepaper is vague, dry, and unhelpful.In this podcast, I will summarize the paper to save you from reading it, give my thoughts on how it could be improved, and discuss how we can guard critical infrastructure from AI threats in the future. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

May 5, 202410 min

AIBlade