
AIBlade Podcast
Cutting Edge AI Security
David Willis-Owen
Show overview
AIBlade Podcast launched in 2024 and has put out 21 episodes in the time since. That works out to roughly 4 hours of audio in total. Releases follow a fortnightly cadence.
Episodes typically run ten to twenty minutes — most land between 10 min and 14 min — though episode length varies meaningfully from one episode to the next. None of the episodes are flagged explicit by the publisher. It is catalogued as a EN-language Technology show.
The catalogue appears to be on hiatus or wound down — the most recent episode landed 1.3 years ago, with no new episodes in over a year. The busiest year was 2024, with 16 episodes published. Published by David Willis-Owen.
From the publisher
Bringing you cutting edge AI Security research www.aiblade.net
Latest Episodes
View all 21 episodes
Jailbreaking Grok 3 | DeepSeek, ChatGPT, Claude & More
Article - https://www.aiblade.net/p/4030b68a-2ab6-452e-9a67-530f91a801f9 Notion Free Trial - https://affiliate.notion.so/pqesm7yjddbc AI Jailbreaking has been around since the dawn of consumer-grade LLMs. Defined by Microsoft as “a technique that can cause the failure of guardrails”, jailbreaking still poses a huge problem to LLM providers in 2025, since people can leverage it to easily break terms of service.In this post, we’ll try simple one-shot jailbreaks against each major model provider, assess the responses, and look at the future of jailbreaking. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Is Github Copilot Poisoned? Part 2
Article - https://www.aiblade.net/p/is-github-copilot-poisoned-part-2 In my previous post, I looked at how code generation models could potentially be poisoned. The impacts could be devastating, and I created a small script to find evidence of this at play. However, my code was too slow, and I didn’t find any meaningful results.In this post, I seek to improve upon my last experiment. I’ll investigate massive datasets of coding-related prompts, collect thousands of lines of AI-generated code, and analyse this code for evidence of malicious activity.Try Notion For Free! https://affiliate.notion.so/pqesm7yjddbc This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

How Secure Is DeepSeek?
Article - https://www.aiblade.net/p/a2b8dbe2-ff30-4dd5-9c60-2781f07fea9a DeepSeek AI is taking the world by storm; their new R1 model provides ChatGPT-like capabilities at a fraction of the cost. But how secure really is it? In this post, we’ll take a look at three key areas: the shady origins of DeepSeek AI, a critical vulnerability allowing full database access, and targeted account compromise. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Is Github Copilot Poisoned?
In my last post, I looked at the feasibility of poisoning AI models. While the task would be challenging, the payoff would be huge, allowing threat actors to inject critical vulnerabilities into production codebases.So… have code suggestion models already been poisoned? In this post, we’ll develop a script to test Copilot for poisoning, evaluate its results, and suggest improvements for future experiments. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

AI Poisoning - Is It Really A Threat?
Article - https://www.aiblade.net/p/ai-poisoning-is-it-really-a-threat AI Training Data Poisoning is a hot topic, with OWASP citing it as the third most critical security risk faced by LLM Applications. But have these attacks ever occurred, and are they feasible for threat actors to use? In this post, I will scrutinize cutting-edge research and use my cybersecurity knowledge to conclude how impactful AI Poisoning really is. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

AI Pentesting With VulnHuntr
Article - https://www.aiblade.net/p/ai-pentesting-with-vulnhuntr For years, CISOs have been fantasizing about truly automated penetration testing, allowing them to quickly find critical bugs in key applications. While this dream isn’t fully here yet, VulnHuntr offers an LLM-based code analysis package that promises to “find and explain complex, multistep vulnerabilities”. In this post, we’ll look at what VulnHuntr is, how it works, and if this tool lives up to its bold claim. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

AI Bug Bounty Guide 2024
Article: https://www.aiblade.net/p/ai-bug-bounty-guide-2024 Bug Bounty has long been an established source of income in the cybersecurity industry. As insecure AI/ML-based applications enter the market in 2024, new bounty programs with low-hanging fruit are opening up. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Claude Computer Use - The First Prompt Injection
Article: https://www.aiblade.net/p/claude-computer-use-prompt-injectionOn 22nd October 2024, Claude Computer Use was released to the world. While Computer Use is an incredible tool, it is also insecure by default. In this blog post, we’ll look at how Johann Rehberger from Embrace The Red was able to completely compromise a Claude-controlled machine via an ingenious Indirect Prompt Injection. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Hacking The AI Goat
Article: https://www.aiblade.net/p/hacking-the-ai-goat The AI Goat is a deliberately vulnerable AI architecture hosted on AWS. Created by Orca Security, it serves as a resource to train the next generation of ethical hackers. In this post, I will hack the Goat, discuss what I like about it, and suggest improvements to make it even better. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Indirect Prompt Injection Methodology (IPIM)
After exploiting several Indirect Prompt Injection vulnerabilities, I decided to author my first white paper: The Practical Application of Indirect Prompt Injection Attacks. In this post, I will present my Indirect Prompt Injection Methodology from the paper, discuss the outcomes of my research, and consider its significance in the future of AI Security. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

2024 - State of AI Security Report
Generative AI now features in the production environments of several large organizations, yet very little research has been done surrounding its security. Orca Security seeks to change this with their “2024 - State of AI Security Report”.In this post, I will summarize the report’s key findings, analyze their relevance, and consider the future of AI Security. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

AI Security With Chester Wisniewski
Chester Wisniewski is the Global Field CTO at Sophos, with a wealth of technical knowledge and over 25 years of experience in the cybersecurity industry.In this episode, we sit down and discuss a range of topics, including:- Whether ChatGPT was released too soon to the world- AI Security as the next big security skillset- If Apple Intelligence will live up to all its security claimsChester runs his own podcast – Security Take Two. Check it out here.You can reach out to Chester on LinkedIn or Mastodon. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

ChatGPT - Delete My Code Without Me Asking!
Article - https://www.aiblade.net/p/chatgpt-delete-my-code AskTheCode is a GPT that allows users to “Provide a GitHub repository URL and ask about any aspect of the code”. With over 100k conversations and 1000 ratings on ChatGPT, software developers widely use this tool to improve their efficiency.…But is it really secure to give an AI access to your codebase?In this post, I will showcase how I used every technique at my disposal to push AskTheCode to its limits and craft an exploit. Then I will explain how I collaborated with the developer to remediate the issue. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

How Secure Will Apple Intelligence Be?
Article: https://www.aiblade.net/p/how-secure-will-apple-intelligence-be On 10/06/24, Apple announced its long-awaited “Apple Intelligence” to the world. Apple Intelligence is a suite of AI tools integrated into existing functionality to let users “get things done effortlessly”.As always, Apple has gone to great lengths to make this technology high-quality and watertight. But will it be 100% secure? In this post, we’ll look at what we know already based on Apple’s announcement, analyze this through a cybersecurity lens, and speculate on future security flaws in Apple Intelligence. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

ChatGPT - Send Me Someone's Calendar!
Article: https://www.aiblade.net/p/chatgpt-send-me-someones-calendar OpenAI recently introduced GPTs to premium users, allowing people to interact with third-party web services via a Large Language Model. But is this safe when AI is so easy to trick?In this post, I will present my novel research: exploiting a personal assistant GPT, causing it to unwittingly email the contents of someone’s calendar to an attacker. I will expand on the wider problems related to this vulnerability and discuss the future of similar exploits. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

How Hugging Face Was (Ethically) Hacked
Article: https://www.aiblade.net/p/how-hugging-face-was-ethically-hacked In this episode, we will look at how security researchers at Wiz were able to achieve Remote Code Execution on Hugging Face and escalate their privileges to read other people’s data. We will examine the consequences of the attack, and then consider countermeasures to prevent it from happening in the future. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

AI Phone Scams: Automated Social Engineering
Article: https://www.aiblade.net/p/ai-phone-scams Several companies have begun offering free AI phone call services, featuring large language models linked to AI voice generators.The technology is undeniably cool… but the consequences are potentially catastrophic.In this episode, we will look at how AI phone assistants work, demo some real examples of them being exploited to perform social engineering attacks and look at how to mitigate the associated risks. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Backdoors in ML - The Dark Side of Hugging Face
Article: https://www.aiblade.net/p/backdoors-in-ml New machine learning models are an exciting field to research. Hugging Face is the leader in this space, allowing people to upload and download open source ML projects.At the time of writing, over half a million open source models are available on Hugging Face. But innovative threat actors are using the hype around AI as a guise to hack victim computers. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

Unjailbreakable Large Language Models
Article: https://www.aiblade.net/p/unjailbreakable-large-language-models Since the beginning of the AI gold rush, people have used large language models for malicious intent. Drug recipes, explicit output, and discriminatory behaviour have all been elicited, with often hilarious results.These techniques are known as “prompt injections” or “jailbreaks” - getting the LLM to perform actions outside those intended by its developers. Prompt injections could have devastating consequences in certain scenarios.This episode aims to look at reliable countermeasures to prompt injection, and answer the burning question - “Is it possible to create 100% secure LLMs?" This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net

How AI Threatens Critical Infrastructure
Article: https://www.aiblade.net/p/how-ai-threatens-critical-infrastructure On April 26th, 2024, the Department of Homeland Security released a 28-page document outlining AI security guidelines for critical infrastructure owners.While it is a step in the right direction, the whitepaper is vague, dry, and unhelpful.In this podcast, I will summarize the paper to save you from reading it, give my thoughts on how it could be improved, and discuss how we can guard critical infrastructure from AI threats in the future. This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.aiblade.net