Deep dive into the abuse of DL APIs to create malicious AI models and how to detect them

Thursday 25 September 16:00 - 16:30, Green room

Mohamed Nabeel & Alex Starov (Palo Alto Networks)

According to Gartner, more than 70% of organizations will have integrated AI models into their workflows by the end of 2025. To reduce cost and foster innovation, organizations often fetch pretrained models from model hubs such as Hugging Face or TensorFlow Hub. However, this introduces a security risk: attackers can inject malicious code into the models they upload to these hubs, leading to attacks such as remote code execution (RCE), sensitive data exfiltration, and system file modification when these models are loaded or executed (for example, through the predict function). Since AI models play a critical role in digital transformation, this would, unfortunately, drastically increase the number of software supply chain attacks.

While there have been several efforts to detect malware when deserializing Pickle-based saved models (malware hidden in model parameters), the risk of abusing DL APIs (e.g. TensorFlow APIs) is understudied. Specifically, we show how one can abuse hidden functionality of TensorFlow APIs, such as file read/write and network send/receive, along with their persistence APIs, to launch attacks.
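As a rough illustration of the Pickle-side detection mentioned above, the following sketch walks a pickle's opcode stream without executing it and counts the opcodes that can import or invoke a callable during deserialization. It is a minimal example built on Python's standard `pickletools` module, not the method of any particular scanner; the function name `scan_pickle_opcodes` and the grouping of opcodes into two sets are assumptions made for illustration.

```python
import pickle
import pickletools

# Opcodes that resolve a module-level name while unpickling (assumed grouping).
IMPORT_OPCODES = {"GLOBAL", "STACK_GLOBAL", "INST"}
# Opcodes that call or instantiate an object while unpickling (assumed grouping).
CALL_OPCODES = {"REDUCE", "OBJ", "NEWOBJ", "NEWOBJ_EX"}


def scan_pickle_opcodes(data: bytes) -> dict:
    """Count import-like and call-like opcodes without loading the pickle."""
    counts = {"import": 0, "call": 0}
    # genops only parses the opcode stream; it never executes it.
    for opcode, _arg, _pos in pickletools.genops(data):
        if opcode.name in IMPORT_OPCODES:
            counts["import"] += 1
        elif opcode.name in CALL_OPCODES:
            counts["call"] += 1
    return counts


if __name__ == "__main__":
    # A plain container of floats needs no imports or calls to deserialize.
    print(scan_pickle_opcodes(pickle.dumps({"weights": [0.1, 0.2]})))
```

A non-zero count is only a signal for closer inspection, since many legitimate objects also need imports and calls to be reconstructed. A check of this kind looks only at the serialization layer and says nothing about what the framework operations stored in a model will do once it runs.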

It is concerning to note that existing scanners in model hubs like Hugging Face and TensorFlow Hub are unable to detect some stealthy abuses of such APIs. This is because scanning tools like Fickling and ModelScan face challenges in semantic-level analysis and in-depth taint analysis. We also show how one may identify potentially abusable hidden API capabilities using LLMs and build scanners to detect such abuses.
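To make the difference between flagging individual operations and taint analysis concrete, here is a minimal sketch. It assumes the operation types and input edges of a model graph have already been extracted into a simple `Node` list, a hypothetical structure used only for this example. `ReadFile`, `WriteFile` and `MatchingFiles` are TensorFlow file-system op types; treating them as sources and sinks, along with every function name below, is an illustrative assumption and not the scanner presented in the talk.

```python
from dataclasses import dataclass, field

# Op types treated as sensitive in this sketch (an assumed, incomplete list).
SOURCE_OPS = {"ReadFile", "MatchingFiles"}  # bring file-system data into the graph
SINK_OPS = {"WriteFile"}                    # push graph data out to the file system


@dataclass
class Node:
    """Hypothetical, simplified view of one graph node."""
    name: str
    op: str
    inputs: list = field(default_factory=list)


def flag_sensitive_ops(nodes):
    """Denylist check: names of nodes whose op type is a source or a sink."""
    return sorted(n.name for n in nodes if n.op in SOURCE_OPS | SINK_OPS)


def derives_from_source(start, by_name):
    """Walk input edges backwards from `start` looking for a source op."""
    stack, seen = [start], set()
    while stack:
        name = stack.pop()
        if name in seen or name not in by_name:
            continue
        seen.add(name)
        node = by_name[name]
        if node.op in SOURCE_OPS:
            return True
        stack.extend(node.inputs)
    return False


def tainted_sinks(nodes):
    """Sinks with at least one input that transitively depends on a source."""
    by_name = {n.name: n for n in nodes}
    return sorted(
        n.name
        for n in nodes
        if n.op in SINK_OPS
        and any(derives_from_source(i, by_name) for i in n.inputs)
    )
```

The denylist check reports every sensitive node, whereas `tainted_sinks` reports only writes whose content can be traced back to a read. A real scanner would additionally need to resolve constant arguments such as file paths and to follow data through nested functions, which is where the semantic-level challenges described above arise.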

Mohamed Nabeel

Mohamed Nabeel, Ph.D., is a principal security researcher at Palo Alto Networks, where he investigates and develops solutions for open problems in web and DNS security using ML/AI/GenAI, providing advanced capabilities to protect internet users and AI security. He has authored 20+ US patents and 25+ papers at top security conferences. He has presented his work at top industry conferences including RSA Conference and Virus Bulletin.

Alex Starov

Alex Starov is a senior manager of the web security research team at Palo Alto Networks. His research focuses on proactive and data-driven web security and malicious URL detection, and he manages several of the brightest researchers and engineers working to protect web users against sophisticated cyber attacks. He obtained his Ph.D. in computer science from Stony Brook University. He has published his work in top security venues and has authored several patents.

Other VB2025 papers

Silent killers: unmasking a large-scale legacy driver exploitation campaign

VB2025 presentation: Silent killers: unmasking a large-scale legacy driver exploitation campaign, Jiří Vinopal

Practical AWS antiforensics

VB2025 presentation: Practical AWS antiforensics, Santiago Abastante

The Wolf of Wall Steal: inside crypto traffer group operations

VB2025 presentation: The Wolf of Wall Steal: inside crypto traffer group operations, Anna Pham & Joan Garcia

Demystifying the Playboy RaaS

VB2025 presentation: Demystifying the Playboy RaaS, Gijs Rijnders

Evading in plain sight: how adversaries beat user-mode protection engines for over a decade

VB2025 presentation: Evading in plain sight: how adversaries beat user-mode protection engines for over a decade, Omri Misgav

From Latin America to the world: ransomware TTPs, prolonged intrusions, and regional adaptation

VB2025 presentation: From Latin America to the world: ransomware TTPs, prolonged intrusions, and regional adaptation, Isabel Manjarrez

Tracking the IoT botnet's bloodline: code footprints don’t lie

VB2025 presentation: Tracking the IoT botnet's bloodline: code footprints don’t lie, Chanbin Jeon, ChangGyun Kim & SeungBeom Lim

Invisible thieves in the front yard -- from an advanced evasive edge-device attack to potential mitigation methods

VB2025 presentation: Invisible thieves in the front yard -- from an advanced evasive edge-device attack to potential mitigation methods, Ting-Wei Hsieh

Google Calendar as C2 infrastructure: a China-nexus campaign with stealthy tactics

VB2025 presentation: Google Calendar as C2 infrastructure: a China-nexus campaign with stealthy tactics, Tim Chen & Still Hsu

Goodbye loaders, hello RMM: the rise of legit software in ecrime campaigns

VB2025 presentation: Goodbye loaders, hello RMM: the rise of legit software in ecrime campaigns, Selena Larson & Ole Villadsen

Silent Lynx: uncovering a cyber espionage campaign in Central Asia

VB2025 presentation: Silent Lynx: uncovering a cyber espionage campaign in Central Asia, Subhajeet Singha & Sathwik Ram Prakki

The dark prescription: inside the infrastructure of illegal online pharmacies

VB2025 presentation: The dark prescription: inside the infrastructure of illegal online pharmacies, Martin Chlumecky & Lubos Bever

Panel: Tales from the Old West

VB2025 presentation: Panel: Tales from the Old West, Righard Zwienenberg, Jan Hruska, Pavel Baudis & Tjark Auerbach

Unmasking the GrassCall campaign: the hackers behind job recruitment cyber scams

VB2025 presentation: Unmasking the GrassCall campaign: the hackers behind job recruitment cyber scams, Dixit Panchal & Soumen Burma

Cracked by the GRU: how Russia’s notorious Sandworm unit weaponizes pirated software usage to target Ukraine

VB2025 presentation: Cracked by the GRU: how Russia’s notorious Sandworm unit weaponizes pirated software usage to target Ukraine, Arda Büyükkaya

Hunting potential C2 commands in Android malware via Smali string comparison and control flow analysis

VB2025 presentation: Hunting potential C2 commands in Android malware via Smali string comparison and control flow analysis, JunWei Song

Vo1d rising: inside the botnet controlling 1.68 M+ Android TVs worldwide

VB2025 presentation: Vo1d rising: inside the botnet controlling 1.68 M+ Android TVs worldwide, Alex Turing

Arachnid alert: Latrodectus loader crawls through defences

VB2025 presentation: Arachnid alert: Latrodectus loader crawls through defences, Albert Zsigovits

When avatars come alive: understanding hybrid threat actors

VB2025 presentation: When avatars come alive: understanding hybrid threat actors, Itay Cohen & Omer Benjakob

Inside Akira ransomware's Rust experiment

VB2025 presentation: Inside Akira ransomware's Rust experiment, Ben Herzog

Rogue hirer, rogue hiree: workplace cyber threats to individuals and businesses

VB2025 presentation: Rogue hirer, rogue hiree: workplace cyber threats to individuals and businesses, Chris Boyd

You definitely don’t want to CopyPaste this: FakeCaptcha ecosystem

VB2025 presentation: You definitely don’t want to CopyPaste this: FakeCaptcha ecosystem, Dmitrij Lenz & Roberto Dasilva

The Phantom Circuit: the Lazarus Group’s evolution in supply chain compromise

VB2025 presentation: The Phantom Circuit: the Lazarus Group’s evolution in supply chain compromise, Ryan Sherstobitoff

DeceptiveDevelopment and North Korean IT workers: from primitive crypto theft to sophisticated AI-based deception

VB2025 presentation: DeceptiveDevelopment and North Korean IT workers: from primitive crypto theft to sophisticated AI-based deception, Matej Havranek

Vietnamese hacking group: a rising of information stealing campaigns going global

VB2025 presentation: Vietnamese hacking group: a rising of information stealing campaigns going global, Chetan Raghuprasad & Joey Chen

Stealth over TLS: the emergence of ECH-based C&C in ECHidna malware

VB2025 presentation: Stealth over TLS: the emergence of ECH-based C&C in ECHidna malware, Yuta Sawabe & Rintaro Koike

Prediction of future attack indicators based on the 2024 analysis of threats from malicious app distribution sites in South Korea

VB2025 presentation: Prediction of future attack indicators based on the 2024 analysis of threats from malicious app distribution sites in South Korea, Kyung Rae Noh, Shinho Lee, Eui-Tak Kim, Yujin Shim, Jonghwa Han & Jung-Sik Cho

Unmasking the unseen: a deep dive into modern Linux rootkits and their detection

VB2025 presentation: Unmasking the unseen: a deep dive into modern Linux rootkits and their detection, Ruben Groenewoud & Remco Sprooten

Boosting URL detection with syntactic features in spam emails

VB2025 presentation: Boosting URL detection with syntactic features in spam emails, Antonia Scherz

Dissecting evil twin RATs: tracking the long-term use of TA410's FlowCloud toolset

VB2025 presentation: Dissecting evil twin RATs: tracking the long-term use of TA410's FlowCloud toolset, Hiroshi Takeuchi

Unmasking TAG-124: dissecting a prevalent traffic distribution system in the cybercriminal ecosystem

VB2025 presentation: Unmasking TAG-124: dissecting a prevalent traffic distribution system in the cybercriminal ecosystem, Julian-Ferdinand Vögele

The Bitter end: unravelling 8 years of APT antics

VB2025 presentation: The Bitter end: unravelling 8 years of APT antics, Abdallah Elshinbary, Nick Attfield, Konstantin Klinger & Jonas Wagner

The attribution story of WhisperGate: an academic perspective

VB2025 presentation: The attribution story of WhisperGate: an academic perspective, Alexander Adamov

Emmenhtal Loader: the silent enabler of modern malware campaigns

VB2025 presentation: Emmenhtal Loader: the silent enabler of modern malware campaigns, Lovely Antonio, Ricardo Pineda & Louis Sorita

Sophistication or missed opportunity? Analysing XE Group’s long-term exploitation of zero-days with limited impact

VB2025 presentation: Sophistication or missed opportunity? Analysing XE Group’s long-term exploitation of zero-days with limited impact, Justin Lentz & Nicole Fishbein

Attacker identity revealed: insights from rogue VMs & BYOVD in EDR evasion

VB2025 presentation: Attacker identity revealed: insights from rogue VMs & BYOVD in EDR evasion, Navin Thomas, Renzon Cruz & Cuong Dinh

Living in the hypervisor: defeating anti-[VM, sandbox, analysis] via patching hypervisor

VB2025 presentation: Living in the hypervisor: defeating anti-[VM, sandbox, analysis] via patching hypervisor, Kağan Işıldak

PepsiDog: inside the rise of a professional Chinese phishing actor

VB2025 presentation: PepsiDog: inside the rise of a professional Chinese phishing actor, Stefan Tanase & Ionut Bucur

Code Red: How KnowBe4 exposed a North Korean IT infiltration scheme

VB2025 keynote presentation: Code Red: How KnowBe4 exposed a North Korean IT infiltration scheme, Martin Kraemer

TIPS: Smashing smishing by quashing quishing

VB2025 TIPS presentation: Smashing smishing by quashing quishing, Andrew Brandt

TIPS: Collective intelligence in OT cybersecurity: transforming threat insights into proactive defence

VB2025 TIPS presentation: Collective intelligence in OT cybersecurity: transforming threat insights into proactive defence, AJ Eserjose

TIPS: The battlegrounds are moving faster than we are - can we turn this oil-tanker on a dime?

VB2025 TIPS presentation: The battlegrounds are moving faster than we are - can we turn this oil-tanker on a dime?, Tim West

TIPS: How MITRE is AI, anyway?

VB2025 TIPS presentation: How MITRE is AI, anyway?, Samir Mody

TIPS: Fireside chat: The tortured “cybersecurity” poets department

VB2025 TIPS presentation: Fireside chat: The tortured “cybersecurity” poets department, Cat Self, Jeanette Miller, Jeannette Jarvis, Selena Larson

TIPS: Beyond machine translation: struggles and adaptations of North Korean IT workers in Japan’s crowdsourcing market

VB2025 TIPS presentation: Beyond machine translation: struggles and adaptations of North Korean IT workers in Japan’s crowdsourcing market, Takahiro Kakumaru & Yoshihiro Kori

TIPS: Panel: The wheels on the CVE go round and round: breaking the cycle of vulnerability fatigue

VB2025 TIPS presentation: Panel: The wheels on the CVE go round and round: breaking the cycle of vulnerability fatigue, Righard Zwienenberg, Robin Staa, John Alexander, Geri Revay

TIPS: Stop the flood: building a quality and trust-driven threat intelligence ecosystem

VB2025 TIPS presentation: Stop the flood: building a quality and trust-driven threat intelligence ecosystem, Kihong Kim & SuhMahn Hur

TIPS: Diff’ing the light fantastic – tracking typosquatting and disinformation in a resource-constrained environment

VB2025 TIPS presentation: Diff’ing the light fantastic – tracking typosquatting and disinformation in a resource-constrained environment, James Slaughter

TIPS: From clusters to actors: a practical threat actor attribution framework

VB2025 TIPS presentation: From clusters to actors: a practical threat actor attribution framework, Kyle Wilhoit & Robert Falcone

Cybersecurity 2035: where will we be in 10 years' time?

VB2025 presentation: Cybersecurity 2035: where will we be in 10 years' time?, Paul Ducklin

European PDNS readiness

VB2025 presentation: European PDNS readiness, Viliam Peli, George Buhai

Collaborative response to emerging critical RCE vulnerabilities in exposed edge devices

VB2025 presentation: Collaborative response to emerging critical RCE vulnerabilities in exposed edge devices, Piotr Kijewski

TIPS: Keynote

VB2025 TIPS presentation: Keynote, Gonçalo Ribeiro

Don’t fear journalists! Talk to me! Hacks, exploits & best practices for improving researcher-reporter ties

VB2025 presentation: Don’t fear journalists! Talk to me! Hacks, exploits & best practices for improving researcher-reporter ties, Omer Benjakob