ARXIV:2602.19818 · SECURITY AND MODEL INTEGRITY · SUBMITTED 17 MAR · 19:46 UTC · FRESHNESS STALE

Verified | Source: PDF linked
Partial | PaperPack: 3 of 4 citation fields filled
Missing | Missing fields: authors
Error | Proof: failed

SafePickle: Robust and Generic ML Detection of Malicious Pickle-based ML Models

arXiv

SafePickle offers a machine learning-based solution to detect malicious Pickle files in model repositories, enhancing security in AI model sharing.

Blocked on Code | Score 8.0 | Evidence failed

Opportunity summary

Pain: SafePickle offers a machine learning-based solution to detect malicious Pickle files in model repositories, enhancing security in AI model sharing.

Evidence: 0 refs | 0 sources | 33% coverage

Blocker: Evidence failed


PROBLEM

SafePickle offers a machine learning-based solution to detect malicious Pickle files in model repositories, enhancing security in AI model sharing. Recent defenses, such as PickleBall, rely on per-library policy synthesis that requires complex system…

METHOD

Full abstract

Model repositories such as Hugging Face increasingly distribute machine learning artifacts serialized with Python's pickle format, exposing users to remote code execution (RCE) risks during model loading. Recent defenses, such as PickleBall, rely on per-library policy synthesis that requires complex system setups and verified benign models, which limits scalability and generalization. In this work, we propose a lightweight, machine-learning-based scanner that detects malicious Pickle-based files without policy generation or code instrumentation. Our approach statically extracts structural and semantic features from Pickle bytecode and applies supervised and unsupervised models to classify files as benign or malicious. We construct and release a labeled dataset of 727 Pickle-based files from Hugging Face and evaluate our models on four datasets: our own, PickleBall (out-of-distribution), Hide-and-Seek (9 advanced evasive malicious models), and synthetic joblib files. Our method achieves 90.01% F1-score compared with 7.23%-62.75% achieved by the SOTA scanners (Modelscan, Fickling, ClamAV, VirusTotal) on our dataset. Furthermore, on the PickleBall data (OOD), it achieves 81.22% F1-score compared with 76.09% achieved by the PickleBall method, while remaining fully library-agnostic. Finally, we show that our method is the only one to correctly parse and classify 9/9 evasive Hide-and-Seek malicious models specially crafted to evade scanners. This demonstrates that data-driven detection can effectively and generically mitigate Pickle-based model file attacks.
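The static extraction step described in the abstract can be illustrated with Python's standard `pickletools` module, which disassembles a pickle stream without executing it. This is a minimal sketch and not the authors' feature set: the `opcode_features` helper, the `CALL_OPCODES` selection, and the `Payload` class are hypothetical names chosen for illustration. It only shows that opcode-level features can be read from the bytecode without ever calling `pickle.loads`.

```python
import pickle
import pickletools
from collections import Counter

# Opcodes that import or call objects. Hypothetical selection for
# illustration; not the feature set used in the paper.
CALL_OPCODES = {"GLOBAL", "STACK_GLOBAL", "REDUCE", "INST", "OBJ",
                "NEWOBJ", "NEWOBJ_EX"}


def opcode_features(data: bytes) -> dict:
    """Collect simple opcode statistics from a pickle stream without executing it."""
    counts = Counter()
    strings = []
    # genops only parses the stream; it never imports or calls anything.
    for opcode, arg, _pos in pickletools.genops(data):
        counts[opcode.name] += 1
        if isinstance(arg, str):
            strings.append(arg)
    return {
        "n_opcodes": sum(counts.values()),
        "n_call_opcodes": sum(counts[name] for name in CALL_OPCODES),
        "strings": strings,
    }


class Payload:
    """Harmless stand-in for a malicious object: unpickling it would call print()."""

    def __reduce__(self):
        return (print, ("executed during unpickling",))


benign = pickle.dumps({"weights": [0.1, 0.2, 0.3]})
suspicious = pickle.dumps(Payload())

benign_features = opcode_features(benign)
suspicious_features = opcode_features(suspicious)
print(benign_features["n_call_opcodes"], suspicious_features["n_call_opcodes"])
```

A feature dictionary like this could then be vectorized and passed to any supervised or unsupervised classifier. Which features and models work well is an empirical question that the paper's evaluation addresses, not this sketch.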

RESULT

ScienceToStartup currently rates this paper 8.0/10 on the public viability pass. On the authors' own dataset, the paper reports a 90.01% F1-score, compared with 7.23%-62.75% for the SOTA scanners (Modelscan, Fickling, ClamAV, VirusTotal).
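For readers comparing the figures above, the F1-score is the harmonic mean of precision and recall. The sketch below shows the standard definition; the `f1_score` helper name and the confusion-matrix counts in the usage line are made-up illustration values, not numbers from the paper.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Harmonic mean of precision and recall from confusion-matrix counts."""
    if tp == 0:
        # With no true positives, F1 is conventionally reported as zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Made-up counts for illustration only.
example = f1_score(tp=90, fp=10, fn=10)
```

Because F1 ignores true negatives, it suits scanner comparisons where benign files far outnumber malicious ones.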

WHY NOW

Security and Model Integrity moved forward this cycle; last verified April 2026. Public score 8.0/10.




Competitive landscape

SafePickle offers a machine learning-based solution to detect malicious Pickle files in model repositories, enhancing security in AI model sharing.

Segment: Security and Model Integrity

Adoption evidence: No public code link in the paper record yet

Commercial read: 8.0/10 public viability

Direct: not classified

Adjacent: not classified

Substitute: not classified

Unknown: not classified

References(19)

NeuPerm: Disrupting Malware Hidden in Neural Network Parameters by Leveraging Permutation Symmetry
2025 · Daniel Gilkarov, Ran Dubin

The Art of Hide and Seek: Making Pickle-Based Model Supply Chain Poisoning Stealthy Again
2025 · Tong Liu, Guozhu Meng et al.

My Model is Malware to You: Transforming AI Models into Malware by Abusing TensorFlow APIs
2025 · Ruofan Zhu, Ganhao Chen et al.

Models Are Codes: Towards Measuring Malicious Code Poisoning Attacks on Pre-trained Model Hubs
2024 · Jian Zhao, Shenao Wang et al.

Steganalysis of AI Models LSB Attacks
2023 · Daniel Gilkarov, Ran Dubin

Reusing Deep Learning Models: Challenges and Directions in Software Engineering
2023 · James C. Davis, Purvish Jajal et al.

Calibration-based Steganalysis for Neural Network Steganography
2023 · Na Zhao, Kejiang Chen et al.

An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry
2023 · Wenxin Jiang, Nicholas M. Synovic et al.

Pain Pickle: Bypassing Python Restricted Unpickler for Automatic Exploit Generation
2022 · Nan-Jung Huang, Chih-Jen Huang et al.

An Empirical Study of Artifacts and Security Risks in the Pre-trained Model Supply Chain
2022 · Wenxin Jiang, Nicholas M. Synovic et al.

GitHub
2022 · Sufyan bin Uzayr

MaleficNet: Hiding Malware into Deep Neural Networks Using Spread-Spectrum Channel Coding
2022 · Dorjan Hitaj, Giulio Pagnotta et al.

EvilModel 2.0: Bringing Neural Network Models into Malware Attacks
2021 · Zhi Wang, Chaoge Liu et al.

EvilModel: Hiding Malware Inside of Neural Network Models
2021 · Zhi Wang, Chaoge Liu et al.

LoRA: Low-Rank Adaptation of Large Language Models
2021 · J. Hu, Yelong Shen et al.

Backstabber’s Knife Collection: A Review of Open Source Software Supply Chain Attacks
2020 · Marc Ohm, H. Plate et al.

Atlas
2014 · Dhruva R. Chakrabarti, Hans-Juergen Boehm et al.

Scikit-learn: Machine Learning in Python
2011 · Fabian Pedregosa, G. Varoquaux et al.

Article Authors
2000


Authors: Hillel Ohayon, Daniel Gilkarov, Ran Dubin (Ariel University, Israel)

Published: 2026-02-23 13:19:43 UTC

arXiv: https://arxiv.org/abs/2602.19818

What products could be built from this research?

The solution can be productized as a security add-on for AI model-sharing platforms, offering enhanced protection by scanning and flagging potentially harmful Pickle files before they can be shared or downloaded.

What are the practical use cases?

Develop a security plugin for model sharing platforms like Hugging Face to automatically scan uploaded Pickle files for malicious content, warning users and potentially blocking uploads that fail the safety checks.

What industries could this research disrupt?

SafePickle can replace or complement existing model scanners that may be less effective or more cumbersome to use, like policy-based methods that require complex setups or disrupt common workflows.
