AI RESEARCH
Can Subgraph Explanations Be Weaponized to Steal Graph Neural Networks?
arXiv CS.LG
•
ArXi:2605.30470v1 Announce Type: new Graph Machine Learning as a Service (GMLaaS) platforms increasingly implement explainability interfaces to meet regulatory transparency requirements. However, this transparency creates exploitable vulnerabilities for model extraction attacks. We present the first model extraction attack specifically designed for graph classification under strict black-box constraints where the attacker observes only discrete class labels and binary explanation masks (no probability scores, gradients, or confidence values