OpenZeppelin flags data contamination and misclassified risks in OpenAI's EVMbench dataset

Blockchain security firm OpenZeppelin reported that OpenAI's EVMbench benchmark suffers from training data contamination and from methodological issues in how it classifies smart contract vulnerabilities. According to the audit, several AI models may have seen the underlying vulnerability reports before being evaluated, and at least four cases labeled as high severity are not practically exploitable.