OpenAI introduces EVMbench to measure AI crypto security

2026-2-23 10:50

OpenAI has launched a benchmarking system called EVMbench to evaluate how effectively artificial intelligence can identify and exploit security weaknesses in crypto smart contracts.

Announced on Feb. 18 and developed with Paradigm, the system focuses on contracts built for the Ethereum Virtual Machine.

The release reflects growing concern around blockchain security, as smart contracts secure more than $100 billion in open-source crypto assets.

By creating a controlled environment, OpenAI aims to understand how advanced models perform when handling financial software risks.

Benchmark design

EVMbench measures three capabilities: detecting vulnerabilities, repairing flawed code, and executing exploit scenarios.

The benchmark includes 120 high-risk security issues from 40 past smart contract audits.

Many cases were drawn from public auditing competitions, where developers and researchers test their ability to find and fix weaknesses.

The dataset also includes examples from reviews of the Tempo blockchain, a payments-focused network designed for stablecoin transactions.

These scenarios reflect financial use cases where smart contracts handle sensitive value transfers.

To build the testing environment, OpenAI adapted existing exploit scripts and created new ones where needed.

All tests run in isolated systems, ensuring no live networks are affected.

Only publicly disclosed vulnerabilities were included, reducing the risk of exposing new threats.

Testing capabilities

EVMbench evaluates AI systems through three modes. In detection mode, agents analyse contract code to locate vulnerabilities.

In patch mode, they attempt to correct those weaknesses without disrupting functionality.

In exploit mode, agents simulate attacks by attempting to drain funds from vulnerable contracts in a controlled environment.

This structure allows researchers to assess AI performance across defensive and offensive tasks.
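As a rough illustration of how the three modes could be scored separately, the sketch below tallies a pass rate per mode. It is not OpenAI's harness: the `Outcome` fields, the pass criteria and the function names are all assumptions made for this example, loosely based on the mode descriptions above.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    DETECT = "detect"
    PATCH = "patch"
    EXPLOIT = "exploit"


@dataclass
class Outcome:
    """Hypothetical result of one agent run; every field name is an assumption."""
    mode: Mode
    found_issue: bool = False      # detect: agent reported the known vulnerability
    tests_pass: bool = False       # patch: existing functionality still works
    exploit_blocked: bool = False  # patch: the known exploit no longer succeeds
    funds_drained: bool = False    # exploit: attack succeeded in the sandbox


def task_passed(outcome: Outcome) -> bool:
    """Apply an assumed pass criterion for each mode."""
    if outcome.mode is Mode.DETECT:
        return outcome.found_issue
    if outcome.mode is Mode.PATCH:
        # A patch only counts if it fixes the flaw without breaking the contract.
        return outcome.tests_pass and outcome.exploit_blocked
    return outcome.funds_drained


def pass_rates(outcomes: list[Outcome]) -> dict[Mode, float]:
    """Return the fraction of passed tasks for each mode present in the input."""
    totals: dict[Mode, int] = {}
    passed: dict[Mode, int] = {}
    for outcome in outcomes:
        totals[outcome.mode] = totals.get(outcome.mode, 0) + 1
        passed[outcome.mode] = passed.get(outcome.mode, 0) + int(task_passed(outcome))
    return {mode: passed[mode] / totals[mode] for mode in totals}
```

Keeping the modes apart in this way is what lets a single model show a high exploit score alongside weaker detection and patching scores.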

The benchmark measures whether models can move beyond theoretical knowledge and operate effectively in blockchain conditions.

OpenAI also developed a custom testing framework to ensure results can be reproduced and verified.

This enables consistent comparison between models.

Performance results

OpenAI tested several advanced models using the benchmark. GPT-5.3-Codex scored 72.2% in exploit mode, compared with 31.9% for GPT-5, a model released six months earlier.
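To put the two reported exploit-mode scores side by side, the short calculation below works out the gap. It uses only the two figures quoted above; the variable names are illustrative.

```python
# Exploit-mode scores as reported in the article (percent of tasks solved).
gpt_5_3_codex = 72.2
gpt_5 = 31.9

# Absolute gap in percentage points, rounded to avoid floating-point noise.
gap_points = round(gpt_5_3_codex - gpt_5, 1)

# Relative factor: how many times higher the newer score is.
ratio = round(gpt_5_3_codex / gpt_5, 2)

print(f"gap: {gap_points} percentage points, ratio: {ratio}x")
```

On these figures the newer model's exploit score is a little over 40 percentage points higher, or somewhat more than double the earlier result.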

These results suggest AI agents perform best when the objective is explicit, as in exploit mode, where the goal is to drain funds from a vulnerable contract.

However, detection and patching performance remained lower.

This highlights the difficulty of identifying vulnerabilities and repairing smart contract logic.

Researchers found that AI systems struggled more when tasks required broader reasoning or deeper analysis of large codebases.

Security implications

OpenAI said EVMbench does not fully represent real-world blockchain environments.

Many production crypto systems undergo more extensive security reviews than the contracts included in the benchmark.

Certain threats, including timing-based attacks and multi-chain vulnerabilities, are outside the scope of the benchmark.

The system is intended to support defensive security efforts by helping researchers understand AI capabilities and limitations.

As AI tools become more capable, they could be used by both attackers and auditors.

Measuring performance helps reduce uncertainty and supports safer deployment.

Alongside the release, OpenAI said it is expanding security initiatives and allocating $10 million in API credits to support open-source security and infrastructure protection.

The company has made all EVMbench tools and datasets publicly available to encourage research and improve smart contract security.

The post OpenAI introduces EVMbench to measure AI crypto security appeared first on Invezz
