An international research team has developed a new benchmark that reveals the current limitations of LLMs. Even the most advanced models fail at 90 percent of the tasks, at least for now. The test, called ...
A new company called "The Stargate Project" is bringing together some of tech's biggest names to build what could become the largest AI infrastructure network in history. The joint venture between ...
Chinese AI startup DeepSeek has released two new AI models that it says match OpenAI's o1 in performance. Along with its main models, DeepSeek-R1 and DeepSeek-R1-Zero, it has also launched six ...
A new study by OpenAI shows that AI models become more robust against manipulation attempts if they are given more time to "think". The researchers also discovered new methods of attack. A recent ...
OpenAI has struck a new content licensing agreement with Axios, offering the news outlet funding to expand into four additional U.S. cities in exchange for access to its content for ChatGPT. The deal ...
OpenAI's AI reasoning expert Noam Brown says there is "lots of vague AI hype" on social media. While acknowledging there are "good reasons to be optimistic" about AI progress, Brown emphasized that ...
OpenAI has just launched Operator, an AI assistant that can navigate the web on its own. The tool, currently only available to US ChatGPT Pro subscribers, represents a step toward AI assistants that ...
Donald Trump has eliminated his predecessor's AI safety regulations, creating a regulatory gap for artificial intelligence development in the United States. In one of his first moves as president, ...
OpenAI's involvement in funding FrontierMath, a leading AI math benchmark, only came to light when the company announced its record-breaking performance on the test. Now, the benchmark's developer ...
Perplexity is stepping into Google's territory with a new AI assistant for Android that can control apps and handle tasks on its own. The move puts the startup in direct competition with Google's ...
While today's AI systems are typically trained once to handle various tasks like writing text and answering questions, they often struggle with new, unexpected challenges. Transformer² aims to solve ...
OpenAI is stepping into life sciences with a new LLM designed to optimize proteins. Early testing suggests the system might work better than human researchers at certain tasks. Working with startup ...