
Google AI Launches CURIE Benchmark to Advance Scientific Problem-Solving with AI
CURIE, SPIQA, and FEABench introduce rigorous benchmarks to evaluate LLMs on complex scientific tasks like long-form reasoning, multimodal analysis, and simulation.
The most important AI News curated by The AI Track team, offering you a comprehensive view of the artificial intelligence landscape.
CURIE, SPIQA, and FEABench introduce rigorous benchmarks to evaluate LLMs on complex scientific tasks like long-form reasoning, multimodal analysis, and simulation.
Papa Johns and Google Cloud announce a partnership to utilize AI technologies, aiming to enhance customer personalization and streamline delivery processes.
Artificial Intelligence is set to become a $4.8 trillion global market by 2033. UNCTAD’s latest report warns that without immediate measures, the benefits may be confined to a privileged few, exacerbating global inequalities.
UK authors protested Meta’s alleged use of over 7.5 million pirated books from LibGen for AI training. The protest gained momentum across social media and in courtrooms, raising ethical and legal questions about AI data sourcing.
Anthropic debuts Claude for Education, an AI tool tailored for colleges, offering critical thinking support, administrative automation, and full-campus partnerships.
Hollywood’s adoption of artificial intelligence in filmmaking presents both groundbreaking opportunities and significant challenges, as the industry seeks to balance innovation with artistic integrity.
OpenAI’s $40 billion SoftBank-led funding round values the company at $300 billion, with $18 billion earmarked for the Stargate AI data center project. The deal follows governance turmoil, including Altman’s 2023 ouster and safety team departures.
Runway’s Gen-4 AI model introduces consistent character and object generation, world physics simulation, and cinematic scene control—raising industry standards.
Alibaba introduced Qwen2.5-Omni-7B, a multimodal AI capable of processing text, images, audio, and video efficiently on smartphones and laptops, outperforming Google’s Gemini model in benchmarks.