
OpenAI Launches New Reasoning Models o3 and o4-mini with Image Understanding
OpenAI released o3 and o4-mini, its most advanced reasoning models yet. They feature full ChatGPT tool access and can process visual inputs alongside text.
The most important AI News curated by The AI Track team, offering you a comprehensive view of the artificial intelligence landscape.

xAI’s Grok evolves with memory recall, a real-time editing workspace, and file access from Google Drive. New features bring Grok into enterprise AI territory.

Codex CLI is OpenAI’s open-source terminal tool for AI-driven coding. It enables local execution, autonomy control, multimodal input, and model selection.

Claude AI performs autonomous research using Google Workspace and web content, offering citation-backed, context-rich output for enterprise users and regulated industries.

Kuaishou’s Kling 2.0 pushes AI video boundaries with fluid motion, detailed prompts, and industry-leading cinematic quality for long-form content.

Veo 2 is now available to Gemini Advanced users with enhanced realism, prompt-based video generation, mobile uploads to TikTok and YouTube, and image-to-video animation via Whisk Animate.

DolphinGemma, an AI model co-developed by Google, Georgia Tech, and the Wild Dolphin Project (WDP), decodes dolphin vocalizations using 40 years of acoustic data. It supports real-time interaction and open scientific research.

Hugging Face has acquired Pollen Robotics to launch an open-source robotics platform, integrating Reachy 2 and LeRobot to enable AI-powered machines for all.

ByteDance’s Seaweed-7B achieves real-time AI video with audio-video sync, long-form storytelling, and 2K upscaling, all with 7B parameters and low GPU needs.