
OpenAI Launched New Audio Models
OpenAI introduced three enhanced audio models, including vibe-controlled voice synthesis and Whisper-beating transcription, priced 85% below market leader ElevenLabs, as AI operating costs plummet industry-wide.
Claude 3.7 Sonnet now offers web search for U.S. paid users, citing sources like Reuters. Use cases span finance and sales, but energy costs and Google’s 90% search dominance loom.
Tencent’s Hunyuan3D-2.0 generates 3D assets in 30 seconds using multi-modal inputs and a two-stage pipeline, targeting gaming, VR, and industrial design with open-source tools.
Roblox has released Cube 3D, an open-source foundational model for generative AI, enabling developers to create 3D objects and scenes from text prompts. The beta mesh generation API is now available in Roblox Studio.
Baidu launches Ernie 4.5 at 1% of GPT-4.5’s cost, along with a free AI chatbot, using PaddlePaddle and Kunlun chips to challenge global rivals.
Cohere launched Command A, an AI model requiring only 2 GPUs that matches GPT-4o in enterprise tasks while processing 23 languages and costing 50% less for private deployments.
Manus AI, developed by Butterfly Effect, operates independently using multi-agent systems to execute complex tasks, though crashes and feedback loops highlight early-stage limitations.
Alibaba’s R1-Omni AI combines visual, audio, and textual analysis with RLVR to achieve state-of-the-art emotion recognition. It is now available as open source for applications in education, customer service, and entertainment.
Google’s Gemma 3 AI model combines multi-modal capabilities with single-GPU efficiency, offering developers cost-effective access to advanced AI tools. The update includes enhanced image processing, safety filters, and academic support.