OpenAI unveiled GPT-4o, a new AI model it claims is much faster than the previous version. The new model has improved capabilities across text, video, and audio.
CTO Mira Murati stated that GPT-4o brings the intelligence of its product to the company’s free users for the first time, but noted paying customers would still have up to five times more capacity. CEO Sam Altman stated the model is natively multimodal, capable of generating content or understanding commands in voice, text, and images. The o in GPT-4o stands for omni.
Murati said the latest version will also have the memory capability to learn from previous conversations, which makes it similar to an AI assistant. “This is the first time that we are really making a huge step forward when it comes to the ease of use,” she explained. “This interaction becomes much more natural and far, far easier”.
She noted that GPT-4o is twice as fast as GPT-4 Turbo at half the cost. It can understand 50 different languages while providing improved speed and quality. It can respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which the company stated is like human response times in a conversation. The GPT-4o features will be rolling out over the coming weeks.
In collaboration with the company Markoja, the Faculty of Transport and Traffic Sciences in Zagreb, and the airports of Zagreb, Zadar, and Pula, Hrvatski Telekom has presented the NextGen 5G Airports project.
Infobip announced its Conversational Experience Orchestration Platform (CXOP), a solution that places agentic AI at the heart of every customer interaction.