Today: 1 November 2025
26 September 2023
2 mins read

OpenAI’s ChatGPT can now see, hear and speak

Image understanding is powered by multimodal GPT-3.5 and GPT-4. These models apply their language reasoning skills to a wide range of images, such as photographs, screenshots, and documents containing both text and images…reports Asian Lite News

Sam Altman-run OpenAI on Monday announced it is rolling out new voice and image capabilities in ChatGPT that can now help the AI chatbot see, hear and speak.

These capabilities offer a new, more intuitive type of interface by allowing you to have a voice conversation or show ChatGPT what you’re talking about, the company said in a statement.

“Voice mode and vision for chatGPT! really worth a try,” Altman posted on X. The company said it is rolling out voice and images in ChatGPT to Plus and Enterprise users over the next two weeks.

“Voice is coming on iOS and Android (opt-in in your settings) and images will be available on all platforms,” said the Microsoft-backed company. The new voice capability is powered by a new text-to-speech model, capable of generating human-like audio from just text and a few seconds of sample speech.

“We collaborated with professional voice actors to create each of the voices. We also use Whisper, our open-source speech recognition system, to transcribe your spoken words into text,” said OpenAI.

Image understanding is powered by multimodal GPT-3.5 and GPT-4. These models apply their language reasoning skills to a wide range of images, such as photographs, screenshots, and documents containing both text and images.

The new voice technology opens doors to many creative and accessibility-focused applications. However, “these capabilities also present new risks, such as the potential for malicious actors to impersonate public figures or commit fraud,” the company noted.

“This is why we are using this technology to power a specific use case — voice chat. Voice chat was created with voice actors we have directly worked with,” it added. Spotify is using the power of this technology for the pilot of their Voice Translation feature, which helps podcasters expand the reach of their storytelling by translating podcasts into additional languages in the podcasters’ own voices.

“We’ve also taken technical measures to significantly limit ChatGPT’s ability to analyze and make direct statements about people since ChatGPT is not always accurate and these systems should respect individuals’ privacy,” said the company.

ALSO READ-Twitter seizes @x handle without warning or paying owner

Previous Story

EAM holds bilateral with Madagascar counterpart

Next Story

Google to shut Basic HTML version of Gmail in Jan 2024

Latest from -Top News

Sharjah sets sail for London

At World Travel Market (WTM) London 2025, Sharjah is set to highlight both its deep-rooted cultural identity and forward-looking tourism strategy, demonstrating the emirate’s growing influence in global travel and destination marketing…reports

DP World to pump $5bn into India

DP World’s $5bn pledge strengthens India’s maritime future, powering green shipping, ship-repair capacity, talent growth and next-gen freight tech while deepening the nation’s global trade role…reports Asian Lite News DP World has

UAE mega-aid ship docks in Egypt

The shipment, one of the largest single consignments dispatched as part of the UAE’s air and sea bridge, includes essential food items…reports Asian Lite News A UAE humanitarian vessel carrying more than

Arab bloc hails UAE’s efforts in Gaza

Arab Parliament lauds UAE’s major aid efforts in Gaza and urges unified global backing for reconstruction, Palestinian statehood and a lasting peace grounded in the Two-State framework…reports Asian Lite News The Speaker

Defence bridges rise as UAE General visits India

UAE Land Forces Chief’s New Delhi visit strengthens defence ties with India, deepening cooperation in technology, training and strategic planning through high-level talks, industry briefings and ceremonial engagements….reports Asian Lite News The
Go toTop

Don't Miss

New AI tech to pick donor organs for transplant

The OrQA assessment will majorly look for damage, pre-existing conditions

ChatGPT fails when it comes to accounting

On a 11.3 per cent of questions, ChatGPT scored higher