News Report Technology

The latest updates to OpenAI's ChatGPT include exciting voice dialogue options and the ability to communicate through images.

In Brief

In the upcoming fortnight, OpenAI will begin the rollout of new image and voice features in ChatGPT.

These innovative functionalities will initially be accessible only to Plus and Enterprise users.

OpenAI Today, the announcement confirmed the deployment of advanced voice and image interaction features in ChatGPT, allowing users to chat vocally or explore images.

This revelation follows reports from Reddit users claiming access to OpenAI’s internal AI models, with one user sharing insights about a model dubbed Arrakis, which allegedly enables multi-modal inputs like text, audio, and video.

"The new voice feature utilizes an advanced text-to-speech model that creates lifelike audio from text combined with just a few seconds of voice samples,\" stated OpenAI. blog post We collaborated with professional voice actors to develop a variety of unique voice options. Additionally, we harness our own speech recognition technology, Whisper, to convert spoken language into text.

Thanks to the newly introduced features, users can have dynamic conversations with ChatGPT using their voices, and they can also discuss and analyze images together with the chatbot. The rollout is set to occur over the next fortnight for Plus and Enterprise tier users.

Voice capabilities will soon be available on iOS and Android devices as optional features, while the image interaction is set to be launched across all platforms.

To engage the voice feature, users can navigate to Settings → New Features on the mobile app and select 'voice conversations.' Then, pressing the headphone icon on the top-right of the home screen lets users choose from a selection of five distinct voices.

For discussions involving images, users simply tap the camera button to take a photo or choose an existing one. On iOS or Android, they need to press the plus icon before proceeding. Moreover, users can initiate conversations about several images or utilize them to guide the chatbot's responses.

OpenAI states that image comprehension relies on multimodal models GPT-3.5 and GPT-4, which utilize their linguistic reasoning capabilities to scrutinize a wide variety of visual content, ranging from photographs to screenshots and documents that blend text with imagery.

OpenAI’s partnership with Spotify

Spotify also today announced this dynamic translation feature uses OpenAI's Whisper tool to transcribe English speech and convert other languages into English.

According to The Verge As part of its pilot initiative, the organization has partnered with well-known podcasters like Dax Shepard, Monica Padman, Lex Fridman, Bill Simmons, and Steven Bartlett to craft AI-driven translations in languages such as Spanish, French, and German for select episodes and upcoming titles.

"We are convinced that a considered approach to AI can forge stronger connections between creators and audiences, which aligns with Spotify’s goal of unlocking the creativity of humanity,” stated Ziad Sultan, Spotify's VP of Personalization.

Episodes translated into different languages by the pilot creators will be accessible globally to both Premium and Free Spotify users.

However, please keep in mind that the information on this page is not meant to serve as legal, investment, financial, or other professional advice. Always invest responsibly and seek independent financial counsel if uncertainty arises. For comprehensive details, consider reviewing the terms and conditions and support resources offered by the issuer or advertiser. MetaversePost strives for accuracy and impartial reporting, but market dynamics can change rapidly.

Disclaimer

In line with the Trust Project guidelines Cindy, a journalist at Metaverse Post, focuses on topics surrounding web3, NFTs, the metaverse, and AI. She emphasizes interviewing key players in the Web3 space, having engaged with over 30 executives, thus bringing valuable perspectives to the audience. Originally from Singapore and now based in Tbilisi, Georgia, Cindy holds a Bachelor’s degree in Communications & Media Studies from the University of South Australia, paired with ten years of experience in journalism.

Let’s delve into various programs that leverage the power of digital currencies for philanthropic purposes.

AlphaFold 3, Med-Gemini, and other innovations: The transformative impact of AI in healthcare as we move into 2024.

Know More

AI is emerging in healthcare in numerous beneficial ways, from identifying novel genetic links to powering advanced robotic surgical systems.

Copyright, Permissions, and Linking Policy

Know More
Read More
Read more
News Report Technology
Binance has introduced a new fund accounts solution designed to lower entry barriers for fund managers entering the exchange.
Business News Report Technology
Sophon has launched Smart Accounts to streamline blockchain accessibility across various entertainment ecosystems.
News Report Technology
Cryptocurrencylistings.com's new CandyDrop feature aims to simplify cryptocurrency acquisition while enhancing user engagement with quality projects.
News Report Technology
Exploring the journey from Ripple to The Big Green DAO: A study of how cryptocurrency initiatives contribute to charitable causes.