💥 Fed cuts sparks mid cap boom! ProPicks AI scores with 4 stocks +23% each. Get October’s update first.Pick Stocks with AI

ChatGPT can now see, hear and speak

Published 26/09/2023, 03:42 pm
© Reuters ChatGPT can now see, hear and speak

In a significant development, OpenAI has revealed major advancements to its flagship model, ChatGPT, with the integration of voice and image functionalities.

The voice functionality, enabled through a new text-to-speech model, allows users to "engage in a back-and-forth conversation with your assistant", according to OpenAI.

The updates will be rolled out to Plus and Enterprise users within the next two weeks, aiming to offer a "new, more intuitive type of interface".

Users can activate this feature via Settings → New Features on the mobile app.

This vocal interaction is facilitated by Whisper, OpenAI's open-source speech recognition system, and a range of voices developed in collaboration with professional voice actors.

Beyond voice capabilities, ChatGPT now offers image processing functionalities as well.

What is so special about image processing?

Users can "troubleshoot why your grill won’t start, explore the contents of your fridge to plan a meal, or analyse a complex graph for work-related data," said the company.

The image processing is driven by multimodal GPT-3.5 and GPT-4 models, accessible via a drawing tool on the mobile app.

Vision-based models also present new challenges, ranging from hallucinations about people to relying on the model’s interpretation of images in high-stakes domains.

Prior to broader deployment, ChatGPT tested the model with red teamers for risk in domains such as extremism and scientific proficiency, and a diverse set of alpha testers.

Phased deployment strategy

OpenAI has adopted a phased deployment strategy, emphasising the company's goal "to build AGI that is safe and beneficial".

The firm also highlighted potential risks, stating that voice technology opened doors to many creative and accessibility-focused applications, but also presented new challenges such as the potential for malicious actors to impersonate public figures or commit fraud.

In summary, OpenAI’s latest feature roll-out significantly broadens the capabilities of ChatGPT.

While initially available to Plus and Enterprise users, the company plans to extend these functionalities to a wider user base in the coming weeks.

Read more on Proactive Investors AU

Disclaimer

Latest comments

Risk Disclosure: Trading in financial instruments and/or cryptocurrencies involves high risks including the risk of losing some, or all, of your investment amount, and may not be suitable for all investors. Prices of cryptocurrencies are extremely volatile and may be affected by external factors such as financial, regulatory or political events. Trading on margin increases the financial risks.
Before deciding to trade in financial instrument or cryptocurrencies you should be fully informed of the risks and costs associated with trading the financial markets, carefully consider your investment objectives, level of experience, and risk appetite, and seek professional advice where needed.
Fusion Media would like to remind you that the data contained in this website is not necessarily real-time nor accurate. The data and prices on the website are not necessarily provided by any market or exchange, but may be provided by market makers, and so prices may not be accurate and may differ from the actual price at any given market, meaning prices are indicative and not appropriate for trading purposes. Fusion Media and any provider of the data contained in this website will not accept liability for any loss or damage as a result of your trading, or your reliance on the information contained within this website.
It is prohibited to use, store, reproduce, display, modify, transmit or distribute the data contained in this website without the explicit prior written permission of Fusion Media and/or the data provider. All intellectual property rights are reserved by the providers and/or the exchange providing the data contained in this website.
Fusion Media may be compensated by the advertisers that appear on the website, based on your interaction with the advertisements or advertisers.
© 2007-2024 - Fusion Media Limited. All Rights Reserved.