OpenAI Enhances ChatGPT with Image and Voice Input Features

OpenAI has introduced new features to its popular AI-powered chatbot, ChatGPT, allowing users to use images and voice inputs when asking questions, these features will initially be available to ChatGPT Plus subscribers and business customers over the next two weeks.

With these new features, ChatGPT can accept images as input when responding to user queries, users can capture images using their device’s camera or upload images from their device.

This functionality can be used in various scenarios, such as taking a picture of a math problem and asking for a solution, inquiring about a recipe by uploading a picture of the ingredients, or asking questions about the content of a specific image, ChatGPT can identify elements within images, enabling more interactive interactions.

Additionally, users can take advantage of the voice input feature by clicking on the microphone icon and speaking directly to the chatbot, ChatGPT utilizes speech recognition technology to understand user queries and responds in a voice format as well.

OpenAI recently integrated the DALL-E model into ChatGPT, enabling it to generate images as part of its responses.

OpenAI also collaborates directly with companies like Spotify to leverage text-to-speech capabilities, allowing podcasters to translate their content into other languages using their own voices.

These new features come as reports suggest a decline in ChatGPT users in recent months. It’s worth noting that Google’s BERT and Bing Chat have offered voice and image search capabilities for several weeks.


Related:

The Author:

Leave A Reply

Your email address will not be published.



All content published on the Nogoom Masrya website represents only the opinions of the authors and does not reflect in any way the views of Nogoom Masrya® for Electronic Content Management. The reproduction, publication, distribution, or translation of these materials is permitted, provided that reference is made, under the Creative Commons Attribution 4.0 International License. Copyright © 2009-2024 Nogoom Masrya®, All Rights Reserved.