Iran Press/ Sci & Tech: Google is facing scrutiny over its demonstration video for its newly launched artificial intelligence model, Gemini.
On Wednesday, just weeks before the year’s end, Google launched what it considers its largest and most capable AI model Gemini and presented a demonstrative video to media outlets and the public.
The six-minute video includes spoken conversations between the user and a Gemini-powered chatbot, and also shows Gemini’s ability to recognize visual pictures and physical objects and know the difference. Some of the capabilities were impressive, such as Gemini’s ability to voice aloud a description of drawings of a duck, and describing a drawing of a duck versus a rubber duck, among other examples.
The company’s description on YouTube includes a short line that says, “For the purposes of this demo, latency has been reduced, and Gemini outputs have been shortened for brevity.” However, it doesn’t make that disclaimer in the video itself.
Following the launch, the company later confirmed to Bloomberg the demo wasn’t conducted in real time, but instead used still images and fed text prompts that Gemini responded to, as previously pointed out by The Information. The author noted that was “quite different” from what Google seemed to be suggesting: “that a person could have a smooth voice conversation with Gemini as it watched and responded in real-time to the world around it.”
After multiple requests for comment, the company on Friday told CNBC in a statement, “The video is an illustrative depiction of the possibilities of interacting with Gemini, based on real multimodal prompts and outputs from testing. We look forward to seeing what people create when access to Gemini Pro opens on December 13.”
214
Read More:
ChatGPT upgrade: AI chatbot can now 'see, hear and speak'