The Five Key Takeaways of This Blog Post
- All signs are pointing toward voice-assistant A.I. becoming a major technology in the coming years.
- Right now, major tech companies are ironing out some of the challenges standing in the way of making this type of A.I. a truly global feature.
- One life-imitating-art problem (namely, life imitating the 2013 Spike Jonze-directed movie Her) that OpenAI, the creator of ChatGPT, has identified is the likelihood that people will become emotionally reliant on voice-assistant A.I.
- A significant roadblock is getting A.I. to proficiently understand a range of different accents, even within a single language. Consider, for instance, that the so-called Minnesota accent (ever seen Fargo? That is the accent many of the film's characters have) is apparently one of the most difficult for A.I. to understand.
- Another issue is training the A.I. on new languages. There are simply so many languages out there, and the costs of training an A.I. system to an acceptable level of proficiency (according to the tech companies, at least) are quite high.
How Far Are We from Ubiquitous Voice-Assistant A.I.?
Yes, most people in the United States who have a smartphone are using an iPhone, but how many of us are really using Siri?
Some analysts may point to Siri's lack of popularity relative to simply typing a query into a browser's search bar as evidence that this type of A.I. may not capture the public's attention for a sustained period of time.
But when the Siris of tomorrow start to sound more and more like actual human beings, and can fetch (and create) data with unprecedented speed and efficiency for an A.I. voice assistant, then we will see whether this technology becomes a huge part of smartphone users' daily lives.
However, there are indeed some big hurdles to clear before this technology can truly capture that attention. We enumerated them in the Key Takeaways section that opened this blog post, and we will go into more detail below.
Variations in Accents, and Languages
Most of us Americans are familiar with how much accents, to say nothing of regional dialects, can change the way a language is used.
In the context of voice-assistant A.I., this means that an A.I. will need to adapt to the wide range of accents that speakers of a particular natural language may have. Otherwise, frustrating misunderstandings will persist.
Likewise, the sheer number of languages in the world makes adapting this technology across the globe particularly challenging. You can certainly expect super-common languages like English and Mandarin Chinese to be supported, but the A.I. for smaller languages may lag behind in quality or even availability.
OpenAI’s Emotional Reliance Issue
Buried deep in a report with the decidedly dry-sounding name (for some readers, at least) "GPT-4o System Card" is a section called "Societal impacts," whose subsection "Anthropomorphization and emotional reliance" describes exactly those risks.
Anthropomorphization is the attribution of human-like features to non-human things. If you have seen and remember enough of Her, the movie referenced in the Key Takeaways section that opened this blog post and eerily echoed in OpenAI's now-recalled Scarlett Johansson-soundalike voice, then you will have a clear picture of what this looks like in relation to voice-assistant A.I.
Basically, some people will come to feel that the A.I. is something more than an assistant, such as a friend, or even something more than a friend (again, recall or look up a clip from Her).
Emotional reliance is also listed as one of the risks of OpenAI’s in-development voice-assistant A.I. platform.
The issue becomes even more serious when one considers the risk of voice-assistant A.I. “hallucinating” by giving erroneous, perhaps even harmful, advice to a user that has both anthropomorphized and formed an emotional attachment to a voice-assistant A.I.
Giving your attention to an A.I. platform is one thing; giving it your trust is another entirely. For business owners looking to avoid major liabilities with their businesses' own voice-assistant A.I.s of the future, it is worth considering disclaimers in, and even conversational limits on, the interactions between customers and the A.I.
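To make the disclaimer-and-limits idea concrete, here is a minimal sketch of what such guardrails could look like in code. Everything here is hypothetical: the `VoiceAssistantSession` class, the turn cap, and the disclaimer cadence are illustrative choices, not a real vendor API.

```python
# Hypothetical sketch: wrapping an assistant session with a standing
# disclaimer and a simple conversational turn limit. The class and its
# methods are illustrative, not part of any real voice-assistant SDK.

DISCLAIMER = (
    "Reminder: you are talking to an automated assistant. "
    "Its responses may be inaccurate and are not professional advice."
)

MAX_TURNS = 20  # cap the length of any single conversation


class VoiceAssistantSession:
    def __init__(self):
        self.turns = 0

    def respond(self, user_message: str) -> str:
        # Enforce the conversational limit before generating anything.
        if self.turns >= MAX_TURNS:
            return "This session has reached its limit. Please start a new one."
        self.turns += 1
        reply = self._generate_reply(user_message)
        # Surface the disclaimer on the first turn, then re-surface it
        # periodically so long chats don't bury it.
        if self.turns == 1 or self.turns % 10 == 0:
            reply = f"{DISCLAIMER}\n{reply}"
        return reply

    def _generate_reply(self, user_message: str) -> str:
        # Placeholder for the actual model call.
        return f"(model reply to: {user_message})"
```

The point of the design is that the disclaimer and the cap live outside the model itself, so they hold even if the model's conversational behavior drifts.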
Something that OpenAI points to as a potentially disruptive societal influence is the deferential nature of voice-assistant A.I., which lets users interrupt and take charge of the conversation at any time.
Given that the more advanced voice-assistant A.I. models are essentially built to mimic human conversation in certain superficial respects (namely, spoken back-and-forths in an artificial human-like voice), this could condition people to bristle at the lack of such control in human-to-human conversations.