Experts and OpenAI Warn Users Against Developing Emotional Attachments to GPT-4o Voice Mode
OpenAI’s new ChatGPT feature, Voice Mode, has sparked both excitement and concern. Experts and the company alike have warned users against developing an emotional relationship with the AI feature. Discover the reasons behind these warnings and the potential impact of forming an emotional bond with the feature.
- Experts and OpenAI have warned users against developing an emotional relationship with the company’s new Voice Mode feature.
- The more human-sounding voices used in the feature are believed to carry both benefits and risks for individual users and society.
In July, OpenAI, the creator of ChatGPT, launched GPT-4o mini, a lighter version of its latest large language model (LLM), GPT-4o. The company then rolled out the Advanced Voice Mode to a limited number of ChatGPT Plus users this month. The update has sparked excitement and concern among users, and OpenAI itself has issued a caution: the new feature could lead users to form emotional bonds with the artificial intelligence (AI), with both positive and negative social implications.
Social Implications of the New Voice Mode
The feature allows users to hold voice conversations with ChatGPT in natural, human-sounding voices. This enhances accessibility and the user experience but could blur the line between machine and human. The development raises critical questions about the future of human relationships with machines and AI and the ethical responsibilities that creators like OpenAI must navigate.
One of the biggest concerns is the potential for users to build emotional attachments to AI, a phenomenon experts have warned about for years. Dr. Sherry Turkle, an MIT professor, cautions that “when technology becomes this intimate, we must ask ourselves what kinds of relationships we are fostering and what it means for our connections with real people.” Spoken words carry more emotional weight than text, which could make voice interactions more impactful and lead users to perceive the AI as more than a tool.
Early testers of the Voice Mode feature seem to bear this out. Testers who used the feature reported feeling a sense of connection with the AI, describing its voice responses as “reassuring” and “comforting.” According to some users, the feature could make interactions more engaging and personalized. Such feedback can be read as positive, since the AI’s human-like capabilities could fulfill the emotional needs of certain sections of society, such as lonely individuals or people otherwise deprived of emotional relationships.
However, this also comes with risks. As AI becomes more integrated into daily life, over-reliance on this technology for emotional needs may grow, impacting social dynamics and mental health.
OpenAI has acknowledged these risks. In its GPT-4o System Card, a technical document describing the measures taken to reduce potential risks, the company noted that the risk of users forming an emotional reliance on the model may be heightened by its audio capabilities, and it emphasized the importance of ongoing monitoring and safeguards to prevent unintended consequences.
Besides OpenAI, other companies have recognized the potential risk of AI assistants mimicking human interaction. For example, an author of a paper published in April by Google DeepMind said that chatbots’ ability to use language “creates this impression of genuine intimacy.”
Another key concern around the new Voice Mode is how it could reshape users’ social expectations. According to Dr. Margaret Mitchell, an AI ethics expert and former co-lead of Google’s Ethical AI team, “There’s a risk that as people grow accustomed to interacting with AI in human-like ways, they may start to expect similar interactions from real humans, which could alter social dynamics.” Some users of chatbots like Replika and Character AI have already reported tensions in their real-world relationships arising from their chat habits.
Other Risks of Voice Mode
Experts and OpenAI have also flagged several other potential risks of using the Voice Mode. These include the model’s ability to generate harmful, inappropriate, or unauthorized voice content, spread disinformation, amplify societal biases, and aid in the development of biological or chemical weapons.
Moving Forward
GPT-4o’s Voice Mode represents both a significant technological advancement and a set of potential risks. OpenAI has signaled its commitment to safeguarding against misuse and made clear that ensuring responsible use of the technology is critical. That said, the road ahead requires continuous vigilance and adaptation: given AI’s rapid development, today’s guardrails may not be sufficient tomorrow. This calls for ongoing refinement of technical safeguards and greater public awareness of the implications of, and precautions for, AI-driven voice technologies. Further, shared responsibility among policymakers, AI developers, and the public is crucial in navigating these uncharted waters.