Generative AI, a strong boost for immersive experiences
Xavier Delgado Anglada
Technical Product Manager, Expert in Digital Workplace and VR en NTT DATA
The arrival of ChatGPT in November 2022 marked one of those disruptive innovations that can change everything in the years to come. This groundbreaking AI model, based on language models, quickly became the center of attention due to its accessibility, ease of use, and immense potential. It opened the door to entirely new ways of interacting with technology, leading to the emergence of new professions and opportunities for organizations.
Large Language Models like GPT allow, among other things, conversations that are surprisingly similar to those we have with other people. Their ability to generate content coherently and contextually translates into the possibility of providing context to a GPT model, allowing it to respond to the user and continue the conversation smoothly and meaningfully.
This evolution has transformed how people access information and interact with technology in various fields. Furthermore, when we merge generative AI with other technologies like virtual reality, the result is an enriched, highly accessible, and more natural experience in a fully immersive environment.
In this article, we will delve into the benefits that this powerful convergence of technologies brings while examining the challenges and prospects we can anticipate in this exciting horizon.
The result of the mmixx: more realistic immersive experiences
Integrating generative AI into immersive experiences results in a key benefit: enhancement and increased naturalness. Interactions become more akin to real-world interactions, elevating the level of immersion. The use of interaction devices like keyboards or mouse in virtual reality environments is not the most comfortable option with current devices, even considering the ongoing evolution in mixed reality. In immersive experiences, voice interaction and efficient recognition become the logical choice for engaging with the experience and receiving assistance or information.
In everyday life, we don't select options from a menu; we speak fluently. The use of natural language becomes a fundamental element within immersive experiences, allowing us to get even closer to real-life experiences.
Until the arrival of language models, we relied on Natural Language Processing (NLP) systems that could identify intentions and react to them but only within a predefined set of actions. They lacked the general and versatile knowledge exhibited by models like GPT. Additionally, with Large Language Models (LLMs), we can establish specific contexts and customize the conversation's tone, allowing the user to feel comfortable and the interaction to be meaningful.
Challenges and horizons of generative AI
Now that we've discussed the main use cases resulting from the union of generative AI and virtual reality, let's explore the challenges and next steps we can expect from this significant technological convergence.
Action as a response
One of the next steps is to ensure that NPCs not only respond to users but also take the most appropriate actions in each case. For example, in a commercial use case focused on selling items, we can expect that in the near future, we can ask AI about available colors in a model, and the AI will directly show them. In other words, we anticipate that AI will provide a coherent action based on what users want to do, demonstrating environmental knowledge rather than just textual knowledge.
More comfortable and accessible devices
Technological evolution in immersive experiences must be linked to the optimization of devices. Expectations include increasingly lightweight devices and more powerful conversations, allowing greater interaction between the user and AI. In the future, we might wear lightweight glasses, similar to those used in our daily lives, enabling us to interact with AI in a more natural and fluid manner, potentially revolutionizing the way we interact with machines.
Explainability, an unfinished subject
Explainability is one of the challenges of AI, and in the future, decisions made by AI are expected to be more transparent and understandable for users. This is a key consideration for trust and ethical decisions in AI, accelerating results and mitigating risks in areas such as health, finance, and justice. The field faces the significant challenge of addressing the complexity of models and ensuring clear interpretation to avoid biases.
Establishing ethical standards and governmental regulation is essential to promote responsible and transparent use of artificial intelligence in an ever-evolving field where explainability becomes integral to AI.
In conclusion, the combination of Generative AI in virtual reality opens a gateway to a universe of opportunities and a revolution in immersive experiences, proving that artificial intelligence is a powerful accelerator for the adoption of virtual reality. We are witnessing the beginning of a new way to relate, entertain, and collaborate in a virtual environment.
At NAKA, we have successfully implemented the integration of Generative AI for the development of immersive experiences, thus opening the door to all the benefits discussed in this article. We invite you to explore all the details of this integration, including aspects related to enterprise security, in our article NAKA Smart NPCs.
At NAKA, we have successfully implemented the integration of Generative AI for the development of immersive experiences, thus opening the door to all the benefits discussed in this article. Furthermore, on November 17th, we had the great opportunity to share our expertise as generative AI and immersive technologies experts at one of the most prominent events in the digital talent field: the third edition of Jump2Digital, held at Fira Barcelona. Learn more about our presentation here.