Large Language Model-Driven Immersive Agent.

Aditi Singh, Saket Kumar, Abul Ehtesham,Tala Talaei Khoei,Deepshikha Bhati

World AI IoT Congress(2024)

Cited 0|Views0
No score
Abstract
Recent research in the field of Large Language Models (LLMs) has given a new direction to the capabilities of AI agents for solving complex problems. This paper attempts to explore one such use case to investigate LLMs-based AI agents’ role in immersive technology, specifically focusing on GPT-4’s vision capabilities in Augmented Reality (AR). The paper utilizes Smart App Agent Framework for recommending products. This recommendation system assists users to make context-aware decisions during their online shopping experience.
More
Translated text
Key words
AI agent,Large Language Models,LLM Agent,Augmented Reality,multimodal agent,GPT-4 vision
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined