Large Language Model-Driven Immersive Agent.
World AI IoT Congress (2024)
Abstract
Recent research on Large Language Models (LLMs) has opened a new direction for AI agents that solve complex problems. This paper explores one such use case: the role of LLM-based AI agents in immersive technology, focusing specifically on GPT-4’s vision capabilities in Augmented Reality (AR). The paper uses the Smart App Agent Framework to recommend products. The resulting recommendation system helps users make context-aware decisions while shopping online.
Key words
AI agent, Large Language Models, LLM Agent, Augmented Reality, multimodal agent, GPT-4 vision