AI-enhanced landmark recognition for self-guided tour application using large language models

University of Bedfordshire, School of Computing Engineering and Creative Industries

Research Output: Chapter in Book/Report/Conference proceeding Conference contribution Peer-review

Abstract

Artificial intelligence (AI), particularly Large Language Models (LLMs), has created opportunities to improve user experience by enabling more interactive applications across a range of scenarios. This paper proposes a mobile application that acts as a virtual self-guided tour, combining landmark recognition with LLM-enhanced user interaction. A landmark classifier performs cloud-based image classification, and accuracy is further improved by matching the classification results against the user's GPS location. Preliminary tests showed that GPS-based location matching improved the results, with results for the London Eye improving from 82 to 88 percent. Users are then given audio information about the identified landmark and access to extended landmark details generated by the LLM. Users can also interact with the system by text or voice. The system architecture integrates real-time image processing, location optimisation, and generative AI to create interactive and engaging user interfaces.
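
The GPS-based matching step mentioned in the abstract could be sketched as follows. This is an illustration under stated assumptions, not the authors' implementation. The landmark table, its approximate coordinates, the 1 km radius, and the function names `haversine_km` and `rerank_by_gps` are all hypothetical. The idea shown is simply to discard classifier candidates that are far from the user's position before picking the top-scoring label.

```python
import math

# Hypothetical landmark table: approximate (latitude, longitude) in degrees.
LANDMARKS = {
    "London Eye": (51.5033, -0.1196),
    "Tower Bridge": (51.5055, -0.0754),
    "Big Ben": (51.5007, -0.1246),
}

EARTH_RADIUS_KM = 6371.0


def haversine_km(a, b):
    """Great-circle distance in kilometres between two (lat, lon) points."""
    lat1, lon1, lat2, lon2 = map(math.radians, (*a, *b))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))


def rerank_by_gps(candidates, user_pos, radius_km=1.0):
    """Pick the best label among candidates located near the user.

    candidates: list of (label, score) pairs from an image classifier.
    user_pos:   (lat, lon) of the device.
    Falls back to the unfiltered candidates if none lie within radius_km.
    """
    nearby = [
        (label, score)
        for label, score in candidates
        if label in LANDMARKS
        and haversine_km(LANDMARKS[label], user_pos) <= radius_km
    ]
    pool = nearby or candidates
    return max(pool, key=lambda c: c[1])[0]


# Example: the classifier slightly prefers the wrong landmark,
# but the device is standing next to the London Eye.
prediction = rerank_by_gps(
    [("Tower Bridge", 0.55), ("London Eye", 0.40)],
    user_pos=(51.5030, -0.1190),
)
```

In this sketch the location filter only narrows the candidate set; a real system might instead weight scores by distance, which is a design choice the abstract does not specify.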

Publication Information

Original language

English

Pages from-to (Number of pages)

Pages 1 - 5

Publication milestones

Published - 21/09/2025

Publication status

Published - 21/09/2025

Publisher

Association for Computing Machinery, United States

Publication series

Publication series name: MobileHCI 2025 - Adjunct Proceedings of the 2025 Conference on Mobile Human-Computer Interaction

ISBN (Electronic)

9798400719707

External Publication IDs

Scopus: 105031906544

Host publication title

MobileHCI '25 Adjunct: Adjunct Proceedings of the 27th International Conference on Mobile Human-Computer Interaction

Host publication editors

Yomna Abdelrahman
Passant Elagroudy
Florian Alt

Access to documents

10.1145/3737821.3748524
