Skip to search boxSkip to navigationSkip to main content

AI-enhanced landmark recognition for self-guided tour application using large language models

Research Output: Chapter in Book/Report/Conference proceeding Conference contribution Peer-review

Abstract

Artificial intelligence (AI), particularly Large Language Models (LLMs), has created opportunities to improve user experiences by enabling the development of more interactive applications in various implementation scenarios. This paper proposes a mobile application as a virtual self-guided tour, enabling landmark recognition and enhanced user interaction with LLMs. A landmark classifier is employed for Cloud-based image classification, with accuracy further improved by incorporating GPS-based matching of classification results. These preliminary tests proved that the use of GPS to match the location improved the results and that the London Eye improved from 82 to 88 percent. Subsequently, users are provided with audio information about the identified landmark and access to extended landmark details generated by the used LLM. Users can also engage in text or voice-based interactions with the system. The system architecture integrates real-time image processing, location optimisation, and generative AI, creating interactive and engaging user interfaces.

Publication Information

Output type

Research Output: Chapter in Book/Report/Conference proceeding Conference contribution Peer-review

Original language

English

Article number

18

Pages from-to (Number of pages)

Pages 1 - 5

Publication milestones

  • Published - 21/09/2025

Publication status

Published - 21/09/2025

Publisher

Association for Computing Machinery, United States

Publication series

  • Publication series name: MobileHCI 2025 - Adjunct Proceedings of the 2025 Conference on Mobile Human-Computer Interaction

ISBN (Electronic)

9798400719707

External Publication IDs

  • Scopus: 105031906544

Host publication title

MobileHCI '25 Adjunct: Adjunct Proceedings of the 27th International Conference on Mobile Human-Computer Interaction

Host publication editors

  • Yomna Abdelrahman
  • Passant Elagroudy
  • Florian Alt