Shizuoka University
Okabe Lab

LMM-Based Indoor Navigation System
via Automatic Map Dictionary Construction

Toya Nakamura

Shizuoka University

Makoto Okabe

Shizuoka University

研究のTeaser画像

Surrounding photo and destination (left), final result (right). The user inputs the destination and a photo of the surroundings taken with a smartphone. The system then uses an LMM to estimate the current location from information such as signboards in the image, and outputs the final result as a guided route on the map.

Abstract

In public spaces such as large commercial facilities and train stations, users are often forced to rely on installed signboards and paper maps, making it easy for them to lose track of their current location or misjudge their direction of travel. This research aims to realize an intuitive navigation system to solve this problem. The core of the proposed technology is the application of a Large Multimodal Model (LMM), which possesses both image recognition and language understanding capabilities. By analyzing unstructured map images and converting irregularly placed store names and complex pathway shapes into structured data, the system automatically constructs a map dictionary and a routing graph essential for navigation.

Paper

Master's Thesis Interim Report (2026)

Video

Material

Master's Thesis Interim Report (2026)

Citation

  • Toya Nakamura
    LMM-Based Indoor Navigation System via Automatic Map Dictionary Construction
    Master's Thesis Interim Report, January 2026