S4.7 - PNT in Urban Environments (I)
Tracks
Track: Application Areas
| Thursday, April 30, 2026 |
| 10:00 AM - 11:20 AM |
| Room N2 |
Details
Co-Chairs: Charles Toth & Ivan Petrunin
Speaker
Mr. Philipp Hager
Member of Scientific Staff
German Aerospace Center (DLR)
Indoor Localization Using Dual-Band Wi-Fi Radio Maps Generated from Foot-Mounted Inertial Trajectories
10:00 AM - 10:20 AMAbstract text
Accurate and scalable indoor pedestrian positioning remains challenging due to the absence of Global Navigation Satellite System (GNSS) signals and the complex spatial variability of indoor radio propagation. Wi-Fi fingerprinting is a widely adopted solution, estimating user’s position by matching received signal strength indicator (RSSI) measurements to a pre-recorded database of signal signatures. However, the construction of high-quality radio maps is limited by two fundamental factors: the lack of reliable ground-truth trajectories during data collection and the low temporal resolution of Wi-Fi scanning on consumer smartphones imposed by power management and operating system constraints.
Smartphone-based site surveys are therefore labor-intensive and result in sparse spatial sampling, with meter-scale spacing between consecutive fingerprints at typical walking speeds. This undersampling limits the ability to capture localized signal variations near doors, corridor transitions, and regions with strong RSSI gradients. Crowdsourced mapping approaches further suffer from inaccurate position labels derived from consumer-grade inertial sensors with rapidly accumulating drift, leading to degraded radio map quality.
This work presents a hybrid indoor mapping framework that decouples high-accuracy trajectory generation from scalable end-user localization. During an offline mapping phase, both pedestrian dead reckoning and Wi-Fi fingerprint collection are performed on a foot-mounted platform integrating an inertial measurement unit (IMU) and dual-band Wi-Fi hardware. The foot-mounted IMU exploits gait-induced zero-velocity updates (ZUPT) to generate drift-bounded pedestrian trajectories, while time-synchronized Wi-Fi measurements are collected along the same path. This synchronization enables automatic assignment of accurate position labels to each fingerprint without manual surveying. Continuous high-rate scanning yields spatially dense radio maps that better capture fine-grained signal variations compared to smartphone-based collection.
Dual-frequency Wi-Fi measurements in the 2.4 GHz and 5 GHz bands are modeled using Gaussian Process regression to account for spatial correlation and measurement uncertainty. In the online phase, arbitrary consumer smartphones perform localization using only ambient Wi-Fi observations and the pre-constructed radio map, without requiring specialized hardware or inertial sensing. Experiments conducted in a real-world office environment with multiple users and heterogeneous devices demonstrate meter-level positioning accuracy and consistent performance improvements over single-band Wi-Fi fingerprinting.
Smartphone-based site surveys are therefore labor-intensive and result in sparse spatial sampling, with meter-scale spacing between consecutive fingerprints at typical walking speeds. This undersampling limits the ability to capture localized signal variations near doors, corridor transitions, and regions with strong RSSI gradients. Crowdsourced mapping approaches further suffer from inaccurate position labels derived from consumer-grade inertial sensors with rapidly accumulating drift, leading to degraded radio map quality.
This work presents a hybrid indoor mapping framework that decouples high-accuracy trajectory generation from scalable end-user localization. During an offline mapping phase, both pedestrian dead reckoning and Wi-Fi fingerprint collection are performed on a foot-mounted platform integrating an inertial measurement unit (IMU) and dual-band Wi-Fi hardware. The foot-mounted IMU exploits gait-induced zero-velocity updates (ZUPT) to generate drift-bounded pedestrian trajectories, while time-synchronized Wi-Fi measurements are collected along the same path. This synchronization enables automatic assignment of accurate position labels to each fingerprint without manual surveying. Continuous high-rate scanning yields spatially dense radio maps that better capture fine-grained signal variations compared to smartphone-based collection.
Dual-frequency Wi-Fi measurements in the 2.4 GHz and 5 GHz bands are modeled using Gaussian Process regression to account for spatial correlation and measurement uncertainty. In the online phase, arbitrary consumer smartphones perform localization using only ambient Wi-Fi observations and the pre-constructed radio map, without requiring specialized hardware or inertial sensing. Experiments conducted in a real-world office environment with multiple users and heterogeneous devices demonstrate meter-level positioning accuracy and consistent performance improvements over single-band Wi-Fi fingerprinting.
Biography
Philipp Hager is a Ph.D. student in the Department of Communications and Navigation at German Aerospace Center. His research interests include multi sensor fusion and indoor localization. He obtained his M.Sc. in Embedded Systems Design from University of Applied Sciences Upper Austria in 2023.
Mr. Fabian Theurl
University Research Assistant
Graz University Of Technology, Institute of Geodesy
Indoor Position Fixing Based on Fiducial Marker Detection in Low-Cost 360° LiDAR Data
10:20 AM - 10:40 AMAbstract text
Most indoor position fixing methods rely on active radio frequency-emitting infrastructure such as WiFi access points, Bluetooth low energy beacons, or ultra-wideband anchors. In this paper, we present a novel and robust position fixing method which detects printed fiducial markers with known coordinates in 3D Light Detection and Ranging (LiDAR) point clouds. The method does not require active infrastructure, only an active sensor and power supply at the mobile robot. It is therefore well-suited for position fixing in challenging environments, as it supports navigation in the dark and in settings where powered infrastructure is unavailable.
Our method processes point clouds from a low-cost, non-repetitive 360° horizontal field of view LiDAR sensor, which are accumulated during short stationary phases of a mobile robot. The resulting point clouds are then converted to spherical images where the intensity of each individual LiDAR point is mapped as the gray value of the corresponding pixel. We apply image processing techniques and a custom kernel operation designed to fill the sparsely populated spherical image. Since the intensity values vary with the distance to the target and the angle of reflection of the emitted light pulse, we propose a robust processing chain to reliably detect the AprilTag markers in challenging conditions. The robot's position is then estimated using a weighted non-linear least-squares adjustment where the ranges to each detected marker corner represent the individual measurements.
We evaluate our approach in two steps using indoor test datasets with ground truth measurements from a camera tracking system. First, we determine the accuracy of individual position fixes with a varying number of tags in the robot's surroundings. In a second analysis, we show how the method can improve a LiDAR-based SLAM solution of the robot when it enters an environment with fiducial markers.
Our method processes point clouds from a low-cost, non-repetitive 360° horizontal field of view LiDAR sensor, which are accumulated during short stationary phases of a mobile robot. The resulting point clouds are then converted to spherical images where the intensity of each individual LiDAR point is mapped as the gray value of the corresponding pixel. We apply image processing techniques and a custom kernel operation designed to fill the sparsely populated spherical image. Since the intensity values vary with the distance to the target and the angle of reflection of the emitted light pulse, we propose a robust processing chain to reliably detect the AprilTag markers in challenging conditions. The robot's position is then estimated using a weighted non-linear least-squares adjustment where the ranges to each detected marker corner represent the individual measurements.
We evaluate our approach in two steps using indoor test datasets with ground truth measurements from a camera tracking system. First, we determine the accuracy of individual position fixes with a varying number of tags in the robot's surroundings. In a second analysis, we show how the method can improve a LiDAR-based SLAM solution of the robot when it enters an environment with fiducial markers.
Biography
- Fabian Theurl
- Research Assistant at the Institute of Geodesy, Working Group Navigation at the Technical University of Graz.
- Sensor fusion for autonomous robots
- Fiducial Marker Detection for Position Fixing using a LiDAR sensor.
Dr. Josef Krska
Researcher
Czech Technical University In Prague, Faculty Of Electrical Engineering
Joint Synchronization and SS-TWR Multilateration for Dynamic Anchor Deployment in UWB Positioning Network
10:40 AM - 11:00 AMAbstract text
Ultra-Wide Band (UWB) positioning networks are comprised of anchor (infrastructure) and tag (user) devices. For the positioning of the tags, the position of each of the anchors is required to be known. This network determination process is performed before the network operation (i.e. tag positioning), it is typically time-consuming and often requires expensive equipment. Yet, repositioning and adding of anchors during the network’s operation is beneficial, improving flexibility and positioning robustness.
Considering Time Difference of Arrival (TDoA) positioning method, the anchors need to be precisely synchronized. Consequently, when an anchor is added to the network or moved, both their position and distance to other anchors needs to be retrieved. Specifically, for Downlink TDoA, where tags are passive and anchors transmit periodically, the positioning broadcast messages can be used for positioning of both the tags and the anchors.
In this work we propose to utilize the synchronization Kalman Filter (KF), which tracks time bias, drift and drift rate to the reference timescale, to compensate the drift error in inter-anchor Single-Sided Two-Way Ranging (SS-TWR). In contrast, the traditional SS-TWR implementation relies on drift estimation through carrier frequency offset (CFO) observations that are approximately two orders of magnitude worse than the KF-based drift estimate and often impaired by observable bias.
The SS-TWR leverages on the already present joint synchronization and downlink positioning broadcast messages. The overhead required to distribute the necessary timestamps is marginal and is substantially more effective than employing dedicated two-way ranging messages. The inter-anchor ranges provided by the SS-TWR and positions of already determined anchors within the localization network are then directly used to estimate the unresolved anchor's position through multilateration.
This approach enables the addition of new anchors to existing networks and to reposition the anchors within the network while the network is online, without interruption of the tag positioning service. Also, the approach provides the means for sanity checks of the anchor positions and tampering detection. The algorithm design aims to minimize the additional load introduced to the localization network, to maximize the available tag localization capacity.
Considering Time Difference of Arrival (TDoA) positioning method, the anchors need to be precisely synchronized. Consequently, when an anchor is added to the network or moved, both their position and distance to other anchors needs to be retrieved. Specifically, for Downlink TDoA, where tags are passive and anchors transmit periodically, the positioning broadcast messages can be used for positioning of both the tags and the anchors.
In this work we propose to utilize the synchronization Kalman Filter (KF), which tracks time bias, drift and drift rate to the reference timescale, to compensate the drift error in inter-anchor Single-Sided Two-Way Ranging (SS-TWR). In contrast, the traditional SS-TWR implementation relies on drift estimation through carrier frequency offset (CFO) observations that are approximately two orders of magnitude worse than the KF-based drift estimate and often impaired by observable bias.
The SS-TWR leverages on the already present joint synchronization and downlink positioning broadcast messages. The overhead required to distribute the necessary timestamps is marginal and is substantially more effective than employing dedicated two-way ranging messages. The inter-anchor ranges provided by the SS-TWR and positions of already determined anchors within the localization network are then directly used to estimate the unresolved anchor's position through multilateration.
This approach enables the addition of new anchors to existing networks and to reposition the anchors within the network while the network is online, without interruption of the tag positioning service. Also, the approach provides the means for sanity checks of the anchor positions and tampering detection. The algorithm design aims to minimize the additional load introduced to the localization network, to maximize the available tag localization capacity.
Biography
Josef Krška received B.S. and M.S. degrees in control engineering from the Czech Technical University in Prague (CTU), Faculty of Electrical Engineering (FEE), in 2018 and 2021, respectively. He received Ph.D. degree in radio engineering at CTU FEE in 2025. He is currently employed on a research position at the CTU FEE Department of Radioengineering. His research is primarily focused on UWB networks, UWB positioning, and data fusion with other positioning systems. Dr. Krška has been a part of Integrated Satellite and Terrestrial Navigation Technologies Center at the department since 2016.
Mr. Harrison Reeves
Phd Student
Ucl
Cross-view urban positioning using angular semantic constraints and 2D vector maps
11:00 AM - 11:20 AMAbstract text
Reliable positioning in Global Navigation Satellite System (GNSS)-denied urban environments remains a critical capability gap for dismounted personnel and autonomous systems. However, most visual positioning systems require the query and database images to be taken from similar viewpoints. For military, emergency, and security applications, there is often a need to match forward-looking query images with a satellite-based database or a map. Using semantic feature descriptors, such as "building," "road," and "house," enables this cross-view image matching.
This paper proposes Angular Semantic Positioning, a method that extracts explicit angular bearings to static urban infrastructure from a single monocular image and compares them against lightweight 2D vector maps (e.g., OpenStreetMap). We employ a real-time semantic segmentation network to distinguish individual building instances from continuous urban blocks, enabling precise centroid-based bearing measurements. To handle the ambiguity of single-epoch matching, we enhance positioning accuracy by enforcing geometric depth-ordering constraints. For example, we use the physical prior that line-of-sight rays can traverse planar "road" surfaces but are obstructed by vertical "building" structures, allowing the algorithm to score position candidates based on free-space visibility.
This method offers two main advantages for real-time operation: (1) Robustness: By enforcing geometric consistency between the visual scene and the vector map, the system reduces the impact of transient segmentation errors; and (2) Efficiency: A forward-looking query image can be matched to a map in 2–6 ms on consumer hardware, minimizing Size, Weight, and Power (SWaP) requirements. This is significantly faster than emerging end-to-end machine learning approaches (e.g., Sarlin et al., 2023).
Experimental validation using query images from the Mapillary Vistas dataset (Neuhold et al., 2017) and map data from OpenStreetMap (Haklay & Weber, 2008) covering London and the surrounding area demonstrates that the algorithm successfully isolates the true position within the top 10% of candidates in 77% of trials. This effectively reduces the search space by an order of magnitude using only a single monocular frame, validating its suitability for resilient Positioning, Navigation, and Timing (PNT) in resource-constrained environments. By combining multiple image matches as the user moves, we would expect to obtain a unique position solution.
This paper proposes Angular Semantic Positioning, a method that extracts explicit angular bearings to static urban infrastructure from a single monocular image and compares them against lightweight 2D vector maps (e.g., OpenStreetMap). We employ a real-time semantic segmentation network to distinguish individual building instances from continuous urban blocks, enabling precise centroid-based bearing measurements. To handle the ambiguity of single-epoch matching, we enhance positioning accuracy by enforcing geometric depth-ordering constraints. For example, we use the physical prior that line-of-sight rays can traverse planar "road" surfaces but are obstructed by vertical "building" structures, allowing the algorithm to score position candidates based on free-space visibility.
This method offers two main advantages for real-time operation: (1) Robustness: By enforcing geometric consistency between the visual scene and the vector map, the system reduces the impact of transient segmentation errors; and (2) Efficiency: A forward-looking query image can be matched to a map in 2–6 ms on consumer hardware, minimizing Size, Weight, and Power (SWaP) requirements. This is significantly faster than emerging end-to-end machine learning approaches (e.g., Sarlin et al., 2023).
Experimental validation using query images from the Mapillary Vistas dataset (Neuhold et al., 2017) and map data from OpenStreetMap (Haklay & Weber, 2008) covering London and the surrounding area demonstrates that the algorithm successfully isolates the true position within the top 10% of candidates in 77% of trials. This effectively reduces the search space by an order of magnitude using only a single monocular frame, validating its suitability for resilient Positioning, Navigation, and Timing (PNT) in resource-constrained environments. By combining multiple image matches as the user moves, we would expect to obtain a unique position solution.
Biography
Harrison Reeves is a PhD candidate at University College London (UCL), supervised by Professor Paul Groves, Dr. Simon Julier, and Dr. Sophia Bano. His research sits at the intersection of traditional navigation and computer vision, specifically focusing on resilient image-based positioning for GNSS-denied environments. Today, he will present his latest work on Angular Semantic Positioning, a novel approach for cross-view matching in urban canyons using 2D vector maps.