Open Access

Advanced Visual SLAM and Image Segmentation Techniques for Augmented Reality

1 Cranfield University
2 Cranfield University
3 Cranfield University

Abstract

Augmented reality can enhance human perception to experience a virtual-reality intertwined world by computer vision techniques. However, the basic techniques cannot handle complex large-scale scenes, tackle real-time occlusion, and render virtual objects in augmented reality. Therefore, this paper studies potential solutions, such as visual SLAM and image segmentation, that can address these challenges in the augmented reality visualizations. This paper provides a review of advanced visual SLAM and image segmentation techniques for augmented reality. In addition, applications of machine learning techniques for improving augmented reality are presented.

Keywords

How to Cite

Jiang, Y., Tran, T. H., & Williams, L. (2026). Advanced Visual SLAM and Image Segmentation Techniques for Augmented Reality. Asia Journal of Social Innovation and Development, 2(1), 28. Retrieved from https://www.ajsid.org/index.php/pub/article/view/28

References

📄 AlapattD.MascagniP.VardazaryanA.GarciaA.OkamotoN.MutterD.MarescauxJ.CostamagnaG.DallemagneB. PadoyN. (2021). Temporally Constrained Neural Networks (TCNN): A framework for semi-supervised video semantic segmentation.
📄 Almalioglu, Y., Saputra, M. R. U., de Gusmao, P. P., Markham, A., & Trigoni, N. (2019). Ganvo: Unsupervised deep monocular visual odometry and depth estimation with generative adversarial networks. In International conference on robotics and automation, (pp. 5474-5480). IEEE.
📄 Alves, J., & Bernardino, A. (2020, April). A remote RGB-D VSLAM solution for low computational powered robots. In 2020 IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC), (pp. 214-220). IEEE. doi:10.1109/ICARSC49921.2020.9096074
📄 Alzahrani, N. M., & Alfouzan, F. A. (2022). Augmented Reality (AR) and Cyber-Security for Smart Cities—A Systematic Literature Review. Sensors (Basel), 22(7), 2792. doi:10.3390/s22072792 PMID:35408406
📄 Arandjelovic, R., Gronat, P., Torii, A., Pajdla, T., & Sivic, J. (2016). NetVLAD: CNN architecture for weakly supervised place recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, (pp. 5297-5307). doi:10.1109/CVPR.2016.572