Choy, C., Gwak, J., and Savarese, S.: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks, in: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Long Beach, CA, USA, 15–20 June 2019, IEEE, 3070–3079,
https://doi.org/10.1109/CVPR.2019.00319, 2019.
a,
b,
c,
d
Dai, A., Chang, A. X., Savva, M., Halber, M., Funkhouser, T., and Niessner, M.: ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes, in: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017, IEEE, 2432–2443,
https://doi.org/10.1109/CVPR.2017.261, 2017.
a
Guo, Y., Wang, H., Hu, Q., Liu, H., Liu, L., and Bennamoun, M.: Deep Learning for 3D Point Clouds: A Survey, IEEE T. Pattern Anal., 43, 4338–4364,
https://doi.org/10.1109/TPAMI.2020.3005434, 2021.
a
Hu, Y., Yang, J., Chen, L., Li, K., Sima, C., Zhu, X., Chai, S., Du, S., Lin, T., Wang, W., Lu, L., Jia, X., Liu, Q., Dai, J., Qiao, Y., and Li, H.: Planning-Oriented Autonomous Driving, in: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023, IEEE, 17853–17862,
https://doi.org/10.1109/CVPR52729.2023.01712, 2023.
a,
b
Hughes, N., Chang, Y., and Carlone, L.: Hydra: A Real-Time Spatial Perception System for 3D Scene Graph Construction and Optimization, in: Robotics: Science and Systems XVIII, Vol. 18,
https://www.roboticsproceedings.org/rss18/p050.html (last access: 31 December 2025), 2022. a
Jain, A., Katara, P., Gkanatsios, N., Harley, A. W., Sarch, G., Aggarwal, K., Chaudhary, V., and Fragkiadaki, K.: ODIN: A Single Model for 2D and 3D Segmentation, in: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024, IEEE, 3564–3574,
https://doi.org/10.1109/CVPR52733.2024.00342, 2024.
a,
b
Ji, G., Weder, S., Engelmann, F., Pollefeys, M., and Blum, H.: ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding, in: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 10–17 June 2025, IEEE, 4398–4407,
https://doi.org/10.1109/CVPR52734.2025.00415, 2025.
a
Lai, X., Liu, J., Jiang, L., Wang, L., Zhao, H., Liu, S., Qi, X., and Jia, J.: Stratified Transformer for 3D Point Cloud Segmentation, in: 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), New Orleans, LA, USA, 18–24 June 2022, IEEE, 8490–8499,
https://doi.org/10.1109/CVPR52688.2022.00831, 2022.
a
Li, D., Guan, J., Chen, Z., Liao, J., and Du, J.: PointSSM: State Space Model for Large-Scale LiDAR Point Cloud Semantic Segmentation, Int. J. Appl. Earth Obs., 144, 104830,
https://doi.org/10.1016/j.jag.2025.104830, 2025a.
a,
b
Li, Z., Ai, Y., Lu, J., Wang, C., Deng, J., Chang, H., Liang, Y., Yang, W., Zhang, S., and Zhang, T.: Pamba: Enhancing Global Interaction in Point Clouds via State Space Model, in: 39th AAAI Conference on Artificial Intelligence (AAAI), Philadelphia, PA, USA, 25 February–4 March 2025, AAAI Press, Washington, DC, USA, 39, 5092–5100,
https://doi.org/10.1609/aaai.v39i5.32540, 2025b.
a,
b
Liu, Z., Yang, X., Tang, H., Yang, S., and Han, S.: FlatFormer: Flattened Window Attention for Efficient Point Cloud Transformer, in: 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17–24 June 2023, IEEE, 1200–1211,
https://doi.org/10.1109/CVPR52729.2023.00122, 2023.
a
Mazaheri, H., Goli, S., and Nourollah, A.: A Survey of 3D Space Path-Planning Methods and Algorithms, ACM Comput. Surv., 57, 1–32,
https://doi.org/10.1145/3673896, 2024.
a,
b
Mohammadi Amin, F., Caldwell, D. G., and van de Venn, H. W.: Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments, J. Intell. Robot. Syst., 111, 94,
https://doi.org/10.1007/s10846-025-02290-9, 2025.
a,
b
Peng, B., Wu, X., Jiang, L., Chen, Y., Zhao, H., Tian, Z., and Jia, J.: OA-CNNs: Omni-Adaptive Sparse CNNs for 3D Semantic Segmentation, in: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024, IEEE, 21305–21315,
https://doi.org/10.1109/CVPR52733.2024.02013, 2024.
a
Qi, C. R., Yi, L., Su, H., and Guibas, L. J.: PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space, in: Proceedings of the 31st International Conference on Neural Information Processing Systems (NeurIPS), Red Hook, NY, USA, 4–9 December 2017, 5105–5114,
https://proceedings.neurips.cc/paper_files/paper/2017/file/d8bf84be3800d12f74d8b05e9b89836f-Paper.pdf (last access: 4 March 2026), 2017.
a,
b
Rozenberszki, D., Litany, O., and Dai, A.: Language-Grounded Indoor 3D Semantic Segmentation in the Wild, in: Proceedings of the 17th European Conference on Computer Vision (ECCV), Tel Aviv, Israel, 23–27 October 2022, edited by: Avidan, S., Brostow, G., Cissé, M., Farinella, G. M., and Hassner, T., Lecture Notes in Computer Science, Springer, Cham, 13693, 125–141,
https://doi.org/10.1007/978-3-031-19827-4_8, 2022.
a,
b
Shi, K., Wang, R., Liu, J., Wang, H., and Zhang, D.: Design and analysis of mobile mechanism based on three-dimensional Hilbert curve, Mech. Sci., 16, 851–876,
https://doi.org/10.5194/ms-16-851-2025, 2025.
a,
b
Thomas, H., Qi, C. R., Deschaud, J.-E., Marcotegui, B., Goulette, F., and Guibas, L.: KPConv: Flexible and Deformable Convolution for Point Clouds, in: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), Seoul, Korea (South), 27 October–2 November 2019, IEEE, 6410–6419,
https://doi.org/10.1109/ICCV.2019.00651, 2019.
a,
b,
c
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A. N., Kaiser, Ł., and Polosukhin, I.: Attention Is All You Need, in: Proceedings of the 31st International Conference on Neural Information Processing Systems (NeurIPS), Red Hook, NY, USA, 4–9 December 2017, 6000–6010,
https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf (last access: 4 March 2026), 2017. a
Wang, P.-S.: OctFormer: Octree-based Transformers for 3D Point Clouds, ACM T. Graphic., 42, 155,
https://doi.org/10.1145/3592131, 2023.
a,
b,
c,
d,
e
Wu, X., Lao, Y., Jiang, L., Liu, X., and Zhao, H.: Point Transformer V2: Grouped Vector Attention and Partition-Based Pooling, in: Proceedings of the 36th International Conference on Neural Information Processing Systems (NeurIPS), edited by: Koyejo, S., Mohamed, S., Agarwal, A., Belgrave, D., Cho, K., and Oh, A., New Orleans, LA, USA, 28 November–9 December 2022, 35, 33330–33342,
https://proceedings.neurips.cc/paper_files/paper/2022/file/d78ece6613953f46501b958b7bb4582f-Paper-Conference.pdf (last access: 4 March 2026), 2022.
a,
b
Wu, X., Jiang, L., Wang, P.-S., Liu, Z., Liu, X., Qiao, Y., Ouyang, W., He, T., and Zhao, H.: Point Transformer V3: Simpler, Faster, Stronger, in: 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Seattle, WA, USA, 16–22 June 2024, IEEE, 4840–4851,
https://doi.org/10.1109/CVPR52733.2024.00463, 2024.
a,
b,
c,
d,
e,
f,
g
Xu, R., Li, J., Zhang, S., Li, L., Li, H., Ren, G., and Tang, X.: Interactive trajectory prediction for autonomous driving based on Transformer, Mech. Sci., 16, 87–97,
https://doi.org/10.5194/ms-16-87-2025, 2025.
a
Yang, Y.-Q., Guo, Y.-X., Xiong, J.-Y., Liu, Y., Pan, H., Wang, P.-S., Tong, X., and Guo, B.: Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding, Computational Visual Media, 11, 83–101,
https://doi.org/10.26599/CVM.2025.9450383, 2025.
a,
b
Zeid, K. A., Yilmaz, K., de Geus, D., Hermans, A., Adrian, D., Linder, T., and Leibe, B.: DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation, arXiv [preprint],
https://doi.org/10.48550/arXiv.2503.18944, 2026.
a,
b,
c
Zhao, W., Zhang, R., Wang, Q., Cheng, G., and Huang, K.: BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis, in: 2025 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 10–17 June 2025, IEEE, 29395–29405,
https://doi.org/10.1109/CVPR52734.2025.02737, 2025.
a,
b