Method RoI-graph Pooling Graph Construction Given sample points Local graph G = (V, E), where node represents a sampled point, edge is connection between nodes Define G as k-nearest neighbor graph, built from the geometric distance among different nodes Use PointNet [21] to encode original neighbor points Add the 3D proposal’s local corners [23] for each node to make them have the ability to discriminate differences Initial state s j of each node v j at iteration step t = 0: s j = [x j ,y j ,z j ,r j ,f j ,u j ,w j ,f img ] features from PointNet two diagonal corners of the 3D proposal [21] Qi, C.R., Su, H., Mo, K., Guibas, L.J.: Pointnet: Deep learning on point sets for 3d classification and segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2017) [23] Sheng, H., Cai, S., Liu, Y., Deng, B., Huang, J., Hua, X.S., Zhao, M.J.: Improving 3d object detection with channel-wise transformer. In: Proceedings of the IEEE International Conference on Computer Vision (2021)