通道混洗与跨尺度增强的轻量级铁路全景分割

Lightweight railway panoramic segmentation based on channel shuffle and cross scale enhancement

  • 摘要: 针对高速铁路场景下图像全景分割时存在全景分割精确度低,难以实现轻量级实时分割等问题,提出了一种通道混洗与跨尺度增强的轻量级铁路全景分割方法。首先,基于FasterNet网络,提出了轻量化CS_FasterNet特征提取网络,通过部分卷积和通道混洗优化了特征信息的聚合,实现对铁路场景下全景分割轻量化特征提取。其次,设计了多尺度特征交互增强模块,利用特征交互和跨特征融合,全面地捕捉局部的细节和全局信息,提高图像特征提取的质量。最后,改进预测融合模块对语义结果与实例结果进行融合,提升网络对图像分割的准确性,得到更加精细的全景分割输出结果。实验结果表明:所提轻量级模型在模型每秒处理帧率和计算量等评价指标均优于对比方法,相较于UPSNet方法,本文方法的每秒处理帧数提高了约11.5帧/s,全景分割质量提升了约9.9%,能够实现对不同铁路场景下全景分割的准确性和实时性。

     

    Abstract: A lightweight railway panoramic segmentation method based on channel shuffle and cross scale enhancement is proposed to address the issues of low accuracy and difficulty in achieving lightweight real-time segmentation in high-speed railway scene image panoramic segmentation. Firstly, based on the FasterNet network, a lightweight CS-FasterNet feature extraction network is proposed. The original FasterNet is a feature extraction network that reduces redundant calculations through partial convolution, aiming to improve computation speed and preserve high detection accuracy. However, some convolutions only apply filters to a small portion of the input channel, which can lead to insufficient feature extraction for the remaining channels. The aggregation of feature information was optimized through partial convolution and channel shuffling, and feature recombination techniques were used to reduce the computational complexity of the model, thereby achieving the goal of feature extraction in lightweight railway scenes. Realize lightweight feature extraction for panoramic segmentation in railway scenes. Secondly, a multi-scale feature interaction enhancement module was designed, which applies multi head attention mechanism to further extract pixel level semantic feature information from high-level semantic features of the image, which can expand the range of receptive fields, increase the long-range dependency relationship of feature maps, and help capture finer grained information. The cross scale feature fusion module adopts two paths of feature fusion: bottom-up and top-down. By fusing the feature maps of different scales output by the backbone network, it improves the utilization of scale features, comprehensively captures local details and global information, and which improves the quality of image feature extraction. Finally, the improved prediction fusion module integrates semantic results with instance results. In the panoramic segmentation task, the Soft NMS method is used to improve the accuracy of pixel classification. Soft NMS selects detection boxes by reducing the confidence of the intersection to union ratio greater than the detection box. The obtained intersection to union ratio value is multiplied by the original score using Gaussian exponent, and finally the true detection box is selected by weighting. Improve the accuracy of image segmentation by the network and obtain more refined panoramic segmentation output results. The experimental results show that the proposed lightweight model uses frame rate processing per second and computational complexity as evaluation indicators. The larger the value of frame rate processing per second, the faster the segmentation speed, and the smaller the value of computational complexity, the more favorable it is for lightweight segmentation. In the model, the evaluation indicators such as processing frame rate per second and computational complexity are superior to the comparison method. Compared with the UPSNet method, the processing frame rate per second of this method has increased by about 11.5frames/s, and the panoramic segmentation quality has improved by about 9.9%. It can achieve accuracy and real-time panoramic segmentation in different railway scenes.

     

/

返回文章
返回