<?xml version="1.0" encoding="UTF-8"?>
<article article-type="research-article" xml:lang="en-US" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher">london-journal-of-research-in-computer-science-technology</journal-id>
<journal-title-group>
<journal-title>London Journal of Research in Computer Science &amp; Technology</journal-title>
</journal-title-group>
<issn publication-format="print">2514-863X</issn>
<issn publication-format="electronic">2514-8648</issn>
<publisher><publisher-name>JournalsPress</publisher-name></publisher>
<self-uri xlink:href="https://journalspress.com/journal-seo-export/jats/106923.xml" />
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">106923</article-id>
<title-group>
<article-title>Bidirectional Feature Fusion Optimization of Salient Object Detection Method based on Cdinet</article-title>
</title-group>
<volume>25</volume>
<abstract><p>Salient object detection has become an important research direction in image processing technology, so for the challenges of inference efficiency and feature fusion in bimodal salient target detection, this paper proposes an improved salient target detection method for multimodal images. Firstly, a parameter-free SimAM module is introduced into the early feature extraction convolutional laver of the CDINet model to emphasize the key features, which is also used to enhance the ability of the subsequent layers to capture the key information. Then, a bidirectional feature fusion (BiFPN) machine is introduced at the decoder stage system, replacing standard convolution with depth-separable convolution to reduce computational effort while achieving efficient feature fusion and improving multi-scale detection. Finally, the model is enhanced to handle unbalanced data by using focus loss and weighted cross-entropy loss function, as well as combining the cross-merge ratio loss function to improve the accuracy of boundary prediction. The experimental results show that the improved model (SblCDINet) improves the F-measure by 1.07%, 1.15%, 1.06%, 0.16%, and 1.44%, reduces the computational complexity of the model by 19.8% over the five datasets compared to the original CDINet model data. Compared with other models, the SblCDINet model not only achieves the improvement of inference efficiency and the enhancement of the feature fusion effect, but also effectively reduces the computational complexity, and the experimental data also verifies the effectiveness and superiority of the method in multi-modal image saliency object detection.</p></abstract>
<self-uri content-type="html" xlink:href="https://journalspress.com/bidirectional-feature-fusion-optimization-of-salient-object-detection-method-based-on-cdinet/" />
</article-meta>
</front>
<body>
<sec>
<title>Full Text</title>
<p>Salient object detection has become an important research direction in image processing technology, so for the challenges of inference efficiency and feature fusion in bimodal salient target detection, this paper proposes an improved salient target detection method for multimodal images. Firstly, a parameter-free SimAM module is introduced into the early feature extraction convolutional laver of the CDINet model to emphasize the key features, which is also used to enhance the ability of the subsequent layers to capture the key information. Then, a bidirectional feature fusion (BiFPN) machine is introduced at the decoder stage system, replacing standard convolution with depth-separable convolution to reduce computational effort while achieving efficient feature fusion and improving multi-scale detection. Finally, the model is enhanced to handle unbalanced data by using focus loss and weighted cross-entropy loss function, as well as combining the cross-merge ratio loss function to improve the accuracy of boundary prediction. The experimental results show that the improved model (SblCDINet) improves the F-measure by 1.07%, 1.15%, 1.06%, 0.16%, and 1.44%, reduces the computational complexity of the model by 19.8% over the five datasets compared to the original CDINet model data. Compared with other models, the SblCDINet model not only achieves the improvement of inference efficiency and the enhancement of the feature fusion effect, but also effectively reduces the computational complexity, and the experimental data also verifies the effectiveness and superiority of the method in multi-modal image saliency object detection.</p>
</sec>
</body>
</article>