Avg-topk: A new pooling method for convolutional neural networks

dc.contributor.authorÖzdemir, Cüneyt
dc.date.accessioned2024-12-24T19:09:56Z
dc.date.available2024-12-24T19:09:56Z
dc.date.issued2023
dc.departmentSiirt Üniversitesi
dc.description.abstractThe pooling layer is a layer used in Convolutional Neural Networks (CNN) that takes the output feature map of the previous convolutional layer and reduces the feature maps to smaller sizes. Furthermore, in CNN the pooling layer is one of the layers that determines the success of the model. The pooling layer, reduces the spatial dimension of a CNN, greatly reducing the learning time and computational cost of the model. The most common pooling methods are maximum and average pooling. Due to the fact that the pooling strategy reduces the amount of feature maps and model parameters, it is crucial to preserve the dominant information. In this study, a cost-effective new pooling method approach is proposed. The proposed pooling method is used by calculating the weighted average of the dominant features. The proposed pooling model has been developed to address the shortcomings of maximum pooling and average pooling. The proposed new Avg-TopK pooling model takes the pixels with the highest interaction as much as the specified K number and averages them. In this study, the performances of several pooling strategies for gray and RGB picture classification in 3 different datasets were compared and analyzed in depth. Extensive experiments have demonstrated that the Avg-TopK pooling method achieves significantly higher image classification accuracy than conventional pooling methods. It has been observed that using the AVG-TopK method in transfer learning models leads to much more successful results. Furthermore, studies in the literature have compared based on the performance metrics and it has been seen that the proposed method produces more successful outcomes. In research conducted on datasets using this method, the accuracy achieved for the CIFAR-10 dataset was 6.28% and 16.62% according to the maximum pooling and the average pooling, respectively. For the CIFAR-100 dataset, the accuracy rate increased by 7.76% compared to the maximum pooling and by 25% compared to the average pooling. © 2023 Elsevier Ltd
dc.identifier.doi10.1016/j.eswa.2023.119892
dc.identifier.issn0957-4174
dc.identifier.scopus2-s2.0-85151310278
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org10.1016/j.eswa.2023.119892
dc.identifier.urihttps://hdl.handle.net/20.500.12604/3833
dc.identifier.volume223
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherElsevier Ltd
dc.relation.ispartofExpert Systems with Applications
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_20241222
dc.subjectConvolutional neural networks
dc.subjectCustom pooling method
dc.subjectDown-sampling
dc.subjectNew pooling method
dc.titleAvg-topk: A new pooling method for convolutional neural networks
dc.typeReview Article

Dosyalar