Enhancing Semantic Code Search With Deep Graph Matching

dc.authoridAFZAL, FARKHANDA/0000-0001-5396-7598
dc.contributor.authorBibi, Nazia
dc.contributor.authorMaqbool, Ayesha
dc.contributor.authorRana, Tauseef
dc.contributor.authorAfzal, Farkhanda
dc.contributor.authorAkgul, Ali
dc.contributor.authorEldin, Sayed M.
dc.date.accessioned2024-12-24T19:28:33Z
dc.date.available2024-12-24T19:28:33Z
dc.date.issued2023
dc.departmentSiirt Üniversitesi
dc.description.abstractThe job of discovering appropriate code snippets against a natural language query is an important task for software developers. Appropriate code retrieval increases software productivity and quality as well. In contrast to traditional information retrieval techniques, code search necessitates bridging the semantic breach between programming languages and natural language to search code fragments. Deep neural networks for search codes have recently been a hot topic in research. The standard neural code quest approaches present source code and query in the form of text as independent embedding, then calculate the semantic similarity between them using vector distance (e.g., using cosine similarity). Although recent research utilized query and code snippets during code search, it overlooked the contained rich semantic information and deep structural features between them. In this study, we are also dealing with the problem of code search by providing a deep neural solution that facilitates software developers during software development. Our proposed model effectively used neural graph matching and a searching approach for semantic code retrieval. It first converts both query and code fragments in graph format and then the semantic matching module is used to facilitate the process of matching that will retrieve the best-matched code snippets. It not only exploits the enriched semantic meanings and features, but it also uses the cross-attention mechanism to learn the fine-grained similarity that exists between query and code. The proposed model's evaluation is done using the Codesearchnet dataset with six representative programming languages. It provides comparatively good results as compared to existing baselines. It enables users to find required code snippets, and ranking is used to retrieve top 10 results. The accuracy of the proposed system is approximately 97%.
dc.identifier.doi10.1109/ACCESS.2023.3263878
dc.identifier.endpage52411
dc.identifier.issn2169-3536
dc.identifier.scopus2-s2.0-85153339400
dc.identifier.scopusqualityQ1
dc.identifier.startpage52392
dc.identifier.urihttps://doi.org/10.1109/ACCESS.2023.3263878
dc.identifier.urihttps://hdl.handle.net/20.500.12604/7115
dc.identifier.volume11
dc.identifier.wosWOS:001005698600001
dc.identifier.wosqualityQ2
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherIEEE-Inst Electrical Electronics Engineers Inc
dc.relation.ispartofIeee Access
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_20241222
dc.subjectCodes
dc.subjectSource coding
dc.subjectSemantics
dc.subjectSoftware
dc.subjectTask analysis
dc.subjectGraph neural networks
dc.subjectSTEM
dc.subjectCodesearchnet
dc.subjectnatural language
dc.subjectdeep neural networks
dc.subjectsemantic matching
dc.subjectGNN
dc.subjectsource code selection
dc.subjectsource code reuse
dc.subjectrecommendation system
dc.titleEnhancing Semantic Code Search With Deep Graph Matching
dc.typeArticle

Dosyalar