Publication Type
Journal Article
Authorship
Nafi, K. W., Roy, B., Roy, C. K., & Schneider, K. A.
Title
A universal cross language software similarity detector for open source software categorization
Year
2020
Publication Outlet
Journal of Systems and Software, 162, 110491.
DOI
https://doi.org/10.1016/j.jss.2019.110491
Citation
Nafi, K. W., Roy, B., Roy, C. K., & Schneider, K. A. (2020). A universal cross language software similarity detector for open source software categorization. Journal of Systems and Software, 162, 110491. https://doi.org/10.1016/j.jss.2019.110491
Abstract
While there are novel approaches for detecting and categorizing similar software applications, previous research focused on detecting similarity among applications written in the same programming language, not among applications written in different programming languages. Cross-language software similarity detection is inherently more challenging due to variations in language, application structure, support libraries used, and naming conventions. In this paper we propose a novel model, CroLSim, to detect similar software applications across different programming languages. We define a semantic relationship among cross-language libraries and API methods (both local and third party) using functional descriptions and a word-vector learning model. Our experiments show that CroLSim successfully detects similar software applications across languages and outperforms all existing approaches (mean average precision of 0.65, confidence rate of 3.6, and 75% highly rated successful queries). Furthermore, we applied CroLSim to a source code repository to see whether the model can recommend cross-language source code fragments when queried directly with source code. We found that CroLSim can recommend functionally similar cross-language source code when source code is used directly as the query (average precision=0.28, recall=0.85, and F-Measure=0.40).
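The abstract reports mean average precision, precision, recall, and F-Measure. As a reading aid, the sketch below shows how these retrieval metrics are commonly computed. It is not the authors' evaluation code: the function names and the toy item labels are hypothetical, and the average-precision normalization (dividing by the number of relevant items) is one common convention assumed here, not something the record confirms.

```python
def precision_recall_f1(retrieved, relevant):
    """Set-based precision, recall, and F-measure for one query."""
    retrieved, relevant = set(retrieved), set(relevant)
    hits = len(retrieved & relevant)
    precision = hits / len(retrieved) if retrieved else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    denom = precision + recall
    # F-measure is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


def average_precision(ranked, relevant):
    """Average precision of one ranked result list.

    Assumption: normalized by the total number of relevant items.
    """
    relevant = set(relevant)
    hits = 0
    total = 0.0
    for rank, item in enumerate(ranked, start=1):
        if item in relevant:
            hits += 1
            total += hits / rank  # precision at this rank
    return total / len(relevant) if relevant else 0.0


def mean_average_precision(queries):
    """Mean of average precision over (ranked, relevant) pairs."""
    return sum(average_precision(r, rel) for r, rel in queries) / len(queries)


# Toy data (hypothetical labels, not from the paper).
p, r, f = precision_recall_f1(["a", "b", "c", "d"], ["a", "c"])
map_score = mean_average_precision([
    (["a", "b", "c"], ["a", "c"]),
    (["x", "y"], ["y"]),
])
```

When such metrics are averaged per query, the mean F-Measure generally differs from the harmonic mean of the mean precision and mean recall, so summary figures like those in the abstract need not satisfy the formula exactly.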
Program Affiliations
GWF: Global Water Futures
Project Affiliations
GWF-CS: Computer Science
Publication Stage
Published
Additional Information
Computer Science
Download Links
https://doi.org/10.1016/j.jss.2019.110491