Overlapping community detection in weighted networks via hierarchical clustering
Identifikátory výsledku
Kód výsledku v IS VaVaI
<a href="https://www.isvavai.cz/riv?ss=detail&h=RIV%2F61989100%3A27240%2F24%3A10256313" target="_blank" >RIV/61989100:27240/24:10256313 - isvavai.cz</a>
Výsledek na webu
<a href="https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0312596" target="_blank" >https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0312596</a>
DOI - Digital Object Identifier
<a href="http://dx.doi.org/10.1371/journal.pone.0312596" target="_blank" >10.1371/journal.pone.0312596</a>
Alternativní jazyky
Jazyk výsledku
angličtina
Název v původním jazyce
Overlapping community detection in weighted networks via hierarchical clustering
Popis výsledku v původním jazyce
In real-world networks, community structures often appear as tightly connected clusters of nodes, with recent studies suggesting a hierarchical organization where larger groups subdivide into smaller ones across different levels. This hierarchical structure is particularly complex in trade networks, where actors typically belong to multiple communities due to diverse business relationships and contracts. To address this complexity, we present a novel algorithm for detecting hierarchical structures of overlapping communities in weighted networks, focusing on the interdependency between internal and external quality metrics for evaluating the detected communities. The proposed Graph Hierarchical Agglomerative Clustering (GHAC) approach utilizes maximal cliques as the basis units for hierarchical clustering. The algorithm measures dissimilarities between clusters using the minimal closed trail distance (CT-distance) and the size of maximal cliques within overlaps, capturing the density and connectivity of nodes. Through extensive experiments on synthetic networks with known ground truth, we demonstrate that the adjusted Silhouette index is the most reliable internal metric for determining the optimal cut in the dendrogram. Experimental results indicate that the GHAC method is competitive with widely used community detection techniques, particularly in networks with highly overlapping communities. The method effectively reveals the hierarchical structure of communities in weighted networks, as demonstrated by its application to the OECD weighted trade network, which describes the balanced trade value of bilateral trade relations.
Název v anglickém jazyce
Overlapping community detection in weighted networks via hierarchical clustering
Popis výsledku anglicky
In real-world networks, community structures often appear as tightly connected clusters of nodes, with recent studies suggesting a hierarchical organization where larger groups subdivide into smaller ones across different levels. This hierarchical structure is particularly complex in trade networks, where actors typically belong to multiple communities due to diverse business relationships and contracts. To address this complexity, we present a novel algorithm for detecting hierarchical structures of overlapping communities in weighted networks, focusing on the interdependency between internal and external quality metrics for evaluating the detected communities. The proposed Graph Hierarchical Agglomerative Clustering (GHAC) approach utilizes maximal cliques as the basis units for hierarchical clustering. The algorithm measures dissimilarities between clusters using the minimal closed trail distance (CT-distance) and the size of maximal cliques within overlaps, capturing the density and connectivity of nodes. Through extensive experiments on synthetic networks with known ground truth, we demonstrate that the adjusted Silhouette index is the most reliable internal metric for determining the optimal cut in the dendrogram. Experimental results indicate that the GHAC method is competitive with widely used community detection techniques, particularly in networks with highly overlapping communities. The method effectively reveals the hierarchical structure of communities in weighted networks, as demonstrated by its application to the OECD weighted trade network, which describes the balanced trade value of bilateral trade relations.
Klasifikace
Druh
J<sub>imp</sub> - Článek v periodiku v databázi Web of Science
CEP obor
—
OECD FORD obor
10201 - Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8)
Návaznosti výsledku
Projekt
—
Návaznosti
S - Specificky vyzkum na vysokych skolach
Ostatní
Rok uplatnění
2024
Kód důvěrnosti údajů
S - Úplné a pravdivé údaje o projektu nepodléhají ochraně podle zvláštních právních předpisů
Údaje specifické pro druh výsledku
Název periodika
PLoS One
ISSN
1932-6203
e-ISSN
—
Svazek periodika
19
Číslo periodika v rámci svazku
10
Stát vydavatele periodika
US - Spojené státy americké
Počet stran výsledku
22
Strana od-do
—
Kód UT WoS článku
001344593100005
EID výsledku v databázi Scopus
—