Overview

A High-Performance Evolutionary Multiobjective Community Detection Algorithm for Large Graphs

The High-Performance Multi-Objective Community Detection (HP-MOCD) algorithm is a scalable evolutionary method designed to efficiently identify high-quality community partitions in large complex networks. HP-MOCD combines the NSGA-II optimization framework with a parallel architecture and topology-aware genetic operators tailored to the structure of real-world graphs. In addition to detailing its core components, we describe the algorithm’s design choices, solution representation, and multi-objective selection strategy. The implementation is written in Rust for performance and exposed to Python via PyO3. The full source code is publicly available on GitHub.

A full pre-print of the paper is also available.

Overview and Design Rationale

Consider a graph G = (V, E), where V is the set of nodes and E is the set of edges. The objective of HP-MOCD is to uncover meaningful community structures by simultaneously optimizing multiple, often conflicting, structural criteria.

Note: The library currently supports only networkx and igraph graphs. Support for other graph libraries would be a welcome contribution.

To achieve this, HP-MOCD is built upon the NSGA-II (Non-dominated Sorting Genetic Algorithm II) framework, a well-established method in multi-objective optimization. NSGA-II was chosen due to its strong ability to produce diverse, high-quality Pareto fronts, especially when compared to older algorithms like PESA-II, which often struggle with diversity maintenance or selection pressure.

Optimization Strategy

The HP-MOCD algorithm proceeds in two main phases:

Initialization Phase:
A population of potential community partitions (called individuals) is randomly generated. Each individual is a possible assignment of nodes to communities.
Evolutionary Phase:
The population evolves through a number of generations using genetic operators—selection, crossover, and mutation. At each generation, individuals are evaluated, ranked, and filtered to maintain only the most promising solutions.
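
The initialization phase can be illustrated with a minimal Python sketch. It is not the library's Rust implementation: the function name `init_population`, its parameters, and the choice of drawing labels uniformly from `0..len(nodes)-1` are assumptions made for illustration only.

```python
import random


def init_population(nodes, pop_size, seed=None):
    """Create `pop_size` random individuals (hypothetical helper).

    Each individual maps every node to a community label drawn
    uniformly at random, so no graph structure is used at this stage.
    """
    rng = random.Random(seed)
    nodes = list(nodes)
    population = []
    for _ in range(pop_size):
        individual = {node: rng.randrange(len(nodes)) for node in nodes}
        population.append(individual)
    return population


# Four random partitions of a six-node graph.
population = init_population(range(6), pop_size=4, seed=0)
```

Allowing as many labels as there are nodes lets an individual range from one giant community to all singletons; the evolutionary phase is then responsible for merging labels into meaningful groups.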

This high-level flow is summarized below.

Algorithm 1: HP-MOCD Workflow

flowchart TD
    %% Start
    A["Start HP-MOCD"]--> B["Initialize Population"]

    %% First evaluation
    B --> C["Evaluate P (Intra/Inter)"]

    %% Main loop condition
    C --> D{"Generations remaining?"}

    %% Loop path
    D -- Yes --> E["Assign Crowding Distances"]
    E --> F["Select Parents M\n(Tournament Selection)"]
    F --> G["Generate Offspring Q\n(Apply Crossover & Mutation)"]
    G --> H["Evaluate Q"]
    H --> I["Merge P and Q → R"]
    I --> J["Select Next Generation P\n(Best N from R)"]
    J --> K["Increment gen"]
    K --> D

    %% End path
    D -- No --> L["Extract Pareto Front F1\n(rank = 1)"]
    L --> M["Return F1"]
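
Two steps in the workflow, extracting the rank-1 Pareto front and assigning crowding distances, follow the standard NSGA-II definitions and can be sketched in a few lines of Python. The function names are hypothetical, and the sketch assumes that every objective is expressed as a quantity to minimize; it does not reproduce the library's parallel Rust code.

```python
def dominates(a, b):
    """True if vector `a` Pareto-dominates `b` (all objectives minimized)."""
    return (all(x <= y for x, y in zip(a, b))
            and any(x < y for x, y in zip(a, b)))


def pareto_front(objectives):
    """Indices of the non-dominated (rank 1) objective vectors."""
    return [
        i for i, a in enumerate(objectives)
        if not any(dominates(b, a)
                   for j, b in enumerate(objectives) if j != i)
    ]


def crowding_distance(objectives):
    """NSGA-II crowding distance for the vectors of a single front."""
    n = len(objectives)
    dist = [0.0] * n
    if n == 0:
        return dist
    for k in range(len(objectives[0])):
        order = sorted(range(n), key=lambda i: objectives[i][k])
        lo, hi = objectives[order[0]][k], objectives[order[-1]][k]
        # Boundary solutions are always kept, so they get infinite distance.
        dist[order[0]] = dist[order[-1]] = float("inf")
        if hi == lo:
            continue
        for pos in range(1, n - 1):
            gap = objectives[order[pos + 1]][k] - objectives[order[pos - 1]][k]
            dist[order[pos]] += gap / (hi - lo)
    return dist


scores = [(0.2, 0.8), (0.5, 0.5), (0.6, 0.6), (0.9, 0.1)]
front = pareto_front(scores)  # (0.6, 0.6) is dominated by (0.5, 0.5)
distances = crowding_distance([scores[i] for i in front])
```

Within a front, a larger crowding distance marks a solution in a sparsely populated region of objective space, which is how NSGA-II preserves diversity when it truncates the merged population.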

Objectives and Representation

The optimization targets two structural objectives:

Intra-Community Connectivity: Measures how densely connected nodes are within each community. This objective is maximized (or its penalty minimized) to encourage cohesive clusters.
Inter-Community Separation: Measures the extent of connections between different communities. This is minimized to promote structural separation and distinct boundaries.

Together, these form a multi-objective problem, where each solution represents a trade-off between internal density and external separation.
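
This page does not give formulas for the two objectives, so the following sketch uses one common formulation from the multi-objective community detection literature as an assumption: the intra term is 1 minus the fraction of edges that fall inside communities, and the inter term is the sum over communities of the squared degree share. The function name `evaluate` and the dictionary-based inputs are hypothetical.

```python
from collections import defaultdict


def evaluate(adjacency, partition):
    """Return (intra, inter) for an undirected graph; both are minimized.

    Assumed formulation (not confirmed by this page):
      intra = 1 - sum_c |E(c)| / m
      inter = sum_c (deg(c) / (2 * m)) ** 2
    """
    m = sum(len(nbrs) for nbrs in adjacency.values()) / 2  # number of edges
    if m == 0:
        return 0.0, 0.0  # degenerate case chosen for this sketch
    internal_edges = defaultdict(float)
    degree_sum = defaultdict(float)
    for node, nbrs in adjacency.items():
        c = partition[node]
        degree_sum[c] += len(nbrs)
        for nbr in nbrs:
            if partition[nbr] == c:
                internal_edges[c] += 0.5  # each internal edge is seen twice
    intra = 1.0 - sum(internal_edges.values()) / m
    inter = sum((d / (2 * m)) ** 2 for d in degree_sum.values())
    return intra, inter


# Two triangles joined by the single bridge edge 2-3.
adjacency = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
             3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
split = {0: 0, 1: 0, 2: 0, 3: 1, 4: 1, 5: 1}
merged = {node: 0 for node in adjacency}
intra_split, inter_split = evaluate(adjacency, split)
intra_merged, inter_merged = evaluate(adjacency, merged)
```

Under this assumed formulation the trade-off is visible directly: putting every node in one community drives the intra term to 0 but pushes the inter term to 1, while splitting along the bridge gives a small intra penalty and a lower inter term. With these definitions, 1 - intra - inter equals the classic modularity score.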

Internal Graph Representation

Internally, the graph G is stored as a hash map (using the fast hasher from Rust's rustc-hash crate) that maps each node to its neighbor list. This provides:

Fast access/modification during evolution
Efficient computation of objective functions
Scalability for large graphs
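
As a rough Python analogue of this layout (the actual storage is a Rust hash map), the sketch below builds a node-to-neighbors dictionary from an edge list. The helper name `build_adjacency` and the undirected, unweighted edge format are assumptions.

```python
from collections import defaultdict


def build_adjacency(edges):
    """Map each node to the list of its neighbors (undirected graph)."""
    adjacency = defaultdict(list)
    for u, v in edges:
        adjacency[u].append(v)
        adjacency[v].append(u)
    return dict(adjacency)


edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
adjacency = build_adjacency(edges)
neighbors_of_2 = adjacency[2]     # average O(1) lookup by node ID
degree_of_2 = len(adjacency[2])   # degree is the neighbor list length
```

Keeping neighbors in a per-node list means that objective evaluation and neighbor-based operators only touch the edges incident to the nodes they visit.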

Each individual (solution) is encoded as a mapping from node IDs to community IDs:

  { node_1: community_3, node_2: community_1, ... }

This compact representation supports fast mutations and evaluations during the evolutionary cycle.
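
To show why a flat node-to-community mapping makes mutation cheap, here is a hedged sketch of a neighbor-based mutation: with some probability, a node adopts the most frequent community label among its neighbors. This operator is an illustration of a topology-aware mutation, not necessarily the one HP-MOCD implements; `mutate`, `rate`, and `rng` are hypothetical names.

```python
import random
from collections import Counter


def mutate(individual, adjacency, rate, rng):
    """Return a mutated copy of `individual` (illustrative operator).

    Each node with at least one neighbor is selected with probability
    `rate` and takes the label most common among its neighbors, read
    from the parent so that updates do not cascade within one pass.
    """
    child = dict(individual)
    for node, nbrs in adjacency.items():
        if nbrs and rng.random() < rate:
            counts = Counter(individual[n] for n in nbrs)
            child[node] = counts.most_common(1)[0][0]
    return child


# Two triangles joined by the bridge edge 2-3.
adjacency = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3],
             3: [2, 4, 5], 4: [3, 5], 5: [3, 4]}
individual = {0: 0, 1: 0, 2: 1, 3: 1, 4: 2, 5: 2}
child = mutate(individual, adjacency, rate=1.0, rng=random.Random(0))
```

Each mutation is a single dictionary write, and the parent is left untouched, so offspring can be produced independently of one another.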

Last updated 03 Jun 2025, 13:25 -0300
