The Common Neighbors algorithm measures the similarity between two nodes by counting how many neighbors they share.
The logic behind this algorithm is that two nodes with many common neighbors are more likely to be similar or have a potential connection. This similarity score is calculated using the following formula:

where N(x) and N(y) are the sets of adjacent nodes to nodes x and y respectively.
More common neighbors indicate greater similarity between nodes, while a number of 0 indicates no similarity between two nodes.

In this example, CN(D,E) = |N(D) ∩ N(E)| = |{B, F}| = 2.

Run the following statements on an empty graph to define its structure and insert data:
INSERT (A:default {_id: "A"}), (B:default {_id: "B"}), (C:default {_id: "C"}), (D:default {_id: "D"}), (E:default {_id: "E"}), (F:default {_id: "F"}), (G:default {_id: "G"}), (A)-[:default]->(B), (B)-[:default]->(E), (C)-[:default]->(B), (C)-[:default]->(D), (C)-[:default]->(F), (D)-[:default]->(B), (D)-[:default]->(E), (F)-[:default]->(D), (F)-[:default]->(G);
To load the entire graph to the HDC server hdc-server-1 as my_hdc_graph:
CREATE HDC GRAPH my_hdc_graph ON "hdc-server-1" OPTIONS { nodes: {"*": ["*"]}, edges: {"*": ["*"]}, direction: "undirected", load_id: true, update: "static" }
Algorithm name: topological_link_prediction
Name | Type | Spec | Default | Optional | Description |
|---|---|---|---|---|---|
ids | []_id | / | / | No | Specifies the first group of nodes for computation by their _id. If unset, all nodes in the graph are used as the first group of nodes. |
uuids | []_uuid | / | / | No | Specifies the first group of nodes for computation by their _uuid. If unset, all nodes in the graph are used as the first group of nodes. |
ids2 | []_id | / | / | No | Specifies the second group of nodes for computation by their _id. If unset, all nodes in the graph are used as the second group of nodes. |
uuids2 | []_uuid | / | / | No | Specifies the second group of nodes for computation by their _uuid. If unset, all nodes in the graph are used as the second group of nodes. |
type | String | Common_Neighbors | Adamic_Adar | No | Specifies the similarity type; for Common Neighbors, keep it as Common_Neighbors. |
return_id_uuid | String | uuid, id, both | uuid | Yes | Includes _uuid, _id, or both to represent nodes in the results. |
limit | Integer | ≥-1 | -1 | Yes | Limits the number of results returned. Set to -1 to include all results. |
CALL algo.topological_link_prediction.write("my_hdc_graph", { ids: ["C"], ids2: ["A","E","G"], type: "Common_Neighbors", return_id_uuid: "id" }, { file: { filename: "cn" } })
Result:
File: cn_id1,_id2,result C,A,1 C,E,2 C,G,1
CALL algo.topological_link_prediction.run("my_hdc_graph", { ids: ["C"], ids2: ["A","C","E","G"], type: "Common_Neighbors", return_id_uuid: "id" }) YIELD cn RETURN cn
Result:
| _id1 | _id2 | result |
|---|---|---|
| C | A | 1 |
| C | E | 2 |
| C | G | 1 |
CALL algo.topological_link_prediction.stream("my_hdc_graph", { ids: ["C"], ids2: ["A", "B", "D", "E", "F", "G"], type: "Common_Neighbors", return_id_uuid: "id" }) YIELD cn FILTER cn.result >= 2 RETURN cn
Result:
| _id1 | _id2 | result |
|---|---|---|
| C | D | 2 |
| C | E | 2 |