The MATCH statement allows you to specify a graph pattern to search for in the graph. It is the fundamental statement for retrieving data from the graph database and binding them to variables for use in subsequent parts of the query.

CREATE GRAPH myGraph { NODE User ({name string}), NODE Club ({since uint32}), EDGE Follows ()-[{createdOn date}]->(), EDGE Joins ()-[{memberNo uint32}]->() }
GQLMATCH (n) RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| U05 | Sys-gen | User | {name: "lionbower"} |
| U04 | Sys-gen | User | {name: "mochaeach"} |
| U03 | Sys-gen | User | {name: "purplechalk"} |
| U02 | Sys-gen | User | {name: "Brainy"} |
| U01 | Sys-gen | User | {name: "rowlock"} |
| C02 | Sys-gen | Club | {since: 2005} |
| C01 | Sys-gen | Club | {since: 2005} |
GQLMATCH ()-[e]->() RETURN e
Result: e
_uuid | _from | _to | _from_uuid | _to_uuid | schema | values |
|---|---|---|---|---|---|---|
| Sys-gen | U01 | U02 | UUID of U01 | UUID of U02 | Follows | {createdOn: "2024-01-05" } |
| Sys-gen | U02 | U03 | UUID of U02 | UUID of U03 | Follows | {createdOn: "2024-02-01"} |
| Sys-gen | U03 | U05 | UUID of U03 | UUID of U05 | Follows | {createdOn: "2024-05-03"} |
| Sys-gen | U04 | U02 | UUID of U04 | UUID of U02 | Follows | {createdOn: "2024-02-10"} |
| Sys-gen | U02 | C01 | UUID of U02 | UUID of C01 | Joins | {memberNo: 1} |
| Sys-gen | U05 | C01 | UUID of U05 | UUID of C01 | Joins | {memberNo: 2} |
| Sys-gen | U04 | C02 | UUID of U04 | UUID of C02 | Joins | {memberNo: 9} |
Notice that if you don't specifiy the edge direction, either as outgoing or incoming, each edge in the graph will be returned twice, as two paths are considered distinct when their element sequences differ, i.e., (n1)-[e]->(n2) and (n2)<-[e]-(n1) are different paths.
GQL - Each edge will be returned twiceMATCH ()-[e]-() RETURN e
Both node pattern and edge pattern support the label/schema expression to specify schemas (in typed graphs) or labels (in open graphs).
To retrieve all Club nodes:
GQLMATCH (n:Club) RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| C02 | Sys-gen | Club | {since: 2005} |
| C01 | Sys-gen | Club | {since: 2005} |
To retrieve all nodes connected to Brainy with Follows or Joins edges:
GQLMATCH (:User {name: 'Brainy'})-[:Follows|Joins]-(n) RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| U03 | Sys-gen | User | {name: "purplechalk"} |
| C01 | Sys-gen | Club | {since: 2005} |
Property specification can be included in node and edge patterns to apply joint equalities to filter nodes and edges with key-value pairs.
To retrieve Club nodes whose _id and since have specific values:
GQLMATCH (n:Club {_id: 'C01', since: 2005}) RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| C01 | Sys-gen | Club | {since: 2005} |
To retrieve the name of the member of club C01 whose memberNo is 1:
GQLMATCH (:Club {_id: 'C01'})<-[:Joins {memberNo: 1}]-(n) RETURN n.name
Result: n
| n.name |
|---|
| Brainy |
You can use abbreviated edge patterns when you do not need to filter edges or assign them to a variable. Even with the abbreviated form, you may still specify the direction of the edge when necessary.
To retrieve nodes connected with mochaeach with any outgoing edges:
GQLMATCH (:User {name: 'mochaeach'})->(n) RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| U02 | Sys-gen | User | {name: "Brainy"} |
| C02 | Sys-gen | Club | {since: 2005} |
To retrieve users followed by mochaeach, and the clubs joined by those users:
GQLMATCH p = (:User {name: 'mochaeach'})-[:Follows]->(:User)-[:Joins]->(:Club) RETURN p
Result: p

The WHERE clause can be used within an element pattern (node or edge pattern), a parenthesized path pattern, or immediately after a graph pattern in the MATCH statement to specify various search conditions.
To retrieve 1-step paths with outgoing Follows edges, where their createdOn values are greater than a specified date:
GQLMATCH p = ()-[e:Follows WHERE e.createdOn > '2024-04-01']->() RETURN p
Result: p

To retrieve one- or two-step paths containing outgoing Follows edges, where their createdOn values are smaller than a specified value:
GQLMATCH p = (()-[e:Follows]->() WHERE e.createdOn < "2024-02-05"){1,2} RETURN p
Result: p

To retrieve members of club C01 whose memberNo is greater than 1:
GQLMATCH (c:Club)<-[e:Joins]->(n) WHERE c._id = 'C01' AND e.memberNo > 1 RETURN n
Result: n
| _id | _uuid | schema | values |
|---|---|---|---|
| U05 | Sys-gen | User | {name: "lionbower"} |
A quantified path is a variable-length path where the complete path or a part of it is repeated a specified number of times.
To retrieve distinct nodes related to lionbower in 1 to 3 hops:
GQLMATCH (:User {name: 'lionbower'})-[]-{1,3}(n) RETURN collect_list(DISTINCT n._id) AS IDs
Result:
| IDs |
|---|
| ["C01","U01","U02","U03","U04"] |
To retrieve paths that begin with one- or two-step subpaths containing Follows edges, where their createdOn values are greater than a specified value, and these subpaths must connect to node C01:
GQLMATCH p = (()-[e:Follows]->() WHERE e.createdOn > "2024-01-31"){1,2}()-({_id:"C01"}) RETURN p
Result: p

A shortest paths between two nodes are the paths that has the fewest edges.
To retrieve all the shortest paths between lionbower and purplechalk within 5 hops:
GQLMATCH p = ALL SHORTEST (n1:User)-[]-{,5}(n2:User) WHERE n1.name = 'lionbower' AND n2.name = 'purplechalk' RETURN p
Result: p

When a MATCH statement contains multiple path patterns, each pattern is matched independently against the graph to produce its own result set. These result sets are then combined by performing an equi-join on the shared node or edge variables.
To retrieve users who joined club C02 and also follow Brainy:
GQLMATCH (u)-[:Joins]->(:Club {_id: 'C02'}), (u)-[:Follows]->(:User {name: 'Brainy'}) RETURN u
Result: u
| _id | _uuid | schema | values |
|---|---|---|---|
| U04 | Sys-gen | User | {name: "mochaeach"} |
The above query is equivalent to the following using two MATCHs:
GQLMATCH (u)-[:Joins]->(:Club {_id: 'C02'}) MATCH (u)-[:Follows]->(:User {name: 'Brainy'}) RETURN u
If the path patterns share no common variables, the result sets are combined using a Cartesian product — a behavior that is usually undesired. For example,
GQLMATCH (c:Club), (u:User)-[f:Follows WHERE f.createdOn > '2024-02-01']->() RETURN c._id, u.name
Result:
| c._id | u.name |
|---|---|
| C02 | mochaeach |
| C02 | purplechalk |
| C01 | mochaeach |
| C01 | purplechalk |
The YIELD clause can be used to select specific node, edge, or path variables from the MATCH statement, making them accessible for reference in subsequent parts of the query. Variables not selected with YIELD will no longer be available. If the YIELD clause is omitted, all variables are passed through by default.
This query only returns c, as n is not involved in YIELD:
GQLMATCH (n:User)-[:Joins]->(c:Club) YIELD c RETURN *
Result: c
| _id | _uuid | schema | values |
|---|---|---|---|
| C01 | Sys-gen | Club | {since: 2005} |
| C02 | Sys-gen | Club | {since: 2005} |
| C01 | Sys-gen | Club | {since: 2005} |
This query returns n1 and e, n2 is not included:
GQLMATCH (n1:Club) MATCH (n2:Club)<-[e:Joins WHERE e.memberNo < 3]-() YIELD e RETURN *
n1
| _id | _uuid | schema | values |
|---|---|---|---|
| C01 | Sys-gen | Club | {since: 2005} |
| C01 | Sys-gen | Club | {since: 2005} |
| C02 | Sys-gen | Club | {since: 2005} |
| C02 | Sys-gen | Club | {since: 2005} |
e
_uuid | _from | _to | _from_uuid | _to_uuid | schema | values |
|---|---|---|---|---|---|---|
| Sys-gen | U02 | C01 | UUID of U02 | UUID of C01 | Joins | {memberNo: 1} |
| Sys-gen | U02 | C01 | UUID of U02 | UUID of C01 | Joins | {memberNo: 1} |
| Sys-gen | U05 | C01 | UUID of U05 | UUID of C01 | Joins | {memberNo: 2} |
| Sys-gen | U05 | C01 | UUID of U05 | UUID of C01 | Joins | {memberNo: 2} |
This query throws syntax error since n2 is not selected in the YIELD clause, thus it cannot be accessed by the RETURN statement:
GQL - Syntax ErrorMATCH (n1:User), (n2:Club) YIELD n1 RETURN n1, n2