Graph pattern matching is one of the most interesting and challengingoperations in graph analytics. However, it is primarily supported by graphdatabase systems such as Neo4j but, besides research prototypes, not generallyavailable for distributed (not-only graph) processing frameworks like ApacheFlink or Apache Spark.
In our talk, we want to give an overview of our current implementation ofCypher on Apache Flink. Cypher is the Neo4j graph query language and enablesthe intuitive definition of graph patterns including structural and semanticpredicates. As the Neo4j graph data model is not supported out-of-the box byApache Flink, we leverage Gradoop, a Flink-based graph analytics frameworkbased on Apache Flink that already provides an abstraction of schema-freeproperty graphs.
We will give a brief overview about the technologies used to implement Cypher,explain our query engine and give a demonstration of the available languagefeatures. Finally, we will discuss open challenges and missing featureshopefully motivating people to contribute.
The project is a cooperation between the University of Leipzig and NeoTechnology. |