Azure Cosmos DB is Microsoft’s multi-model database which supports the Gremlin query language to store and operate on graph data. It enables you to build highly responsive applications worldwide. As part of Cosmos DB, Graph/Gremlin API is supported for graph databases. Since Cosmos DB is optimized for fast processing (OLTP), traversal limits may apply for heavy analytic workloads (OLAP). Azure Databricks and GraphFrames can be used as an alternative to do advanced analytics. GraphFrames is an Apache Spark package which extends DataFrames to provide graph analytics capabilities.
Cosmos DB Spark connector contains samples to read graph data into GraphFrames. In this session I will demonstrate how to build upon this connector to write GraphFrames to Cosmos DB using an Azure Databricks PySpark notebook.
level: 400