Professional Documents
Culture Documents
3rd Int. Workshop on Graph Data Management: Techniques and applications (GDM 2012) 5 April, Washington DC, USA
Outline 1. Introduction 2. Current graph databases 3. Comparison of graph database models 4. Conclusions and future work
Outline 1. Introduction 2. Current graph databases 3. Comparison of graph database models 4. Conclusions and future work
Database models
A data model is a collection of conceptual tools used to model real-world entities and the relationships among them [Silberschatz et al. 1996]. A DB model consists of 3 components [Codd 1980]:
2 Graph-oriented operations
R. Angles and C. Gutierrez. Survey of Graph Database Models. ACM Comp. Surveys, 2008.
DEX
Neo4j
Motivation
1. What is the most suitable graph database?
Empirical comparison: desirable but hard Benchmark: there is not a standard one The application domain is also important
Outline 1. Introduction 2. Current graph databases 3. Comparison of graph database models 4. Conclusions and future work
Outline 1. Introduction 2. Current graph databases 3. Comparison of graph database models 4. Conclusions and future work
Backed: RDB (filament), BerkeleyDB (HypergraphDB), TokyoCabinet (vertexDB) Indexing: nodes, relations, atributes, triples Data formats: GraphML, graphViz, N-Triples, RDF-XML, ACID (partial support): Allegro, HypergraphDB, Infinite, Neo4j
Languages: SPARQL 1.0(.1), GraphQL, (Sones) Why is an API the main approach? does it facilities the development of applications?
Node/edge identification: ObjectsI-Ds vs Values Support for complex relations (hyperedges = n-ary relations)
Query features
Declarative QL: SPARQL, Prolog, Lisp, GraphQL Reasoning: RDF(S), OWL Analysis: Social Networking, graph statistics
Integrity constraints
Outline 1. Introduction 2. Current graph databases 3. Comparison of graph database models 4. Conclusions and future work
Conclusions
General balance of graph database models Data structures:
Several types of graph structures Good expressive power for data modeling
Query features:
Restricted to provide APIs Good support for essential graph queries Lack of graph query languages
Integrity constraints:
Basic notions of restrictions Oriented to be schema-less
Future work
Empirical evaluation of graph databases Development of a Benchmark for GDBs Comparison with other database technologies (in particular RDF databases)