In-Memory Dictionary-Based Indexing of Quoted RDF Triples

Ruben Taelman

QuWeDa 2023, 6 November 2023

In-Memory Dictionary-Based Indexing of Quoted RDF Triples

Ghent University – imec – IDLab, Belgium

Violets are blue

– Alice

:Alice :says << :Violets :haveColor :Blue >> .

🗄️ ?

Making statements about statements

Triples can be quoted as terms

Wide adoption of quoted triples

How to index quoted triples,
and what is the impact on ingestion, storage, and query performance?

Dictionary-based approaches

Dictionary Encoding

1. Singular Dictionary

Singular Dictionary

2. Quoted Triples Dictionary

Quoted Triples Dictionary

3. Referential Quoted Triples Dictionary

Referential Quoted Triples Dictionary

4. Indexed Quoted Triples Dictionary

Indexed Quoted Triples Dictionary

Experimental Setup

Singular and referential dictionaries achieve lowest storage size

Storage size depth 1

Singular and quoted dictionaries achieve lowest ingestion times

Depth 1:   Ingestion time depth 1
Depth 5:   Ingestion time depth 5

Minimal difference in performance for highly selective queries

Query high depth 1

Indexed Quoted Triples Dictionary significantly faster for medium selectivity

Query medium depth 1

Indexed Quoted Triples Dictionary significantly faster for low selectivity

Query low depth 1

Conclusions