Ghent University – imec – IDLab, Belgium
Not all data is public (e.g. Solid), GDPR, ...
Public SPARQL endpoints down for 1.5+ days each month (95% availability).
Federation algorithms only work in the order of 10 sources.
Websites use schema.org (36% in 2014), Solid data pods are popping up, ...
Client-side engines authenticate themselves to sources.
Better scaling, as clients do most of the work themselves.
Detect query-relevant sources on-the-fly by incorporating live crawling into the querying process.
Clients can use this as an external index over certain sources.