Challenges for Query Agents on the Decentralized Web

Traditional query processing over centralized data

Data and query engine are not collocated
Query engine runs on a separate machine
Not just one datasets
Data is spread over the Web into multiple heterogeneous sources

Linked Data is interlinked
Following the Linked Data principles
Query engine can follow links
Start from one source, and discover new sources on the fly
Link Traversal Query Processing (LTQP)

Query sources must describe their query API
Agents must be able to discover capabilities of sources for query planning
Query sources must describe their contents
Agents must be able to discover contents of sources for query planning
Cardinality-based, shape-based, approximate, privacy-preserving...
Query agents must have efficient query planning algorithms
(Adaptive) query planning and execution over heterogeneous query APIs