SeccoSQL
Experimental distributed SQL engine that separates communication from computation.
SeccoSQL is an experimental distributed SQL engine on Spark designed for complex SQL and graph-style queries.
What it is
SeccoSQL explores a distributed query execution model that explicitly separates communication from computation. This exposes a larger optimization space than conventional distributed SQL engines, where data movement and local execution are usually coupled inside each operator.
Key ideas
- Explicit communication and computation operators for relational algebra.
- Finer-grained operator reordering to reduce distributed communication cost.
- Spark-based implementation for complex SQL and graph-style query workloads.