Streaming SQL for Apache Spark

Streaming SQL for Spark is a project based on Catalyst and Spark Streaming to support SQL-style queries on data streams. It bridges the gap between structured data queries and stream processing. It provides:

  • SQL support on both stream and table data with extended time-based windowing aggregation and join.
  • Easy mutual operation between DStream and SQL.
  • External source API support for streaming source.

See: http://bit.ly/1J6LFdR