Stream processing and reactive programming
October 31, 2014 — July 1, 2015
Lazy bookmark for practical details on processing and transforming possibly infinite streams of data, from signals to parse trees. Disambiguating “transducers”.
Used for parallel or offline processing of data sets too large to fit in memory, and for processing events as they happen in real time, such as UI input.
I am imagining more general objects than singly-indexed real-valued signals; tokens, maybe. Classic DSP can be elsewhere. Infrastructure to do stream processing in a distributed fashion is filed under message queues.
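As a minimal sketch of what a stream of tokens might look like in practice, here is a lazy pipeline built from Python generators. The source `endless_lines` is a hypothetical stand-in for a socket or log tail, and the stage names are made up for illustration; nothing is read until a downstream consumer asks for it.

```python
from itertools import count, islice
from typing import Iterable, Iterator


def endless_lines() -> Iterator[str]:
    """Hypothetical infinite source, standing in for a socket or log tail."""
    for i in count():
        yield f"Event {i} OK"


def tokens(lines: Iterable[str]) -> Iterator[str]:
    """Lazily split each incoming line into whitespace-separated tokens."""
    for line in lines:
        yield from line.split()


def lowercase(stream: Iterable[str]) -> Iterator[str]:
    """Transform tokens one at a time without buffering the stream."""
    for tok in stream:
        yield tok.lower()


# Compose the stages; islice pulls only the first six tokens,
# so the infinite source is never exhausted.
first_six = list(islice(lowercase(tokens(endless_lines())), 6))
```

Each stage holds only one item at a time, so memory use stays constant however long the stream runs.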
In statistics and machine learning, stream processing connects with online learning, which incorporates data as it comes in, as in distributed statistics.
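A small illustration of incorporating data as it comes in: Welford's algorithm updates a running mean and sample variance one observation at a time, in constant memory. This is a sketch of one classic online estimator, not a claim about any particular library; the function name `running_mean_var` is hypothetical.

```python
from typing import Iterable, Iterator, Tuple


def running_mean_var(xs: Iterable[float]) -> Iterator[Tuple[float, float]]:
    """Yield (mean, sample variance) after each observation (Welford's method)."""
    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for x in xs:
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)  # uses the updated mean
        var = m2 / (n - 1) if n > 1 else 0.0
        yield mean, var


# Works on any iterable, including a generator that never ends.
estimates = list(running_mean_var([1.0, 2.0, 3.0, 4.0]))
```

Only three numbers are stored between updates, so the estimator never needs the full data set in memory.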
1 Functional reactive programming
See FRP.
2 Streaming data analysis
Online, possibly real-time, certainly memory-constrained.
Apache Storm…
- Heron is Storm-compatible and aims to be Storm-but-more-reliable.
A collection of links for streaming algorithms and data structures
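For a flavour of what a memory-constrained streaming algorithm looks like, here is a sketch of reservoir sampling (Algorithm R), which keeps a uniform random sample of `k` items from a stream of unknown length in a single pass. The function name and the optional `rng` parameter are illustrative choices, not taken from any of the linked resources.

```python
import random
from typing import Iterable, List, Optional, TypeVar

T = TypeVar("T")


def reservoir_sample(
    stream: Iterable[T], k: int, rng: Optional[random.Random] = None
) -> List[T]:
    """Return a uniform random sample of up to k items using O(k) memory."""
    rng = rng or random.Random()
    reservoir: List[T] = []
    for i, item in enumerate(stream):
        if i < k:
            # Fill the reservoir with the first k items.
            reservoir.append(item)
        else:
            # Pick j uniformly from 0..i inclusive; replace slot j if j < k,
            # which happens with probability k / (i + 1).
            j = rng.randint(0, i)
            if j < k:
                reservoir[j] = item
    return reservoir


sample = reservoir_sample(range(1000), 5, rng=random.Random(0))
```

The stream is consumed once and never stored, which is the defining constraint of this family of algorithms.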
3 To read
Adrian Colyer explains the Naiad system of McSherry et al.
A different, information-theoretic angle is the (interstellar) streaming problem:
- Streaming Algorithms
- Jeremy Kun: The Complexity of Communication