There are 6 projects that I have on my list of Open Source Technologies in the Big Data space that I want to contribute to :
(in no particular order)
- Spark
- Kafka
- Beam
- Flink
- Mahout
- Streams
There are 6 projects that I have on my list of Open Source Technologies in the Big Data space that I want to contribute to :
(in no particular order)
I have been spending some time this weekend trying to understand Kafka.
Kafka has four core APIs:
The documentation uses the WordCountDemo as the motivating example for the Streaming scenario. So I decided to spend some time over the weekend understanding this API better.
References:
Code: