Batch and Streaming Analytics

In the previous chapter, we introduced Spark and obtained BTC/USD transaction data from www.bitstamp.net. Using that data, we can now perform some analysis on it.

First, we are going to query this data using a notebook tool named Apache Zeppelin. After that, we will write a program that receives the live transactions from https://www.bitstamp.net/ and sends them to a Kafka topic as they arrive.

Finally, we will use Zeppelin again to run some streaming analytics queries on the data coming to the Kafka topic.

In this chapter, we will cover the following topics:

  • Introduction to Zeppelin
  • Analyzing transactions with Zeppelin
  • Introducing Apache Kafka
  • Streaming transactions to Kafka
  • Introducing Spark Streaming
  • Analyzing Streaming transactions with Zeppelin
..................Content has been hidden....................

You can't read the all page of ebook, please click here login for view all page.
Reset