Index
A
B
- batch data processing
- batch duration
- batching
- batch mode / The emergence of Spark SQL
- batch processing
- Big Data
- Big Data analytics architecture
- Big Data ecosystem
- Big Data problem statements, Lambda Architecture
- bolts
- Business Intelligence (BI) / The Big Data ecosystem
C
- Call Data Record (CDR) / The telecoms or cellular arena
- CAS (content-addressed storage) / Producers
- cascading / Components of the Big Data ecosystem
- Cassandra Core driver
- Cassandra Query Language (CQL) / Configuring Apache Cassandra and Spark
- Catalyst optimizer
- challenges, batch data processing
- challenges, in selecting technology for data consumption layer
- challenges, real-time data processing
- cluster manager
- cluster managers, for Spark streaming
- Coda Hale metrics library
- Complex Event Processing (CEP) / Real-time processing
- components, Big Data ecosystem
- components, Kinesis
- components, Spark SQL
- components, Spark Streaming
- components/layers, Lambda Architecture
- ConnectionProvider interface
- consumer group
- cost-based optimization / The Catalyst optimizer
- CQLSH
- custom connectors
D
E
- Eclipse
- Eclipse Luna (4.4)
- electronic publishing
- electronic trading platform
- ETL (Extract Transform Load) / Dataset processing
- extensibility
- extensions/libraries, Spark
F
- fastutil library
- fault tolerance
- fault tolerant
- features, Lambda Architecture
- features, resilient distributed datasets (RDD)
- features, Spark
- fence instruction / Memory and cache
- filtering step / Dataset processing
- Flume
- functionalities, RDD API
- functions, resilient distributed datasets (RDD)
G
H
I
- Infrastructure as a Service (IaaS) / The Big Data ecosystem
- input data streams
- input sources, Storm
- installing
- integration / Dataset processing
- inter-worker communication
- Internet of Things (IoT)
- intra-worker communication
J
- Java
- JdbcMapper interface
- JdbcRDD
- Joins
K
- Kafka
- Key Performance Indicators (KPIs)
- key technologies, Hadoop ecosystem
- Kinesis
- Kinesis Client Library (KCL)
- Kinesis Producer Library (KPL)
- Kinesis streaming service
- Kinesis stream producers
- Kryo documentation
- Kryo serialization
- Kyro
L
M
N
O
- operations, RDD API
- Oracle Java 7
- OrderedRDDFunctions
- org.apache.spark.streaming.dstream.DStream.scala / Spark Streaming APIs
- org.apache.spark.streaming.flume.*
- org.apache.spark.streaming.kafka.*
- org.apache.spark.streaming.kinesis.*
- org.apache.spark.streaming.StreamingContext / Spark Streaming APIs
- org.apache.spark.streaming.twitter.*
- org.apache.spark.streaming.zeromq.*
- or Illinois Uniform Crime Reporting (IUCR) / Programming Spark transformations and actions
- output data streams
- output operations, DStreams
P
Q
R
S
- S3
- Scala
- Scala 2.10.5 compressed tarball
- Scala APIs, by Spark Core
- scalability
- schema evolution
- schema merging
- SequenceFileRDDFunctions
- serialization process
- shards
- single point of failure (SPOF) / The need for Lambda Architecture
- SLAs
- smart traversing
- software development kit (SDK) / Components of Kinesis
- Spark
- Spark-Cassandra connector
- Spark-Cassandra Java library
- Spark 1.4.0
- Spark actions
- Spark architecture
- Spark cluster
- Spark compressed tarball
- Spark Core
- Spark core engine
- Spark driver
- Spark execution model
- Spark extensions
- Spark framework
- Spark job
- Spark master
- Spark packages
- SparkR
- Spark SQL
- SPARK SQL
- Spark SQL job
- Spark Steaming job
- Spark Streaming
- Spark Streaming APIs
- Spark Streaming applications
- Spark Streaming job
- Spark streaming job
- Spark Streaming operations
- Spark transformation
- Spark UI
- Spark worker/executors
- speed layers
- splits
- spout collector / The concept of anchoring and reliability
- SQL Streaming Crime Analyzer
- standalone resource manager
- StorageLevel class
- storage levels, Spark
- Storm
- Storm abstractions
- Storm acking framework
- Storm cluster
- Storm internal message processing
- Storm internals
- Storm internode communication
- Storm parallelism
- Storm persistence
- Storm simple patterns
- Storm UI
- StreamingContext
- streaming data
- stream producer
- Supervisors / Optimizing Storm performance
T
- Tachyon
- Taychon
- TextInputFormat
- Thrift
- transformation / Dataset processing
- transformation operations, on input streams
- transformation operations, on streaming data
- Trident
- Trident operations
- Trident topology
- troubleshooting tips
U
- use cases, for batch data processing
- use cases, real-time data processing
W
Y
Z
..................Content has been hidden....................
You can't read the all page of ebook, please click
here login for view all page.