Appendix A. References

Amazon Web Services, http://aws.amazon.com/.

Amazon DynamoDB, http://aws.amazon.com/dynamodb/.

Amazon Elastic MapReduce (Amazon EMR), http://aws.amazon.com/elasticmapreduce/.

Amazon Simple Storage Service (S3), http://aws.amazon.com/s3.

Cassandra Database, http://cassandra.apache.org/.

Apache HBase, http://hbase.apache.org/.

Apache Hive, http://hive.apache.org/.

Apache Hive Wiki, https://cwiki.apache.org/Hive/.

Apache Oozie, http://incubator.apache.org/oozie/.

Apache Pig, http://pig.apache.org/.

Apache ZooKeeper, http://zookeeper.apache.org/.

Cascading, http://cascading.org.

Data processing on Hadoop without the hassle, https://github.com/nathanmarz/cascalog.

Easy, efficient MapReduce pipelines in Java and Scala, https://github.com/cloudera/crunch.

Datalog, http://en.wikipedia.org/wiki/Datalog.

C.J. Date, The Relational Database Dictionary, O’Reilly Media, 2006, ISBN 978-0-596-52798-3.

Jeffrey Dean and Sanjay Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, Proceedings of the 6th Symposium on Operating Systems Design and Implementation (OSDI ’04), 2004.

Apache Derby, http://db.apache.org/derby/.

Jeffrey E.F. Friedl, Mastering Regular Expressions, 3rd Edition, O’Reilly Media, 2006, ISBN 978-0-596-52812-6.

Alan Gates, Programming Pig, O’Reilly Media, 2011, ISBN 978-1-449-30264-1.

Lars George, HBase: The Definitive Guide, O’Reilly Media, 2011, ISBN 978-1-449-39610-7.

Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung, The Google File System, Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles (SOSP ’03), 2003.

Jan Goyvaerts and Steven Levithan, Regular Expressions Cookbook, 2nd Edition, O’Reilly Media, 2012, ISBN 978-1-449-31943-4.

Eben Hewitt, Cassandra: The Definitive Guide, O’Reilly Media, 2010, ISBN 978-1-449-39041-9.

Ashish Thusoo et al., Hive: A Petabyte Scale Data Warehouse Using Hadoop, 2010 IEEE 26th International Conference on Data Engineering (ICDE).

JDK 1.6 java.util.regex.Pattern Javadoc, http://docs.oracle.com/javase/6/docs/api/java/util/regex/Pattern.html.

The Java Tutorials, Lesson: Regular Expressions, http://docs.oracle.com/javase/tutorial/essential/regex/.

JSON, http://json.org/.

Apache Kafka: A high-throughput, distributed messaging system, http://incubator.apache.org/kafka/index.html.

Kerberos: The Network Authentication Protocol, http://web.mit.edu/kerberos.

MapR, the Next Generation Distribution for Apache Hadoop, http://mapr.com.

MarkLogic, http://www.marklogic.com/.

Wolfram Mathematica, http://www.wolfram.com/mathematica/.

MATLAB: The Language of Technical Computing, http://www.mathworks.com/products/matlab/index.html.

GNU Octave, http://www.gnu.org/software/octave/.

Oracle XML DB, http://www.oracle.com/technetwork/database/features/xmldb/index.html.

The R Project for Statistical Computing, http://r-project.org/.

A Scala API for Cascading, https://github.com/twitter/scalding.

SciPy: Scientific Tools for Python, http://scipy.org.

Shark (Hive on Spark), http://shark.cs.berkeley.edu/.

Spark: Lightning-Fast Cluster Computing, http://www.spark-project.org/.

Storm: Distributed and fault-tolerant realtime computation: stream processing, continuous computation, distributed RPC, and more, https://github.com/nathanmarz/storm.

Tony Stubblebine, Regular Expression Pocket Reference, O’Reilly Media, 2003, ISBN 978-0-596-00415-6.

Dean Wampler, Functional Programming for Java Developers, O’Reilly Media, 2011, ISBN 978-1-449-31103-2.

Dean Wampler and Alex Payne, Programming Scala, O’Reilly Media, 2009, ISBN 978-0-596-15595-7.

Tom White, Hadoop: The Definitive Guide, 3rd Edition, O’Reilly Media, 2012, ISBN 978-1-449-31152-0.

XPath Specification, http://www.w3.org/TR/xpath/.
