
HBase basically handles two kinds of files.
One is used for the write-ahead log
and the other for the actual data storage. Continue reading “Apache Hbase”

The CAP theorem is also called Brewer’s theorem.
C – Consistency
A – Availability
P – Partition Tolerance
Continue reading “CAP Theorem”
Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of streaming data into the Hadoop Distributed File System (HDFS).
Continue reading “Apache Flume”
Apache Sqoop(TM) is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases. Continue reading “Apache Sqoop”
Microsoft Azure HDInsight is a 100% Apache Hadoop-based service in the Azure cloud.
Continue reading “Microsoft Azure HDInsight”
Pig was originally developed at Yahoo Research around 2006 for Continue reading “Apache Pig”
Two key members of the Facebook team that created Hive, the Hadoop query language, are:
1) Joydeep Sen Sarma.
2) Ashish Thusoo. Continue reading “Hive Query Language”