How much memory your Namenode need?
October 11, 2015Apache Pig Tutorial – Loading Datasets
December 7, 2015Is Hive Good At Everything?
Hive is an awesome tool, which takes in SQL like queries and translate them in to MapReduce. Hive is very helpful in working with complex data and helps in visualizing the data in row columnar fashion (which is liberating).
Just because Hive can take in SQL like queries and execute them like a database system does, can it do everything a database can do?
The answer is NO.
Hive has a lot of limitations when compared to a database system like MySQL or Oracle. Here are some of the limitations in Hive when compared to a traditional database system.
- Hive can not do pointed updates or deletes. The closest to delete you get is by dropping partitions. Why? Because behind the scenes, Hive works against files in HDFS.
- Hive does not support triggers
- Transactions – Very high level support and it is recently added. Reference
- Indexes – Rudimentary support. Reference
- Views – Read-only views are allowed. Materialized views are not supported.
- Speed – Hive relies on MapReduce and Hadoop for execution, which works very well with big datasets but not great for split second results.