HP Vertica and Apache HBase
Hi all,
What is the difference between HP Vertica and HBase? I want to know what HP Vertica is good at and what
HP Vertica is not good at. And the same question for Apache HBase?
Comments
This link will clear all your doubts
http://my.vertica.com/docs/7.2.x/HTML/index.htm
Hi,
The two address different use cases:
HBase does not natively support SQL (you can still use Phoenix), joins, or aggregate functions, so if you need them you have to develop them yourself.
This is a very high-level comparison, and there are many more differences.
I hope you find it useful to start with.
Thanks
Hi
Some more info on that.
Yes, Apache Drill will give you a SQL interface. However, having SQL does not change the underlying storage that HBase has, which is optimized for key-value retrievals. As I mentioned, if your use case is mainly small retrievals by row key it should be fine; reporting and analytics are a different story.
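To illustrate the point about storage, here is a rough Python sketch. A plain dict keyed by row key stands in for an HBase table; this is not the HBase API, and the row keys, column names, and values are all made up for illustration. It only shows that a lookup by key touches one row, while a reporting-style aggregate has to read every row no matter what query layer sits on top.

```python
# A dict keyed by row key stands in for an HBase table; not the HBase API.
# Row keys, column names, and values are hypothetical.
from collections import defaultdict

orders = {
    "order#001": {"region": "EU", "amount": 120},
    "order#002": {"region": "US", "amount": 80},
    "order#003": {"region": "EU", "amount": 50},
}


def get_order(row_key):
    """Point lookup by row key: touches exactly one row."""
    return orders.get(row_key)


def total_by_region():
    """Reporting-style aggregate: has to read every row in the table."""
    totals = defaultdict(int)
    rows_read = 0
    for row in orders.values():
        totals[row["region"]] += row["amount"]
        rows_read += 1
    return dict(totals), rows_read
```

Calling `get_order("order#002")` reads a single row, while `total_by_region()` reports how many rows it had to read, which grows with the table.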
Some advantages that Vertica will give you (if your use case is reporting and analytics):
1) Performance - Vertica is a columnar database that is highly optimized for reporting. Data is persisted per column and compressed per column, and Vertica is optimized for large scans through its parallel execution engine. For data modeling you can still use a star schema, because Vertica can join the data very fast. HBase stores data differently: you are limited to querying by the row key. If you need to search on another dimension, you have to define a second table whose row key holds that dimension and whose value is the primary row key, so you have less flexibility in what you can select. Vertica gives you more flexibility there, and in terms of performance HBase will be slower than Vertica for this kind of workload.
The main disadvantage of Vertica is concurrency: if you need to support high concurrency you will probably need to scale out your cluster, and sometimes you will have to create tables of pre-calculated results.
2) Vertica is enterprise ready, while HBase is open source; for a production environment this point is very important.
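To make the second-table pattern from point 1) concrete, here is a minimal Python sketch. Again, plain dicts stand in for HBase tables and no real HBase client is used; the table names, function names, and sample rows are hypothetical.

```python
# Plain dicts stand in for HBase tables; this is not the HBase API.
# users_by_id, users_by_email, and the sample rows are hypothetical.

users_by_id = {}     # primary table: row key = user id
users_by_email = {}  # second table: row key = email, value = primary row key


def put_user(user_id, email, city):
    """Write the row and keep the second table in sync by hand."""
    users_by_id[user_id] = {"email": email, "city": city}
    users_by_email[email] = user_id


def get_by_email(email):
    """Two key lookups: second table first, then the primary table."""
    user_id = users_by_email.get(email)
    if user_id is None:
        return None
    return users_by_id.get(user_id)


put_user("u1", "a@example.com", "Paris")
put_user("u2", "b@example.com", "Berlin")
```

Every extra dimension you want to search on (city, for example) would need another table like `users_by_email` that the application maintains itself, which is the loss of flexibility described above.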
By the way, speaking of HBase: if Vertica is not something you want to go with, I would advise you to consider Cassandra. The Cassandra community is bigger and it is a more enterprise-ready solution than HBase.
I hope this answers your questions.
Thanks
Thank you very much.