I know that this is comparing apples to oranges, but sometimes you have to choose what to eat…
I’m trying to find a rule of thumb as to the magnitudes for which to choose hbase over postgres. E.g., I want to do ‘select sum(x) from t group by y’. Obviously this can be done with an HBase scan, but at what volumes is the performance difference substantial.
Looking for any information.