Is it a right choice to replace Apache hive with P...
# getting-started
r
Is it a right choice to replace Apache hive with Pinot?.
m
Could you elaborate on the use case a bit?
r
We have a Datalake(Cloudera) built on HDFS and Hive (ORC format). Users perform Hive SQL queries for performing data quality KPI with lot of data blending queries across schemas (Ex: Sales, Billing, Finance.. ) , data cleansing and also build dashboards on Tableau. Some of the tables contains 800+ columns with many null values fields and can contain few million records
m
You can certainly use Pinot for building dashboards, and more broader analytical use cases. However, it is not the right tool for data blending/cleansing jobs.