One key feature of Microsoft’s Big Data Solution is solid integration of Apache Hadoop with the Microsoft Business Intelligence (BI) components. A good example of this is the ability for Excel to connect to the Hive data warehouse framework in the Hadoop cluster. This section walks you through using Excel via the Hive ODBC driver. A video on the subject is available on YouTube.
Install the Hive ODBC Driver
Create a Hive ODBC Data Source to use with Excel
11. Click Execute Query.
Cool.. tried it works like a charm
will this odbc driver work for a non azure based hadoop cluster?
We are planning to implement Proof of Concepts(POC) on using Microsoft BI stack/Hadoop Big data. We have HadoopOnCloudera cluster is available with two nodes. Microsoft provided Hive ODBC driver and Hive Add in Excel to using in Powerpivot Excel 2010. We are planning to install these drivers and connect to HadoopCloudera thru Excel Powerpivot 2010 and publish it in SharePoint.
I have below questions:
1. Is this Hive driver is only for HadoopOnAzure or Can we connect to HadoopOnCloudera using Microsoft Hive ODBC drivers from Excel Powerpivot2010. Can we also connect using SSRS/SSAS. Are there any plug-ins available from SSIS?
2. Hive ODBC driver and Add in Excel are Free or do we need to obtain license
3. Is there any other better use case to implement POC with MS BI stack than what we are thinking
Yes, it is work very well with non Azure based Hadoop. We are working with Cloudera Hadoop and Microsoft ODBC driver.
Hi Michael,
May I know what specific setting you have to do/tune on your Cloudera Hadoop in order to make this ODBC driver work?
In my scenario:
1. Tested Excel with this ODBC driver with Azure and it was working, so I think the setting on Excel/Windows ODBC side is okay.
2. When trying to connect to Cloudera Hadoop, we made sure Hive server is working. When connecting from Excel to Cloudera, we chose No Authentication and it seemed to connect, but there was no table shown.
Thanks in advance.
Chih-Hua