One key feature of Microsoft’s Big Data Solution is solid integration of Apache Hadoop with the Microsoft Business Intelligence (BI) components. A good example of this is the ability for Excel to connect to the Hive data warehouse framework in the Hadoop cluster. This section walks you through using Excel via the Hive ODBC driver
Create a Hive ODBC Data Source to use with Excel
11. Click Execute Query.
Cool.. tried it works like a charm
will this odbc driver work for a non azure based hadoop cluster?
We are planning to implement Proof of Concepts(POC) on using Microsoft BI stack/Hadoop Big data. We have HadoopOnCloudera cluster is available with two nodes. Microsoft provided Hive ODBC driver and Hive Add in Excel to using in Powerpivot Excel 2010. We are planning to install these drivers and connect to HadoopCloudera thru Excel Powerpivot 2010 and publish it in SharePoint.
I have below questions:
1. Is this Hive driver is only for HadoopOnAzure or Can we connect to HadoopOnCloudera using Microsoft Hive ODBC drivers from Excel Powerpivot2010. Can we also connect using SSRS/SSAS. Are there any plug-ins available from SSIS?
2. Hive ODBC driver and Add in Excel are Free or do we need to obtain license
3. Is there any other better use case to implement POC with MS BI stack than what we are thinking
Yes, it is work very well with non Azure based Hadoop. We are working with Cloudera Hadoop and Microsoft ODBC driver.
Hi Michael,
May I know what specific setting you have to do/tune on your Cloudera Hadoop in order to make this ODBC driver work?
In my scenario:
1. Tested Excel with this ODBC driver with Azure and it was working, so I think the setting on Excel/Windows ODBC side is okay.
2. When trying to connect to Cloudera Hadoop, we made sure Hive server is working. When connecting from Excel to Cloudera, we chose No Authentication and it seemed to connect, but there was no table shown.
Thanks in advance.
Chih-Hua
Hi Chih-Hua,
it could be many things like port,permissions on hadoop .....
Drop me mail with your details I will try to help you.
michaeln @ liveperson com
I have the same question as RR Netha.
2 questions:
1) Is a 32 bit release also possible, largely so I can use it in SSDT for SSIS?
2) Will it support usage by SSIS?
Cheers, James
Hi BI Monkey,
For the 32-bit question, check out this forum post: social.msdn.microsoft.com/.../83862898-b578-4e1a-870d-ab07ad9e699d
Will it support usage by SSIS, the answer is that there are plans to have an ODBC driver to connect to Hadoop. You can configure ODBC Connection Manager in SSIS to connect, see this post:
alexeikh.wordpress.com/.../moving-data-from-hive-table-in-ms-hadoop-into-sql-server-using-ssis-server
HTH,
Michele
Hi RR Netha and vyasmanish,
I believe that most of your questions were answered in the forums. But just in case:
Yes, the Hive driver will work with the Cloudera distribution.
For your 2nd question, visit Denny's blog and read his article on Connecting PowerPivot to Hadoop on Azure -- Self-Service BI to Big Data in the Cloud.
And, yes, the driver and add-in are free.