Tuesday 28 January 2014

ACCESS HBASE TABLE WITH TABLEAU DESKTOP 8.0




Hope you have Tableau installed on system .

Concept :-

You can't directly connect to hbase  table via tableau you need to connect to hive table and hive internally mapped to hbase table.

Please check below link for more explanation :

http://nosql.mypopescu.com/post/17262685876/visualizing-hadoop-data-with-tableau-software-and

Step 1 :- 


Download Tableau driver for hive


Step 2 :- (Driver installation)

Install Above downloaded driver.

Step 3 :- (Configure ODBC driver)

Click on start go to Data Source (ODBC).

Click on System DSN.

Select Cloudera ODBC driver for Apache Hive.

Fill the details.

Save Setting.

Step 4 :- (Run Hive as Thrift service)

[ nitin@nitin-ubuntu:~ # ] $ sudo hive --service hiveserver --hiveconf /etc/hive/conf/hive-site.xml

Make sure you have auxpath set in above hive-site.xml and all jar present .

Below jar needed by hive client to talk to hbase and get data from hbase.

For example :- 

<property>
    <name>hive.aux.jars.path</name>
    <value>file:///usr/lib/hive/lib/hive-hbase-handler-0.10.0-cdh4.4.0.jar,file:///usr/lib/hbase/lib/hbase-0.94.6-cdh4.4.0.jar,file:///usr/lib/zookeeper/zookeeper-3.4.5-cdh4.4.0.jar,file:///usr/share/cmf/lib/guava-14.0.jar
   </value>
</property>


Step 4 :- (Connect tableau to hive tables)

Select tableau from start menu.

Go to data Click on connect data than click on cloudera database.

It will ask you to make connections.

Give your hive thrift  server IP and port as 10000.

Click on connect.

If its connected properly than you will get default in schema section.

Select table where you want to make computation.

Click OK.








4 comments:

  1. Not sure,,if the previous comment came thru
    So typing again...

    thx for the blog.
    a quick quesion..what are ur thoughts on the ODBC driver from Simbah technologies (just saw that when I googled, and ofcuse saw ur post too..)

    if we connnect to hbase thru hive then the speed inmemory will go down, right ?

    Thanks for letting me know ur thoughts !

    ReplyDelete
    Replies
    1. Simbah ODBC connecter will also work well. I guess there will be some performance issues but you can download free version of Simbah and test as per your business logic.

      Delete
  2. This comment has been removed by the author.

    ReplyDelete
  3. Nice stuff you have shared on Hadoop data analytics and I really like it. Thanks for sharing.

    Rickybroad

    ReplyDelete

Ansible Cheat sheet

Install Ansible  # yum install ansible Host file configuration  File  [ansible@kuber2 ~]$ cat /etc/ansible/hosts     [loca...