Category: hadoop

October 10, 2017

Installing Apache Zeppelin 0.7.3 in HDP 2.5.3 with Spark and Spark2 Interpreters

Background As a recent client requirement I needed to propose a solution in order to add spark2 as interpreter to zeppelin in HDP (Hortonworks Data Platform) 2.5.3 The first hurdle is, HDP 2.5.3 comes with zeppelin 0.6.0 which does not support spark2, which was included as a technical preview. Upgrade the HDP version was not…
January 25, 2017

Connect to Hive using Teradata Studio 16

Introduction Teradata Studio is the client used to perform database administration task on Aster and Teradata databases, as well as moving data from and to Hadoop. Recently I was asked to test a solution to integrate Hadoop with Teradata in order to build a modern Data Warehouse architecture, this was my first step and I…
June 16, 2016

Export data to Hadoop using Polybase – Insert into external table

Introduction This post is a continuation of Polybase Query Service and Hadoop – Welcome SQL Server 2016 One of the most interesting use cases of Polybase is the ability to store historical data from relational databases into a Hadoop File System. The storage costs could be reduced while keeping the data accessible and still can be…
May 29, 2016

Polybase Query Service and Hadoop – Welcome SQL Server 2016

Introduction One of the coolest features of SQL Server 2016 is Polybase. Already available for Parallel Data Warehouse, this functionality is now integrated in SQL Server 2016 and allows to combine relational and non-relational data, for example, query data in Hadoop and join it with relational data, import external data into SQL Server or export…
May 8, 2016

My experience building Hadoop 2.7.1 on Windows Server 2012

Introduction Building the Hadoop sources on windows could be cumbersome even when the official documentation states: “… building a Windows package from the sources is fairly straightforward”. There are several good resources containing the steps needed in order to successfully build a distribution. The most useful for me was this one: Hadoop 2.7.1 for Windows…

Category: hadoop

Installing Apache Zeppelin 0.7.3 in HDP 2.5.3 with Spark and Spark2 Interpreters

Connect to Hive using Teradata Studio 16

Export data to Hadoop using Polybase – Insert into external table

Polybase Query Service and Hadoop – Welcome SQL Server 2016

My experience building Hadoop 2.7.1 on Windows Server 2012