Author Archives: Paul Hernandez

About Paul Hernandez

I'm an electronic engineer and computer science professional specializing in data analysis and business intelligence solutions. I'm also a father, swimmer and music lover.

Installing Apache Zeppelin 0.7.3 in HDP 2.5.3 with Spark and Spark2 Interpreters

Background: For a recent client requirement, I needed to propose a way to add Spark2 as an interpreter to Zeppelin in HDP (Hortonworks Data Platform) 2.5.3. The first hurdle is that HDP 2.5.3 comes with Zeppelin 0.6.0, which does not … Continue reading

Posted in Analytics, hadoop, Spark | 1 Comment

Talend job to lookup geographic coordinates into a shape file

Introduction: Recently, for an open data integration project, I had to select tools to process geospatial data. I had a couple of choices: I could use R and try to work out a solution … Continue reading

Posted in Business Intelligence, Geospatial data, Open Data, Talend | 3 Comments

Connect to Hive using Teradata Studio 16

Introduction: Teradata Studio is the client used to perform database administration tasks on Aster and Teradata databases, as well as to move data to and from Hadoop. Recently I was asked to test a solution to integrate Hadoop with Teradata in … Continue reading

Posted in Big Data, hadoop, Teradata | 9 Comments

Teradata Express 15.10 Installation using Oracle VirtualBox

Introduction: For professional reasons, I needed to start learning Teradata after some years of intensive Microsoft BI projects. To break the ice and have a playground to test everything I want, I decided to download the newest Teradata Express … Continue reading

Posted in Business Intelligence, Teradata, VirtualBox | 8 Comments

Apache Zeppelin installation on Windows 10

Disclaimer: I am not a Windows or Microsoft fan, but I am a frequent Windows user, and it’s the most common OS I find in enterprises everywhere. Therefore, I decided to try Apache Zeppelin on my Windows 10 laptop … Continue reading

Posted in Analytics, data visualization, R, Spark | 17 Comments

Introduction to R Services and R client – SQL Server 2016

Introduction: After some time using R and SQL Server as two different tools (not 100% true, because I have already imported data from SQL Server into RStudio), Microsoft is now offering as part of the SQL Server 2016 R … Continue reading

Posted in Analytics, Business Intelligence, R, SQL Server | 1 Comment

Export data to Hadoop using Polybase – Insert into external table

Introduction: This post is a continuation of "Polybase Query Service and Hadoop – Welcome SQL Server 2016". One of the most interesting use cases of Polybase is the ability to store historical data from relational databases in a Hadoop file system. … Continue reading

Posted in Big Data, hadoop, SQL Server | 17 Comments