Spark, Impala, Tez and Hive: Interview with David Gruzman
February 24, 2015 by Saggi Neumann
Big Data consultant David Gruzman answered some of our burning questions about which Big Data platform to use, whether streaming is a must or not, and what are the biggest...

Become a Twitter Data Analyst with Integrate.io
September 30, 2014 by Saggi Neumann
Let’s say that you’re doing some marketing for a Big Data startup. As part of your campaign, you want to find the most influential tweeters who talk about Hadoop and...

How to get Website Visitor Geolocations from IPs
July 21, 2014 by Saggi Neumann
Although the Internet made the world flat, geography still matters. Knowing which countries your users live in could provide business opportunities to localize your services and increase profits. The only...

8 Data Integration Best Practices
June 26, 2014 by Saggi Neumann
You’ve spent hours tinkering and preparing the perfect dataflow to batch process zillions of web logs. Feeling satisfied, you run the job on one of the clusters and leave your...