What an awesome product.
Finally, I got my certificate for the Coursera ‘Introduction to Data Science’ course!
This weekend, I finished my five-node Raspberry Pi cluster with a custom power source (5V/10A). I installed Hadoop on the five Pis just for educational purposes. I also used a Debian install in VirtualBox as the Hadoop master node. Works like a …
Today I’m analyzing the properties of a 0.5TB dataset (a graph with a billion vertices) using Pig/Hadoop on Amazon’s Elastic MapReduce service. I configured a cluster with the following nodes: 1 MASTER: c1.medium; 9 CORE: c1.xlarge (High-CPU …