Hadoop word count

Learn How to manage massive Data

Simple Steps to get Started

Follow the steps Below to learn the Basics of Hadoop Big Data

Download VMWARE WORKSTATION PLAYER from this link : 64 bits systems: https://www.vmware.com/fr/products/player/playerpro-evaluation.html

Download the virtual machine, that contains hadoop 2.6.0 installed, and eclipse from this link: http://www.mediafire.com/download/6tq0hwxytk7mely/Ubuntu+64-bit.rar

Decompress the .rar file of the Virtual machine

Open VMWARE WORKSTATION PLAYER

Click on file then open a virtual machine

Go to the path where you decompressed the virtual machine and open ubuntu 64-bit.vmx file

Log in to the virtual machine, password is intellitech

Download the Wordcount Workflow Manual

Simple Steps to get Started

Follow the steps Below to learn the Basics of Hadoop Big Data

Open Terminal

Format the namenode via this command line hadoop namenode -format

Start the hdfs daemons (NameNode, SecondaryNameNode and DataNode) via this command line: start-dfs.sh

Start the yarn daemons (ResourceManager and NodeManager) via this command line: start-yarn.sh

Create the folder /training/wordcountinput/ where we should store the input file via this command line: hdfs dfs -mkdir -p /training/wordcountinput/

Copy the input file in the address previously created via this command line: hdfs dfs -copyFromLocal wordcountinput.txt /training/wordcountinput/

Open eclipse

Right Click on the button wordCount project

Click on export button

Click on JAR file

Click on browse and save the .jar in the following address : /home/hadoopworkshop/wordcount.jar

Click on finish

Run the jar file via this command line yarn jar wordcount.jar com.intellitech.wordcount.jobs.WordCountJob

Check the output files hdfs dfs -cat /training/wordcountoutput/part-r-00000 hdfs dfs -cat /training/wordcountoutput/part-r-00001

Still have questions ? Download FAQ

Need More Clarification on Hadoop Big Data Training ?