In MapReduce, keeping track of keys and values becomes hectic for the programmer.
Apache Pig was introduced to solve this problem:
Apache Pig is one of the components of the Hadoop ecosystem. It works on data stored in HDFS and is meant for processing huge amounts of data over MapReduce.
Apache Pig provides a high-level, abstract language over MapReduce for processing data.
Apache Pig takes care of data summarization as part of that processing.
Processing in Apache Pig happens only by means of transformations; the language is built on transformation operations. Data flows through these transformations to produce the desired output.
Hence we can call it a data flow language or a transformational language.
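To make the data flow idea concrete, here is a minimal sketch in plain Python, not Pig Latin. It only imitates the shape of a Pig script, where each step takes the output of the previous one. The sample records and the function names (load, filter_rows, group_by_page, count_per_group) are made up for illustration and are not part of Pig.

```python
from collections import defaultdict

# Hypothetical sample records: (user, page, seconds_on_page)
records = [
    ("alice", "home", 12),
    ("bob", "home", 3),
    ("alice", "search", 40),
    ("carol", "home", 25),
    ("bob", "search", 8),
]


def load(rows):
    """Stand-in for a load step: hand out the raw tuples one at a time."""
    yield from rows


def filter_rows(rows, min_seconds):
    """Keep only the rows whose time on page is at least min_seconds."""
    for row in rows:
        if row[2] >= min_seconds:
            yield row


def group_by_page(rows):
    """Collect rows into one bag (list) per page name."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[1]].append(row)
    return groups


def count_per_group(groups):
    """Produce one (page, count) entry per group."""
    return {page: len(bag) for page, bag in groups.items()}


# Each line is one transformation; data flows from top to bottom.
visits = load(records)
long_visits = filter_rows(visits, min_seconds=10)
by_page = group_by_page(long_visits)
page_counts = count_per_group(by_page)

print(page_counts)  # {'home': 2, 'search': 1}
```

Notice that the programmer never manages keys and values by hand here; each step simply names its input and its output, which is the style a data flow language encourages.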
Pig Latin was introduced by researchers at Yahoo! Research and was later adopted by the Apache Software Foundation.
Pig Installation
Note: Apache Pig is not bundled with the default installation of Hadoop, hence we need to install Pig on the Hadoop boxes.
Note: Even though processing in Pig happens through multiple transformations, every transformation is internally converted into MapReduce work. Pig compiles the script into one or more MapReduce jobs, and several transformations may be combined into a single job.
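As a rough picture of what such a conversion has to produce, the sketch below simulates the three phases of a MapReduce job (map, shuffle, reduce) in plain Python for a simple group-and-count. This is not Pig's compiler and does not use Hadoop; the function names and the sample lines are assumptions chosen for illustration. It also shows the key/value bookkeeping that Pig hides from the programmer.

```python
from itertools import groupby
from operator import itemgetter


def map_phase(lines):
    """Map: emit a (key, value) pair of (word, 1) for every word."""
    for line in lines:
        for word in line.split():
            yield (word.lower(), 1)


def shuffle_phase(pairs):
    """Shuffle: sort the pairs by key and gather the values of each key."""
    ordered = sorted(pairs, key=itemgetter(0))
    for key, group in groupby(ordered, key=itemgetter(0)):
        yield key, [value for _, value in group]


def reduce_phase(grouped):
    """Reduce: collapse the list of values for each key into one sum."""
    for key, values in grouped:
        yield key, sum(values)


# Hypothetical input lines
lines = ["pig runs on hadoop", "hadoop runs mapreduce"]

result = dict(reduce_phase(shuffle_phase(map_phase(lines))))
print(result)
```

In hand-written MapReduce, the programmer writes the map and reduce parts and must decide what the key and the value are at every stage; in Pig, a grouping followed by a count expresses the same intent and the key/value handling is generated for you.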