Pig was initially developed at Yahoo! to let people using Hadoop® focus on analyzing large data sets rather than on writing mapper and reducer programs. Like actual pigs, which eat almost anything, the Pig programming language is designed to handle any kind of data, hence the name!
About This Course
Pig is made up of two components: the language itself, called Pig Latin, and a runtime environment in which Pig Latin programs are executed.
This course begins with an overview of Pig. It explains the data structures supported by Pig and how to access data using the LOAD operator. The next lesson covers the Pig relational operators. This is followed by the Pig evaluation functions, as well as math and string functions.
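To give a feel for what loading data into Pig's data structures involves, here is a minimal Python sketch. It is an analogy, not Pig code: the function name load_relation, the schema, and the sample records are all hypothetical. It assumes tab-delimited input (the default delimiter for Pig's PigStorage loader) and models a relation as a list of Python tuples, although a Pig bag is unordered. Turning unparsable fields into None is meant to mirror Pig's nulls and should be read as an assumption.

```python
import io

# Hypothetical schema, comparable in spirit to
# AS (name:chararray, age:int, gpa:float) in a Pig Latin LOAD statement.
SCHEMA = [("name", str), ("age", int), ("gpa", float)]

def load_relation(lines, schema, delimiter="\t"):
    """Return a list of tuples, one tuple per input line."""
    relation = []
    for line in lines:
        raw = line.rstrip("\n").split(delimiter)
        fields = []
        for i, (_, cast) in enumerate(schema):
            try:
                fields.append(cast(raw[i]))
            except (IndexError, ValueError):
                # Assumption: a missing or unparsable field becomes null (None).
                fields.append(None)
        relation.append(tuple(fields))
    return relation

# Made-up sample data standing in for a file on HDFS.
sample = io.StringIO("alice\t21\t3.7\nbob\tn/a\t3.1\n")
students = load_relation(sample, SCHEMA)
print(students)
```

Each line becomes one tuple whose fields follow the declared schema. The second record keeps its name and GPA but carries None for the age that could not be parsed as an integer.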
What will I get after passing this course?
You will receive a completion certificate.
You will receive the IBM Explorer - Big Data Programming badge (upon completion of all badge criteria).
COURSE SYLLABUS
- Lesson 1 - Pig Basics
  - List the ways to invoke Pig
  - Describe the data structures used in Pig
  - Explain how to use the LOAD operator to read data
  - Describe how to substitute parameters in a Pig script
  - Explain how to use the STORE operator to write data
- Lesson 2 - Pig Relational Operators
  - List the relational operators used in Pig
  - Explain how to flatten your data
  - Describe dereference operators
- Lesson 3 - Pig Evaluation Functions and Other Commands
  - List the eval functions used in Pig
  - Describe how to invoke HDFS shell commands from a Pig script or the Grunt shell
  - Explain how to execute a Pig script from the Grunt shell
  - Describe the use of the EXPLAIN operator
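Two of the Lesson 2 topics, flattening and dereferencing, can be illustrated with a small Python analogy. This is a sketch under stated assumptions, not Pig code: the data and the helper name flatten_bag are hypothetical, a bag is modeled as a list of tuples, and a Pig map is modeled as a dict.

```python
# Hypothetical grouped data: each row is (student, bag of (course, score) tuples).
grouped = [
    ("alice", [("math", 90), ("art", 85)]),
    ("bob", [("math", 70)]),
]

def flatten_bag(rows):
    """Un-nest the bag in each row: emit one output tuple per inner tuple,
    with the inner tuple's fields promoted to the top level."""
    out = []
    for key, bag in rows:
        for inner in bag:
            out.append((key,) + inner)
    return out

flat = flatten_bag(grouped)
print(flat)

# Dereferencing, by analogy: pick one field out of a tuple by position,
# or one value out of a map by key.
first_row = flat[0]
course = first_row[1]                 # positional access into a tuple
profile = {"city": "Paris"}           # made-up map value
city = profile.get("city")            # key lookup into a map
print(course, city)
```

Flattening turns the two grouped rows into three flat rows, one per (student, course, score) combination, which is the shape most downstream operators expect.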
GENERAL INFORMATION
- This course is free.
- It is self-paced.
- It can be taken at any time.
- It can be taken as many times as you wish.
Big Data University has been chosen by IBM as one of the issuers of badges as part of the IBM Open Badge program. Share your achievements through LinkedIn, Facebook, Twitter, and more!
Big Data University uses the services of Pearson VUE Acclaim to help administer the IBM Open Badge program. By enrolling in this course, you agree to Big Data University sharing your details with Pearson VUE Acclaim strictly for the purpose of issuing your badge once you complete the badge criteria.
REQUIREMENTS
- Have taken the Hadoop Foundations course on Big Data University
RECOMMENDED SKILLS PRIOR TO TAKING THIS COURSE
- Basic Linux Operating System knowledge
- Basic programming skills