30 Chapter 2 • HADOOP Procedure
Program Description
1. The first FILENAME statement assigns the file reference CFG to the physical
location of a Hadoop configuration file that is named sample_config.xml, which is
shown in “Using PROC HADOOP” on page 22.
2. The second FILENAME statement assigns the file reference CODE to the physical
location of sample_pig.txt, the file that contains the Pig language code, which is
shown above.
3. The PROC HADOOP statement controls access to the Hadoop server: the OPTIONS=
option references the Hadoop configuration file, the USERNAME= and PASSWORD=
options identify the user ID and password on the Hadoop server, and the VERBOSE
option writes additional messages to the SAS log.
4. The PIG statement includes the following options:
• CODE= to specify the SAS fileref CODE that is assigned to the physical location
of the file that contains the Pig language code.
• REGISTERJAR= to specify the JAR file that contains the Pig scripts to execute.
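
The four steps above can be sketched as the following program. This is an illustrative reconstruction from the description, not the original listing: the file paths, user ID, password, and JAR file name are placeholders chosen for this sketch, and only the file names sample_config.xml and sample_pig.txt come from the text.

```sas
/* Placeholder paths and credentials: replace with values for your site. */
filename cfg  "C:\path\to\sample_config.xml";  /* 1. Hadoop configuration file */
filename code "C:\path\to\sample_pig.txt";     /* 2. Pig language code         */

/* 3. Reference the configuration file, supply the user ID and password, */
/*    and request additional messages in the SAS log with VERBOSE.       */
proc hadoop options=cfg username="your-user-id" password="your-password" verbose;
   /* 4. Submit the Pig code and register a JAR file (hypothetical path). */
   pig code=code registerjar="C:\path\to\your-jar-file.jar";
run;
```

The filerefs CFG and CODE are defined once in the FILENAME statements and then passed by name to OPTIONS= and CODE=, so changing a file location requires editing only the corresponding FILENAME statement.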