12.07.2015 Views

Transfer of Data from Terabyte Disk to BMRC Archives

Transfer of Data from Terabyte Disk to BMRC Archives

Transfer of Data from Terabyte Disk to BMRC Archives

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Finally, ‘au<strong>to</strong>-nxt-run.config’:% tail $HOME/au<strong>to</strong>-nxt-run.config#MailTo="l.hanson@bom.gov.au"export MailToRunPath="/bm/gkeep/lih/bmrc/afm/p03"export RunPathRemDFlist="remoteDirFileList.txt"export RemDFlistYou will need <strong>to</strong> change the ‘MailTo’, and ‘RunPath’ entries. The ‘RunPath’ item is the ‘gale’direc<strong>to</strong>ry you have set up <strong>to</strong> hold the scripts and data lists, and <strong>from</strong> which you will run the scriptswhich au<strong>to</strong>mate the various processes <strong>of</strong> this work.5. Create, or take a copy <strong>of</strong> the ‘listalldirs.txt’ file (used by scripts like ‘list-size.pl’), whichis an ‘ls -lR’ format listing <strong>of</strong> all <strong>of</strong> the files available on the PCMDI data portal.4 <strong>Transfer</strong> <strong>Data</strong> From <strong>Terabyte</strong> <strong>Disk</strong> To ‘gale’<strong>Data</strong> is transferred <strong>from</strong> the TB disk (connected on the Linux PC) over <strong>to</strong> a staging area on ‘gale’(rather than directly accross <strong>to</strong> the long term archive s<strong>to</strong>rage system) because some files (i.e., huge dailydata files) need <strong>to</strong> be further processed <strong>to</strong> extract a sub-set <strong>of</strong> the data they contain (<strong>to</strong> reduce the finalamounts <strong>of</strong> s<strong>to</strong>rage required). The steps required in this data transfer process are:1. Partition the data list file (e.g., ‘data?.txt’), if required, in<strong>to</strong> a series <strong>of</strong> more manageable part listfiles:% ~lih/bin/df-part.sh data1.txtThis creates a series <strong>of</strong> files named like ‘data1-p1.txt’, ‘data1-p2.txt’, ‘data1-p3.txt’, etc.Or if the data list file is deemed small enough <strong>to</strong> handle in one chunk, then make a copy <strong>of</strong> it withan appropriate part number, for example:% cp data1.txt data1-p1.txt2. Use the ‘list-size.pl’ script <strong>to</strong> determine the <strong>to</strong>tal size <strong>of</strong> the netCDF files in the data list file(or part data list file), for example:% list-size.pl data1-p1.txtlist-size: List number <strong>of</strong> netCDF files: 55list-size: List <strong>to</strong>tal netCDF file size: 49394698640 => ( 46 GB)list-size: File sizes were obtained by: SymLinks: 55list-size: Symbolic Links: file count: 553. Start the transfer <strong>of</strong> data <strong>from</strong> the terabyte disk <strong>to</strong> ‘gale’ by running the ‘nxt-run’ script with theappropriate arguments. For example, <strong>to</strong> commence the transfer <strong>of</strong> the files listed in the part file‘data1-p1.txt’, you would run the command:% nxt-run 1 1This process can take a considerable amount <strong>of</strong> time. For example, when connected <strong>to</strong> a USB-1port, the transfer <strong>of</strong> approximately 46 gigabytes <strong>of</strong> data <strong>to</strong>ok just over seventeen (17) hours, hence,the effective data transfer rate was about 0.8 megabytes per second.Soon after we started, we replaced the USB port with a FireWire (i.e., IEEE 1394) interface, and thisconsiderably reduced the data transfer times, producing data transfer rates <strong>of</strong> about 11.5 megabytesper second, so a data transfer <strong>of</strong> 46 GB now takes approximately sixty eight (68) minutes.4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!