02.06.2013 Views

An NSA Big Graph experiment Paul Burkhardt, Chris Waring May 20, 2013

An NSA Big Graph experiment Paul Burkhardt, Chris Waring May 20, 2013

An NSA Big Graph experiment Paul Burkhardt, Chris Waring May 20, 2013

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Good principles for MapReduce algorithms<br />

Be prepared to Think in MapReduce. . .<br />

avoid iteration and minimize number of rounds<br />

limit child JVM memory<br />

number of concurrent tasks is limited by per machine RAM<br />

set IO buffers carefully to avoid spills (requires memory!)<br />

pick a good partitioner<br />

write raw comparators<br />

leverage compound keys<br />

minimizes hot-spots by distributing on key<br />

secondary-sort on compound keys is almost free<br />

Round-Memory tradeoff<br />

Constant, O(1), in memory and rounds.<br />

<strong>Paul</strong> <strong>Burkhardt</strong>, <strong>Chris</strong> <strong>Waring</strong> <strong>An</strong> <strong>NSA</strong> <strong>Big</strong> <strong>Graph</strong> <strong>experiment</strong><br />

<strong>NSA</strong>-RD-<strong>20</strong>13-056001v1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!