28.11.2014 Views

Data Warehousing and Analytics Infrastructure at Facebook

Data Warehousing and Analytics Infrastructure at Facebook

Data Warehousing and Analytics Infrastructure at Facebook

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Single Warehouse for Adhoc <strong>and</strong> Periodic jobs<br />

Challenge: support adhoc <strong>and</strong> periodic jobs in a single<br />

instance of the warehouse<br />

– New job arrives every 10 seconds<br />

– 90% jobs are small <strong>and</strong> finish within a few minutes<br />

– Users run lots of experiments to mine d<strong>at</strong>a<br />

– B<strong>at</strong>ch processing for large hourly/daily/weekly reporting<br />

– 99% uptime<br />

Hadoop Map-Reduce FairShare Scheduler<br />

– Each user gets an equal share of the cluster<br />

– Optimize l<strong>at</strong>ency for small jobs<br />

– Optimize cluster utiliz<strong>at</strong>ion <strong>and</strong> fair-share for larger jobs<br />

– Resource aware scheduling (CPU <strong>and</strong> memory resources only)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!