Data Warehousing and Analytics Infrastructure at Facebook
Warehouse is Business Critical
Challenge: Remove all single points of failure (SPOF)
Hadoop NameNode was a SPOF
Hadoop AvatarNode
– Active-passive Hot Standby pair of NameNodes
– Failover time for a 20 PB file system with 65 million files is 10 seconds
– Work in progress to support active-active AvatarNode
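
The active-passive hot standby idea above can be illustrated with a minimal sketch. This is not the AvatarNode implementation; the `Node` and `FailoverMonitor` names, the heartbeat mechanism, and the timeout value are all assumptions chosen for illustration. It shows only the core rule: the standby is promoted when the active node has been silent for longer than a timeout and the standby itself is still reporting.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical stand-in for one NameNode in the pair."""
    name: str
    role: str              # "active" or "standby"
    last_heartbeat: float  # seconds, measured on the monitor's clock


class FailoverMonitor:
    """Illustrative monitor for an active-passive pair (assumed design)."""

    def __init__(self, active: Node, standby: Node, timeout_s: float):
        self.active = active
        self.standby = standby
        self.timeout_s = timeout_s

    def heartbeat(self, node: Node, now: float) -> None:
        # Record that the node reported in at time `now`.
        node.last_heartbeat = now

    def check(self, now: float) -> str:
        """Promote the standby if the active is silent; return the active's name."""
        active_silent = now - self.active.last_heartbeat > self.timeout_s
        standby_alive = now - self.standby.last_heartbeat <= self.timeout_s
        if active_silent and standby_alive:
            # Swap roles first, then swap which object the monitor treats as active.
            self.active.role, self.standby.role = "standby", "active"
            self.active, self.standby = self.standby, self.active
        return self.active.name


if __name__ == "__main__":
    a = Node("nn-a", "active", 0.0)
    b = Node("nn-b", "standby", 0.0)
    monitor = FailoverMonitor(a, b, timeout_s=5.0)
    monitor.heartbeat(b, 9.0)   # only the standby keeps reporting
    print(monitor.check(10.0))  # the standby has been promoted
```

A hot standby can take over quickly only because it already holds an up-to-date copy of the file system metadata; the sketch leaves that state replication out entirely and models the promotion decision alone.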