Data structures for statistical computing in Python - SciPy Conferences
Data structures for statistical computing in Python - SciPy Conferences
Data structures for statistical computing in Python - SciPy Conferences
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
pandas library<br />
Began build<strong>in</strong>g at AQR <strong>in</strong> 2008, open-sourced late 2009<br />
Many goals<br />
<strong>Data</strong> <strong>structures</strong> to make work<strong>in</strong>g with <strong>statistical</strong> or “labeled” data sets<br />
easy and <strong>in</strong>tuitive <strong>for</strong> non-experts<br />
Create a both user- and developer-friendly backbone <strong>for</strong> implement<strong>in</strong>g<br />
<strong>statistical</strong> models<br />
Provide an <strong>in</strong>tegrated set of tools <strong>for</strong> common analyses<br />
Implement <strong>statistical</strong> models!<br />
Takes some <strong>in</strong>spiration from R but aims also to improve <strong>in</strong> many<br />
areas (like data alignment)<br />
Core idea: ndarrays with labeled axes and lots of methods<br />
McK<strong>in</strong>ney () Statistical <strong>Data</strong> Structures <strong>in</strong> <strong>Python</strong> <strong>SciPy</strong> 2010 10 / 31