computing the quartet distance between general trees

Chapter 7. Sub-cubic time algorithm

[Plot: "Simple Python implementation of the sub-cubic algorithm without advanced mat. mult." Axes: number of leaves n (ticks 50, 100, 200, 500, 800) against time in seconds t(n) (ticks 0.01 to 1000). Series: bin, best exp = 2.03; ran, best exp = 2.06; sqrt, best exp = 1.81; star, best exp = 2.02; wc, best exp = 1.99. Reference curves: n, n^2, n^3.]

Figure 7.7: Performance of the naive prototype of the sub-cubic algorithm in Python.

[Plot: "Simple C++ implementation of the sub-cubic algorithm without advanced mat. mult." Axes: number of leaves n (ticks 50, 100, 200, 500, 800) against time in seconds t(n) (ticks 0.01 to 1000). Series: bin, best exp = 2.05; ran, best exp = 2.06; sqrt, best exp = 1.90; star, best exp = 2.96; wc, best exp = 2.18. Reference curves: n, n^2, n^3.]

Figure 7.8: Performance of the naive prototype of the sub-cubic algorithm in C++.
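The "best exp" values in the two figure legends read as fitted growth exponents. This excerpt does not show how they were obtained, so the following is only a sketch of one common way to estimate such an exponent: a least-squares line through the points (log n, log t(n)), whose slope is the exponent. The function name `fit_exponent` and the timings are made up for illustration and are not taken from the thesis.

```python
import math

def fit_exponent(sizes, times):
    """Return the least-squares slope of log(t) against log(n).

    If t(n) is roughly c * n^k, the slope approximates k.
    """
    xs = [math.log(n) for n in sizes]
    ys = [math.log(t) for t in times]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    return sxy / sxx

# Synthetic timings (not measurements) that follow t(n) = 1e-5 * n^2 exactly,
# sampled at the leaf counts shown on the horizontal axis of the figures.
sizes = [50, 100, 200, 500, 800]
times = [1e-5 * n ** 2 for n in sizes]
exponent = fit_exponent(sizes, times)
```

On exact power-law data the slope recovers the exponent up to rounding error; on real measurements it is only a summary of the trend over the sampled range.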
7.2. Implementation

oretic approach.

Mailund et al. [14] suggest the use of advanced matrix multiplication methods and substantiate their algorithmic results with the best theoretic result within the field of matrix multiplication, namely the Coppersmith-Winograd algorithm [7]. As mentioned in Chap. 1, this kind of theoretic result will be met with immediate scepticism by most programmers, since one might suspect that the demands of the theory can never be met by an implementation. In this case, the concern is that the input is too small for the improvement to be observable.

Here the ideal situation would be to find an algorithm that is sub-cubic in theory and can still be implemented efficiently in practice. Searching the literature and the web for a reasonable method for matrix multiplication, I came across the Strassen algorithm [18], which is approximately O(n^2.8) and is described as being efficient in practice for large matrices. However, I was not successful in finding an optimized implementation of the algorithm or a linear algebra library containing one. Since matrix multiplication is not the focus of this thesis, but rather a smaller piece in the algorithmic puzzle, time did not allow me to attempt implementing it on my own. In addition, it seems unreasonable to expect that I would be competitive with highly optimized libraries.

Instead I decided to go with a highly optimized, reliable and robust library. This is a meaningful decision, since the goal of the thesis is to test the practical usefulness of the algorithm. What is the use of a theoretic result if it is not practically applicable? The final choice was to use the Basic Linear Algebra Subprograms (BLAS) API⁴, which is a de facto standard for various linear algebra packages.
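For readers unfamiliar with the Strassen scheme mentioned above, the following is a minimal, unoptimized sketch in pure Python. It is not part of the thesis code and is not the implementation that was searched for. It assumes square matrices stored as lists of lists whose dimension is a power of two, and the helper names (`add`, `sub`, `naive_mul`, `split`, `strassen`) and the `cutoff` parameter are chosen for illustration only. The point is that each recursion step uses seven half-size products instead of eight, which gives the exponent log2(7), roughly 2.81.

```python
def add(X, Y):
    """Element-wise sum of two equally sized matrices."""
    return [[a + b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def sub(X, Y):
    """Element-wise difference of two equally sized matrices."""
    return [[a - b for a, b in zip(r, s)] for r, s in zip(X, Y)]

def naive_mul(X, Y):
    """Schoolbook matrix product, used as the base case."""
    cols = list(zip(*Y))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in X]

def split(M):
    """Split a square matrix of even dimension into four quadrants."""
    h = len(M) // 2
    return ([r[:h] for r in M[:h]], [r[h:] for r in M[:h]],
            [r[:h] for r in M[h:]], [r[h:] for r in M[h:]])

def strassen(A, B, cutoff=2):
    """Multiply square matrices whose dimension is a power of two."""
    if len(A) <= cutoff:
        return naive_mul(A, B)
    a11, a12, a21, a22 = split(A)
    b11, b12, b21, b22 = split(B)
    # Seven recursive products instead of the eight of the schoolbook scheme.
    m1 = strassen(add(a11, a22), add(b11, b22), cutoff)
    m2 = strassen(add(a21, a22), b11, cutoff)
    m3 = strassen(a11, sub(b12, b22), cutoff)
    m4 = strassen(a22, sub(b21, b11), cutoff)
    m5 = strassen(add(a11, a12), b22, cutoff)
    m6 = strassen(sub(a21, a11), add(b11, b12), cutoff)
    m7 = strassen(sub(a12, a22), add(b21, b22), cutoff)
    c11 = add(sub(add(m1, m4), m5), m7)
    c12 = add(m3, m5)
    c21 = add(m2, m4)
    c22 = add(add(sub(m1, m2), m3), m6)
    # Stitch the four quadrants back into one matrix.
    top = [left + right for left, right in zip(c11, c12)]
    bottom = [left + right for left, right in zip(c21, c22)]
    return top + bottom

# Small integer matrices, so the comparison with the schoolbook product is exact.
A = [[(i * j + 1) % 7 for j in range(8)] for i in range(8)]
B = [[(i + 2 * j) % 5 for j in range(8)] for i in range(8)]
C = strassen(A, B)
```

A sketch like this allocates many temporary lists and is far slower than a tuned library for the matrix sizes considered here, which is consistent with the decision to rely on an optimized library instead.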
Since my first implementation was in Python, BLAS caught my attention when I discovered that it integrates with SciPy/NumPy, which is automatically compiled against the BLAS installation if one is present on the machine. Furthermore, the Boost.NumericBindings library⁵ provides a generic layer between the Boost.uBlas data types and the BLAS linear algebra routines in C++.

The exact routines utilized are level 3 BLAS calls for matrix-matrix multiplication. In NumPy and Boost.NumericBindings these have general interfaces used through the calls C = dot(A,B) and void gemm(A,B,C) respectively. Both solutions require a bit of setup using the right type of matrix container structure, but with the libraries this is not difficult. I will not provide further details, but merely refer to the code, which is available (see App. B).

To get an idea of how the performance improved, a comparison has been made of the four implementations used, namely the two used in the prototypes and the two BLAS in-

⁴ The BLAS specification: http://www.netlib.org/blas/
⁵ Boost.NumericBindings: http://svn.boost.org/svn/boost/sandbox/numeric_bindings-v1/libs/numeric/bindings/doc/index.html
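To make the C = dot(A,B) call concrete, here is a small sketch that contrasts it with an explicit triple loop. It is not the thesis code from App. B; the function `naive_matmul`, the matrix shapes and the random seed are assumptions made for illustration. For two-dimensional floating-point arrays, NumPy's dot hands the product to a BLAS matrix-matrix routine when NumPy has been built against a BLAS library.

```python
import numpy as np

def naive_matmul(A, B):
    """Schoolbook product with explicit loops, for comparison only."""
    n, k = A.shape
    k2, m = B.shape
    assert k == k2, "inner dimensions must agree"
    C = np.zeros((n, m), dtype=A.dtype)
    for i in range(n):
        for j in range(m):
            for p in range(k):
                C[i, j] += A[i, p] * B[p, j]
    return C

# Hypothetical input: two small random matrices with a fixed seed.
rng = np.random.default_rng(0)
A = rng.random((20, 30))
B = rng.random((30, 10))

C_blas = np.dot(A, B)          # library call, BLAS-backed where available
C_naive = naive_matmul(A, B)   # same result up to floating-point rounding
```

Both calls compute the same product; the difference lies in the running time, which is what the comparison of implementations described above sets out to measure.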