Computational Models of Music Similarity and their ... - OFAI

More documents

Recommendations

Info

34 2 Audio-based Similarity Measures Blue Rondo A La Turk 86.4 Kathy’s Waltz 80.9 Bad Medicine 38.6 10 20 30 0 −39.5 0 −28.4 0 −9.1 Bring Me To Life 37.4 Someday 85.1 Bolero 43.0 10 20 30 10 20 30 0 −13.3 10 20 30 0 −13.6 10 20 30 0 −13.7 Figure 2.12: Full covariance matrices for 6 songs (G1). On both axes the dimensions are Mel frequency bands. The dimension of the gray shadings is dB. the problem pieces had only little variance in their spectra. For example, one of them was very short (30 second) and calm. Such cases can easily be identified and excluded (e.g., all pieces can be ignored which have a value larger than 10 10 in the inverse covariance). Illustrations Figure 2.12 shows the covariances for the 6 songs used in previous figures. As can be seen, there is a lot of information besides the diagonal. Noticeable are that the variances for lower frequencies are higher. Furthermore, for some of the songs there is a negative covariance between low frequencies and mid frequencies. Figure 2.13 shows the same plots for G1 which were already discussed for G30 and G30S in Figures 2.10 and 2.11. Since G1 uses only one Gaussian, there is only one line plotted in the second row. Noticeable, is also that there are more lines visible in rows 4 and 5. This indicates there are fewer frames (sampled or original) which have much higher probabilities than all others. Otherwise, in particular the third and last row are very similar to those of G30 and G30S and indicate that the models are very similar. 2.2.3.5 Computation Times The CPU times for G30, G30S, and G1 are given in Table 2.1. The frame clustering (FC) time is less interesting than the time needed to compute the cluster model similarity (CMS). While FC can be computed offline, either all possible distances need to be precomputed (and at least partially stored) or
2.2 Techniques 35 Blue Rondo ... Kathy’s Waltz Bad Med. Bring Me To ... Someday Bolero 20 0 −20 20 0 −20 20 0 −20 20 0 −20 20 0 −20 20 0 −20 10 20 30 10 20 30 10 20 30 10 20 30 10 20 30 10 20 30 Figure 2.13: Illustration of the cluster model for G1. The plots are generated as described in Figure 2.10.
Page 1: DISSERTATION Computational Models o
Page 5: Abstract This thesis aims at develo
Page 8 and 9: evaluate similarity measures for dr
Page 10 and 11: 2.2.7.3 Always Dissimilar . . . . .
Page 13 and 14: Chapter 1 Introduction This chapter
Page 15 and 16: 1.1 Outline of this Thesis 3 measur
Page 17 and 18: 1.2 Matlab Syntax 5 ◦ Development
Page 19 and 20: 1.2 Matlab Syntax 7 A frequently us
Page 21 and 22: Chapter 2 Audio-based Similarity Me
Page 23 and 24: 2.1 Introduction 11 Experts High qu
Page 25 and 26: 2.2 Techniques 13 2.2 Techniques To
Page 27 and 28: 2.2 Techniques 15 Amplitude 0 ZCR:
Page 29 and 30: 2.2 Techniques 17 MFCCs Mel Frequen
Page 31 and 32: 2.2 Techniques 19 Segment wav(idx)
Page 33 and 34: 2.2 Techniques 21 Triangular Filter
Page 35 and 36: 2.2 Techniques 23 num_coeffs = 5 nu
Page 37 and 38: 2.2 Techniques 25 2.2.2.5 Parameter
Page 39 and 40: 2.2 Techniques 27 used for clusteri
Page 41 and 42: 2.2 Techniques 29 FFT window size w
Page 43 and 44: 2.2 Techniques 31 Unlike G30 no ran
Page 45: 2.2 Techniques 33 2.2.3.4 Single Ga
Page 49 and 50: 2.2 Techniques 37 G30 G30S G1 G1 re
Page 51 and 52: 2.2 Techniques 39 Relative Fluctuat
Page 53 and 54: 2.2 Techniques 41 36 mel 71 1 12 me
Page 55 and 56: 2.2 Techniques 43 2.2.5.1 Time Doma
Page 57 and 58: 2.2 Techniques 45 Alternatively, th
Page 59 and 60: 2.2 Techniques 47 ZCR (×10 −3 )
Page 61 and 62: 2.2 Techniques 49 2.2.6 Linear Comb
Page 63 and 64: 2.3 Optimization and Evaluation 51
Page 97 and 98:
2.3 Optimization and Evaluation 85
Page 99 and 100:
2.3 Optimization and Evaluation 87
Page 101 and 102:
2.5 Alternative: Web-based Similari
Page 103 and 104:
2.6 Conclusions 91 2.5.3 Limitation
Page 105 and 106:
Chapter 3 Applications This chapter
Page 107 and 108:
3.2 Islands of Music 95 Figure 3.1:
Page 109 and 110:
3.2 Islands of Music 97 they use to
Page 111 and 112:
3.2 Islands of Music 99 a b c d Fig
Page 113 and 114:
3.2 Islands of Music 101 AMBIENT CL
Page 115 and 116:
3.2 Islands of Music 103 Figure 3.6
Page 117 and 118:
3.2 Islands of Music 105 scribing a
Page 119 and 120:
Page 121 and 122:
Page 123 and 124:
3.3 Fuzzy Hierarchical Organization
Page 125 and 126:
Page 127 and 128:
Page 129 and 130:
Page 131 and 132:
Page 133 and 134:
Page 135 and 136:
Page 137 and 138:
3.4 Dynamic Playlist Generation 125
Page 139 and 140:
Page 141 and 142:
Page 143 and 144:
Page 145 and 146:
Page 147 and 148:
Page 149 and 150:
3.5 Conclusions 137 + Punk / Bad Re
Page 151 and 152:
Chapter 4 Conclusions In this thesi
Page 153 and 154:
Bibliography [AHH + 03] Eric Allama
Page 155 and 156:
[CKGB02] Pedro Cano, Martin Kaltenb
Page 157 and 158:
[Got03] Masataka Goto, A Chorus-Sec
Page 159 and 160:
[Lüb05] Dominik Lübbers, SoniXplo
Page 161 and 162:
[PFW05b] , Improvements of Audio-Ba
Page 163 and 164:
[SKW05a] Markus Schedl, Peter Knees
Page 165:
Elias Pampalk I was born in 1978 in
show all

Computational Models of Music Similarity and their ... - OFAI

Create successful ePaper yourself

Delete template?

Save as template?