Full proceedings volume - Australasian Language Technology ...
Full proceedings volume - Australasian Language Technology ...
Full proceedings volume - Australasian Language Technology ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Ex. 1 Ex. 2 Ex. 3 Score Song Title - Artist<br />
2 1 6 931.99 Fools Get Wise - B.B. King<br />
8 8 1 876.10 Strange - R.E.M<br />
1 7 7 837.14 Lonely Days - M.E.S.T.<br />
3 2 4 625.93 Thief Of Always - Jaci Velasquez<br />
9 4 9 372.41 Almost Independence Day - Van Morrison<br />
7 6 3 343.87 Impossible - UB40<br />
5 5 2 299.51 Try Me - Val Emmich<br />
4 3 8 134.05 One Too Many - Baby Animals<br />
6 9 5 131.38 Aries - Galahad<br />
10 10 10 26.76 Walkin’ On The Sidewalks - Queens of the Stone Age<br />
Table 11: Expected and application rankings for ten randomly selected songs.<br />
LyricWiki Collection<br />
Number One Hits<br />
0 1000 2000 3000 4000<br />
Figure 1: Boxplots showing the quartiles of lyric<br />
scores for the lyric collection from the 1990-2010<br />
era, and the corresponding set of number one hits<br />
from the era.<br />
hits are indeed more typical than other songs,<br />
or perhaps that a song that reaches number<br />
one influences other lyricists who then create<br />
works in a similar style. Earlier attempts to<br />
compare number one hits with the full collection<br />
of lyrics revealed an increase in cliché<br />
score over time for hit songs. We believe that<br />
this was not so much due to an increase in<br />
cliché in pop over time but that the language<br />
in the lyrics of popular music changes over<br />
time, as happens with all natural language.<br />
6 Conclusion<br />
We have explored the use of tf-idf weighting to<br />
find typical phrases and rhyme pairs in song<br />
lyrics. These attributes have been extracted<br />
with varying degrees of success, dependent on<br />
sample size. The use of a background model<br />
of text worked well in removing ordinary language<br />
from the results, and the technique may<br />
go towards improving rhyme detection software.<br />
An application was developed that estimates<br />
how clichéd given song lyrics are. However,<br />
while results were reasonable for distinguishing<br />
very clichéd songs from songs that<br />
are fairly free from cliché, it was less successful<br />
with songs that are not at the extremes.<br />
Our method of obtaining human judgements<br />
was not ideal, consisting of two rankings<br />
of ten songs by the research team involved<br />
in the project. For our future work we hope<br />
to obtain independent judgements, possibly of<br />
smaller snippets of songs to make the task easier.<br />
As it is unclear how consistent people are<br />
in judging the clichédness of songs, we expect<br />
to collect a larger set of judgements per lyric.<br />
There were several instances where annotations<br />
in the lyrics influenced our results. Future<br />
work would benefit from a larger, more<br />
accurately transcribed collection. This could<br />
be achieved using Multiple Sequence Alignment<br />
as in Knees et al. (2005). Extending the<br />
model beyond trigrams may also yield interesting<br />
results.<br />
A comparison of number one hits with a<br />
larger collection of lyrics from the same time<br />
period revealed that the typical number one<br />
hit is more clichéd, on average. While we have<br />
examined the relationship between our cliché<br />
score and song popularity, it is important to<br />
note that there is not necessarily a connection<br />
between these factors and writing quality, but<br />
this may also be an interesting area to explore.<br />
95