Harvesting microblogs for contextual music similarity estimation: a co-occurrence-based framework
Abstract
Microtexts are a valuable, albeit noisy, source to infer collaborative information. As music plays an important role in many human lives, microblogs on music-related activities are available in abundance. This paper investigates different strategies to estimate music similarity from these data sources. In particular, we first present a framework to extract co-occurrence scores between music artists from microblogs and then investigate 12 similarity estimation functions to subsequently derive resemblance scores. We evaluate the approaches on a collection of microblogs crawled from Twitter over a period of 10 months and compare them to standard tf-idf approaches. As evaluation criteria we use precision and recall in an artist retrieval task as well as rank proximity. We show that collaborative chatter on music can be effectively used to develop music artist similarity measures, which are a core part of every music retrieval and recommendation system. Furthermore, we analyze the effects of the “long tail” on retrieval results and investigate whether results are consistent over time, using a second dataset.
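The co-occurrence idea can be illustrated with a minimal Python sketch. It is not the paper's framework: the function names, the case-insensitive substring matching, the toy posts, and the normalization by the rarer artist's post count are all assumptions made for illustration, and this normalization is not claimed to be one of the 12 functions studied.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(posts, artists):
    """Count the posts mentioning each artist and each pair of artists.

    Matching is a naive case-insensitive substring test (an assumption);
    artist names are expected to be distinct.
    """
    occ = Counter()   # artist -> number of posts mentioning it
    cooc = Counter()  # (artist_a, artist_b), sorted -> posts mentioning both
    for text in posts:
        lowered = text.lower()
        mentioned = sorted(a for a in artists if a.lower() in lowered)
        occ.update(mentioned)
        cooc.update(combinations(mentioned, 2))
    return occ, cooc


def similarity(a, b, occ, cooc):
    """Hypothetical symmetric score: co-occurrences over the rarer artist's count."""
    pair = tuple(sorted((a, b)))
    denom = min(occ[a], occ[b])
    return cooc[pair] / denom if denom else 0.0


# Toy data, invented for this example.
posts = [
    "#nowplaying Madonna and then Kylie Minogue",
    "listening to Madonna",
    "Kylie Minogue, Madonna on repeat",
    "Metallica all day",
]
artists = ["Madonna", "Kylie Minogue", "Metallica"]

occ, cooc = cooccurrence_counts(posts, artists)
print(similarity("Madonna", "Kylie Minogue", occ, cooc))
print(similarity("Madonna", "Metallica", occ, cooc))
```

Ranking all other artists by such a score for a given seed artist yields a retrieval list, which is the kind of output that precision and recall can then be computed on.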