Role detection in online forums based on growth models for trees

Social Network Analysis and Mining - Tập 7 - Trang 1-13 - 2017
Alberto Lumbreras1, Bertrand Jouve2,3, Julien Velcin4, Marie Guégan5
1CNRS, Institut de Recherche en Informatique de Toulouse - UMR 5505, Toulouse, France
2FRAMESPA - UMR 5136, CNRS, Université de Toulouse, Toulouse, France
3IMT - UMR 5219, CNRS, Université de Toulouse, Toulouse, France
4Laboratoire ERIC, Université de Lyon, Bron, France
5Technicolor, Cesson-Sévigné, France

Tóm tắt

Some structural characteristics of online discussions have been successfully modeled in the recent years. When parameters of these models are properly estimated, the models are able to generate synthetic discussions that are structurally similar to the real discussions. A common aspect of these models is that they consider that all users behave according to the same model. In this paper, we combine a growth model with an Expectation–Maximization algorithm that finds different parameters for different latent groups of users. We use this method to find the different roles that coexist in the community. Moreover, we analyze whether we can predict users behaviors based on their roles. Indeed, we show that predictions are improved for some of the roles when compared with a simple growth model.

Tài liệu tham khảo

Agarwal N, Liu H, Tang L, Yu PS (2008) Identifying the influential bloggers in a community. In: Proceedings of the international conference on web search and web data mining—WSDM ’08, New York, NY, USA. ACM Press, p 207 Angeletou S, Rowe M, Alani H (2011) Modelling and analysis of user behaviour in online communities. In: Proceedings of the 10th international semantic web conference, pp 35–50 Aragón P, Gómez V, Kaltenbrunner A (2017) To thread or not to thread: the impact of conversation threading on online discussion. In: 11th international AAAI conference on web and social media. The AAAI Press Barabási A-L, Albert R (1999) Emergence of scaling in random networks. Science 286(October):509–512 Bensmail H, Celeux G, Raftery A, Robert C (1997) Inference in model-based cluster analysis. Stat Comput 7:1–10 Buntain C, Golbeck J (2014) Identifying social roles in reddit using network structure. In: Proceedings of the companion publication of the 23rd international conference on World wide web companion, pp 615–620 Chan J, Hayes, C, Daly E (2010) Decomposing discussion forums using common user roles. In: Proceedings of the WebSci10: extending the frontiers of society on-line Cheng J, Danescu-Niculescu-Mizil C, Leskovec J (2015) Antisocial behavior in online discussion communities. In: AAAI international conference on weblogs and social media. AAAI Press, pp 61–70 Choobdar S, Ribeiro P, Silva F (2017) Evolutionary role mining in complex networks by ensemble clustering. In: Proceedings of the symposium on applied computing, SAC ’17, New York, NY, USA. ACM, pp 1053–1060 Forestier M, Velcin J, Stavrianou A, Zighed D (2012) Extracting celebrities from online discussions. In: Proceedings of the 2012 IEEE/ACM international conference on advances in social networks analysis and mining, ASONAM 2012, pp 322–326 Golder SA (2003) A typology of social roles in usenet. Ph.D. thesis, Harvard University Gómez V, Kappen HJ, Kaltenbrunner A (2010) Modeling the structure and evolution of discussion cascades. In: Proceedings of the 22nd ACM conference on hypertext and hypermedia, pp 181–190 Gómez V, Kappen HJ, Litvak N, Kaltenbrunner A (2012) A likelihood-based framework for the analysis of discussion threads. World Wide Web 16(5–6):645–675 Goyal A, Bonchi F, Lakshmanan LV (2008). Discovering leaders from community actions. In: Proceeding of the 17th ACM conference on information and knowledge mining—CIKM ’08, New York, NY, USA. ACM Press, p 499 Himelboim I, Gleave E, Smith M (2009) Discussion catalysts in online political discussions: content importers and conversation starters. J Comput Med Commun 14(4):771–789 Kolaczyk ED (2009) Statistical analysis of network data: methods and models. Springer, New York Kumar R, Mahdian M, McGlohon M (2010) Dynamics of conversations. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 553–562 Kumar S, Spezzano F, Subrahmanian VS (2014) Accurately detecting trolls in Slashdot Zoo via decluttering. In: Proceedings of the 2014 IEEE/ACM international conference on advances in social networks analysis and mining, pp 188–195 Lui M, Baldwin T (2010) Classifying user forum participants: separating the gurus from the hacks, and other tales of the internet. In: Proceedings of Australasian language technology association workshop, pp 49–57 Nelder J, Mead R, Nelder BJ, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313 Nolker RD, Zhou L (2005) Social computing and weighting to identify member roles in online communities. In: The 2005 IEEE/WIC/ACM international conference on web intelligence (WI’05). IEEE, pp 87–93 Rowe M, Fernandez M, Angeletou S, Alani H (2013) Community analysis through semantic rules and role composition derivation. Web Semant Sci Serv Agents World Wide Web 18(1):31–47 Wang C, Ye M, Huberman BA (2012) From user comments to on-line conversations. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 244–252 White A, Chan J, Hayes C, Murphy BT (2012) Mixed Membership models for exploring user roles in online fora. In: Proceedings of the 6th annual international conference on weblogs and social media—ICWSM2012, pp 599–602