Published on February 20, 2014
Automated Discovery and Visualization of Communication Networks from Social Media Anatoliy Gruzd @gruzd firstname.lastname@example.org Associate Professor, School of Information Management Director, Social Media Lab Faculty of Management / Faculty of Computer Science Dalhousie University London School of Economics February 20, 2014
Outline • Studying Online Social Networks • Sample Projects • Netlytic.org Anatoliy Gruzd Twitter: @gruzd 2
Growth of Social Media and Social Networks Data Facebook Social Media have become an integral part of our daily lives! Twitter 1B users 500M users
How to Make Sense of Social Big Data? Anatoliy Gruzd Twitter: @gruzd 10
How to Make Sense of Social Big Data? Social Big Data -> Visualizations -> Understanding (Development, Application & Validation) Anatoliy Gruzd Twitter: @gruzd 11
How to Make Sense of Social Big Data? Social Network Analysis (SNA) • Nodes = People • Edges /Ties (lines) = Relations/ “Who retweeted/ replied/ mentioned whom” Anatoliy Gruzd Twitter: @gruzd 12
Advantages of Social Network Analysis • Reduce the large quantity of data into a more concise representation • Makes it much easier to understand what is going on in a group Anatoliy Gruzd Once the network is discovered, we can find out: • How do people interact with each other, • Who are the most/least active members of a group, • Who is influential in a group, • Who is susceptible to being influenced, etc… Twitter: @gruzd 13
How Do We Collect Information About Online Social Networks? Common approach for collecting social network data: • Self-reported social network data may not be available/accurate • Surveys or interviews Problems with surveys or interviews • • • • • Time-consuming Questions can be too sensitive Answers are subjective or incomplete Participant can forget people and interactions Different people perceive events and relationships differently Anatoliy Gruzd Twitter: @gruzd 14
How Do We Collect Information About Social Networks? • Common approach: surveys or interviews • A sample question about students’ perceived social structures Please indicate on a scale from  to , YOUR FRIENDSHIP RELATIONSHIP WITH EACH STUDENT IN THE CLASS  - don’t know this person  - just another member of class  - a slight friendship  - a friend  - a close friend Alice D.      … Richard S.      Source: C. Haythornthwaite, 1999 Anatoliy Gruzd Twitter: @gruzd 15
How Do We Collect Information About Online Social Networks? Goal: Automated Networks Discovery Challenge: Figuring out what content-based features of online interactions can help to uncover nodes and ties between group members Anatoliy Gruzd Twitter: @gruzd 16
How Do We Collect Information About Online Social Networks? • Forum networks • Blog networks • Friends’ networks (Facebook, Twitter, Google+, etc…) • Networks of like-minded people (YouTube, Flickr, etc…) http://www.visualcomplexity.com/vc
Automated Discovery of Social Networks Direct Facebook Messages • Nodes = People Nick • Ties = “Who talks to whom” Rick • Tie strength = The number of messages exchanged between individuals Dick Anatoliy Gruzd Twitter: @gruzd 18
Automated Discovery of Social Networks “Many to Many” Communication Forum Anatoliy Gruzd Mailing listserv Chat Twitter: @gruzd Comments 19
Automated Discovery of Social Networks Approach 1: Chain Network (Reply-to) Posting header FROM: Sam PREVIOUS POSTER: Gabriel Content .... .... .... Anatoliy Gruzd Twitter: @gruzd 20
Automated Discovery of Social Networks Approach 1: Chain Network (Reply-to) Posting header FROM: Sam PREVIOUS POSTER: Gabriel Content “ Nick, Gina and Gabriel: I apologize for not backing this up with a good source, but I know from reading about this topic that … ” Possible Missing Connections: • Sam -> Nick • Sam -> Gina • Nick <-> Gina
Automated Discovery of Social Networks Approach 2: Name Network This approach looks for personal names in the content of the messages to identify social connections between group members. FROM: Ann “Steve and Natasha, I couldn't wait to see your site. I knew it was going to [be] awesome!” Anatoliy Gruzd Twitter: @gruzd 22
Automated Discovery of Social Networks Approach 2: Name Network • Main Communicative Functions of Personal Names (Leech, 1999) – getting attention and identifying addressee – maintaining and reinforcing social relationships • Names are “one of the few textual carriers of identity” in discussions on the web (Doherty, 2004) • Their use is crucial for the creation and maintenance of a sense of community (Ubon, 2005) • Remembering a person’s name can potentially shape that person’s response to a request (Howard et al. 1995, 1997) Anatoliy Gruzd Twitter: @gruzd 23
Automated Discovery of Social Networks Name Network Method: Challenges Kurt Cobain, a lead singer for the rock band Nirvana chris is not a group member Santa Monica Public Library John Dewey, philosopher & educator mark up language Solution: - Name alias resolution Anatoliy Gruzd Twitter: @gruzd 24
Evaluating Name Networks Example: Youtube comments Name Network Chain Network Chain Network (less connections) Anatoliy Gruzd Name Network (more connections) Twitter: @gruzd 25
Automated Discovery of Social Networks Twitter Messages • Nodes = People @John • Ties = “Who retweeted/ replied/mentioned whom” • Tie strength = The number of retweets, replies or mentions @Peter @Paul Anatoliy Gruzd Twitter: @gruzd 26
2012 Olympics in London Twitter: @dalprof Anatoliy Gruzd
#tarsand Twitter Community Twitter: @dalprof Anatoliy Gruzd
#1b1t Twitter Book Club Twitter: @dalprof Anatoliy Gruzd
Outline • Studying Online Social Networks • Sample Projects (1) Politics & (2) Health • Netlytic.org Anatoliy Gruzd Twitter: @gruzd 30
Sample Project: Politics Political Polarization on Social Media Gruzd, A. and Roy, J (2014, forthcoming). Political Polarization on Social Media: Do Birds of a Feather Flock Together on Twitter? Policy & Internet.
Sample Project: Politics Social Media Use during the Canadian Federal Election #CndPoli Twitter Communication Network (April 6-9, 2011) Anatoliy Gruzd Twitter: @gruzd 32
Sample Project: Politics Social Media Use during the Canadian Federal Election • There are some pockets of political polarization on Twitter Conservative – homophily - when people in social networks tend to group around similar backgrounds and interests, including shared political views. Left Liberal • Twitter has potential for supporting open cross-ideological discourse Anatoliy Gruzd Unknown & Undecided Green Bloc – 43% of the accounts in the sample did not explicitly stated their support for any party or stated support to more than one party Spam NDP Other Manual Classification of Twitter Users based on their self-declared party affiliation Twitter: @gruzd 33
Sample Project: Health Social Media & Health Gruzd, A. & Haythornthwaite, C. (2013). Enabling Community through Social Media. Journal of Medical Internet Research 15(10):e248. DOI: 10.2196/jmir.2796
Sample Project: Health Health Care Social Media Canada Case Study: #hcsmca Twitter Community Background • #hcsmca is a vibrant community of people interested in exploring social innovation in health care. • #hcsmca hosts a tweet chat every Wednesday at 1 pm ET. The last Wednesday of the month is our monthly evening chat at 9 pm ET. Source: http://cyhealthcommunications.wordpress.com/hcsmca-2/ Anatoliy Gruzd Twitter: @gruzd 35
Sample Project: Health Health Care Social Media Canada Case Study: #hcsmca Twitter Community Research question: 1. What accounts for the relative longevity of this particular online community? – Is it because of the founder’s leadership and her continuing involvement in this community? – Or is there a core group of members who are also actively and persistently involved in this community? 2. What is the composition of this community? Does one’s professional role/title determine a person’s centrality within this community. Anatoliy Gruzd Twitter: @gruzd 36
Sample Project: Health Some Topics Discussed by the #hcsmca Community Nov 14, 2012 Challenge of engaging SM to inform a research agenda Nov 21, 2012 Are healthcare blogs a useful tool for education and knowledge transfer? Number of Messages Over Time Anatoliy Gruzd Twitter: @gruzd 37
Sample Project: Health #hcsmca Communication Network on Twitter (Nov 12 - Dec 13, 2012) Roles SM health content providers Unaffiliated individual users Communicators - not specifically health related Communicators - Health related Count Healthcare professionals 50 Health institutions 31 Advocacy 30 Students 16 Educators, professors 13 Researchers Government and health policy makers 10 110 89 74 59 4 *Roles are assigned manually Node size = In-Degree Centrality Anatoliy Gruzd Twitter: @gruzd 38
Outline • Studying Online Social Networks • Sample Projects (1) Politics & (2) Health • Netlytic.org Twitter: @gruzd Anatoliy Gruzd 39
Netlytic.org - a cloud-based analytic tool Anatoliy Gruzd Networks Content Stats for automated text analysis & discovery of social networks from online communication Twitter: @gruzd 40
Small deck used during Use Case roundtable at JiveWorld 2014. On each on the 14 ta...
This 30 minute presentation was given at the 2014 Rochester Young Professionals En...
Automated Discovery and Visualization of Online Social Networks ... Growth of Social Media and Social Networks ... Automated Discovery of Communication ...
Automated Discovery and Visualization of Communication Networks from Social Media . Anatoliy Gruzd @dalprof . email@example.com . Associate Professor, School of ...
Embeddable version - Anatoliy Gruzd - Automated Discovery and Visualization of Communication Networks from Social Media. Presenter(s) To Be Announced ...
Title: Automated Discover and Visualization of Communication Networks from Social Media: Creator: Gruzd, Anatoliy: Date Issued: 2013: Description: Webcast ...
News & Events Events. IT ... Automated Discovery and Visualization of Formal and Informal ... discover and analyze various communication networks from ...
Semantic Wiki for Visualization of Social Media ... in efforts to enable automated network ... of temporal visualization for social communications ...
... An analysis of two-way communication on social media; ... Home / Research / Text Mining and Social Network ... “Automated Discovery of Emerging ...
NodeXL: Network Overview, Discovery and ... Direct Connections to Social Networks NodeXL Basic allows ... to the Social Media Research Foundation's NodeXL ...