An effective and efficient subpopulation extraction method in large social networks

Bin Zhang, David Krackhardt, Ramayya Krishnan, Patrick Doreian

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Scopus citations

Abstract

With the help of information technologies, we have access to very large networks, even with billions of nodes. This large size has limited our ability to perform analysis and provide theoretical compelling explanation on the whole network. One solution is to extract connected subgraphs and analyze them as subpopulations. We propose a method for extracting such subpopulation archiving two desirable properties: 1) be effective, resulting in subpopulations with more ties within them than to the external network; and 2) be fast, so that it scales well to large networks. We develop a method called the "Transitive Clustering and Pruning" (T-CLAP) algorithm. We compare the speed and effectiveness of this algorithm to two other popularly community detection algorithms - Newman's and Clauset's algorithms. We find that T-CLAP is orders of magnitudes faster than Newman's algorithm; and is superior to Clauset's algorithm in terms of returning effective subpopulations that are useful.

Original languageEnglish (US)
Title of host publicationInternational Conference on Information Systems 2011, ICIS 2011
Pages477-493
Number of pages17
StatePublished - 2011
Event32nd International Conference on Information System 2011, ICIS 2011 - Shanghai, China
Duration: Dec 4 2011Dec 7 2011

Publication series

NameInternational Conference on Information Systems 2011, ICIS 2011
Volume1

Other

Other32nd International Conference on Information System 2011, ICIS 2011
Country/TerritoryChina
CityShanghai
Period12/4/1112/7/11

Keywords

  • Large scale data
  • Social network
  • Subpopulation extraction

ASJC Scopus subject areas

  • Information Systems

Fingerprint

Dive into the research topics of 'An effective and efficient subpopulation extraction method in large social networks'. Together they form a unique fingerprint.

Cite this