TY - GEN
T1 - Developing a Dark Web collection and infrastructure for computational and social sciences
AU - Zhang, Yulei
AU - Zeng, Shuo
AU - Huang, Chun Neng
AU - Fan, Li
AU - Yu, Ximing
AU - Dang, Yan
AU - Larson, Catherine A.
AU - Denning, Dorothy
AU - Roberts, Nancy
AU - Chen, Hsinchun
PY - 2010
Y1 - 2010
N2 - In recent years, there have been numerous studies from a variety of perspectives analyzing the Internet presence of hate and extremist groups. Yet the websites and forums of extremist and terrorist groups have long remained an underutilized resource for terrorism researchers due to their ephemeral nature and access and analysis problems. The purpose of the Dark Web archive is to provide a research infrastructure for use by social scientists, computer and information scientists, policy and security analysts, and others studying a wide range of social and organizational phenomena and computational problems. The Dark Web Forum Portal provides web enabled access to critical international jihadist and other extremist web forums. The focus of this paper is on the significant extensions to previous work including: increasing the scope of data collection, adding an incremental spidering component for regular data updates; enhancing the searching and browsing functions; enhancing multilingual machine-translation for Arabic, French, German and Russian; and advanced Social Network Analysis. A case study on identifying active participants is shown at the end.
AB - In recent years, there have been numerous studies from a variety of perspectives analyzing the Internet presence of hate and extremist groups. Yet the websites and forums of extremist and terrorist groups have long remained an underutilized resource for terrorism researchers due to their ephemeral nature and access and analysis problems. The purpose of the Dark Web archive is to provide a research infrastructure for use by social scientists, computer and information scientists, policy and security analysts, and others studying a wide range of social and organizational phenomena and computational problems. The Dark Web Forum Portal provides web enabled access to critical international jihadist and other extremist web forums. The focus of this paper is on the significant extensions to previous work including: increasing the scope of data collection, adding an incremental spidering component for regular data updates; enhancing the searching and browsing functions; enhancing multilingual machine-translation for Arabic, French, German and Russian; and advanced Social Network Analysis. A case study on identifying active participants is shown at the end.
KW - Dark Web archive
KW - Incremental forum spidering
KW - Multilingual translation
KW - Social Network visualization
UR - http://www.scopus.com/inward/record.url?scp=77954799296&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77954799296&partnerID=8YFLogxK
U2 - 10.1109/ISI.2010.5484774
DO - 10.1109/ISI.2010.5484774
M3 - Conference contribution
AN - SCOPUS:77954799296
SN - 9781424464609
T3 - ISI 2010 - 2010 IEEE International Conference on Intelligence and Security Informatics: Public Safety and Security
SP - 59
EP - 64
BT - ISI 2010 - 2010 IEEE International Conference on Intelligence and Security Informatics
T2 - 2010 IEEE International Conference on Intelligence and Security Informatics: Public Safety and Security, ISI 2010
Y2 - 23 May 2010 through 26 May 2010
ER -