Title: Finding and validating medical information shared on Twitter: experiences using a crowdsourcing approach

Authors: Scott J. Duberstein; Daniel Adomako Asamoah; Derek Doran; Shu Z. Schiller

Addresses: Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH, USA ' Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH, USA ' Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH, USA ' Wright State University, 3640 Colonel Glenn Hwy, Dayton, OH, USA

Abstract: Social media provide users a channel to share meaningful and insightful information with their network of connected individuals. Harnessing this public information at scale is a powerful notion as social media is rife with public perceptions, signals, and data about a variety of topics. However, there is a common trade-off in collecting information from social media: the more specific the topic, the more challenging it is to extract reliable and truthful information. In this paper, we present an experience report describing our efforts in developing and applying a novel approach to identify, extract, and validate topic specific information using the Amazon Mechanical Turk (AMT) crowdsourcing platform. The approach was applied in a use-case where meaningful information about a medical condition (major depressive disorder) was successfully extracted from Twitter. Our approach, and lessons learned, may serve as a generic methodology for extracting relevant and meaningful data from social media platforms and help researchers who are interested in harnessing Twitter, AMT, and the like for reliable information discovery.

Keywords: crowdsourcing; Amazon Mechanical Turk; AMT; Twitter; social media; major depressive disorder.

DOI: 10.1504/IJWET.2019.100344

International Journal of Web Engineering and Technology, 2019 Vol.14 No.1, pp.80 - 98

Published online: 26 Jun 2019 *

Full-text access for editors Full-text access for subscribers Purchase this article Comment on this article