TY - GEN
T1 - IISE 2019 conference & expo a multi-algorithm approach for classifying misinformed Twitter data during crisis events
AU - Hunt, Kyle
AU - Agarwal, Puneet
AU - Zhuang, Jun
N1 - Publisher Copyright:
© 2019 IISE Annual Conference and Expo 2019. All rights reserved.
PY - 2019
Y1 - 2019
N2 - Social media is being increasingly utilized to spread breaking news and updates during disasters of all magnitudes. Unfortunately, due to the unmoderated nature of social media platforms such as Twitter, rumors and misinformation are able to propagate widely. Given this, a surfeit of research has studied rumor diffusion on social media, especially during natural disasters. In many studies, researchers manually code social media data to further analyze the patterns and diffusion dynamics of users and misinformation. This method requires many human hours, and is prone to significant incorrect classifications if the work is not checked over by another individual. In our studies, we fill the research gap by applying seven different machine learning algorithms to automatically classify misinformed Twitter data that is spread during disaster events. Due to the unbalanced nature of the data, three different balancing algorithms are also applied and compared. We collect and drive the classifiers with data from the Manchester Arena bombing (2017), Hurricane Harvey (2017), the Hawaiian incoming missile alert (2018), and the East Coast US tsunami alert (2018). Over 20,000 tweets are classified based on the veracity of their content as either true, false, or neutral, with overall accuracies exceeding 89%.
AB - Social media is being increasingly utilized to spread breaking news and updates during disasters of all magnitudes. Unfortunately, due to the unmoderated nature of social media platforms such as Twitter, rumors and misinformation are able to propagate widely. Given this, a surfeit of research has studied rumor diffusion on social media, especially during natural disasters. In many studies, researchers manually code social media data to further analyze the patterns and diffusion dynamics of users and misinformation. This method requires many human hours, and is prone to significant incorrect classifications if the work is not checked over by another individual. In our studies, we fill the research gap by applying seven different machine learning algorithms to automatically classify misinformed Twitter data that is spread during disaster events. Due to the unbalanced nature of the data, three different balancing algorithms are also applied and compared. We collect and drive the classifiers with data from the Manchester Arena bombing (2017), Hurricane Harvey (2017), the Hawaiian incoming missile alert (2018), and the East Coast US tsunami alert (2018). Over 20,000 tweets are classified based on the veracity of their content as either true, false, or neutral, with overall accuracies exceeding 89%.
KW - Disaster
KW - Machine learning
KW - Misinformation
KW - Social media
KW - Twitter
UR - https://www.scopus.com/pages/publications/85085507480
M3 - Conference contribution
AN - SCOPUS:85085507480
T3 - IISE Annual Conference and Expo 2019
BT - IISE Annual Conference and Expo 2019
PB - Institute of Industrial and Systems Engineers, IISE
T2 - 2019 Institute of Industrial and Systems Engineers Annual Conference and Expo, IISE 2019
Y2 - 18 May 2019 through 21 May 2019
ER -