Agenda detector: labeling tweets with political policy agenda

dc.contributor.advisor Wallapak Tavanapong
dc.contributor.author Kaul, Sheetal
dc.contributor.department Computer Science
dc.date 2018-08-11T19:15:33.000
dc.date.accessioned 2020-06-30T02:57:31Z
dc.date.available 2020-06-30T02:57:31Z
dc.date.copyright Thu Jan 01 00:00:00 UTC 2015
dc.date.embargo 2016-03-24
dc.date.issued 2015-01-01
dc.description.abstract <p>In nearly one decade of Twitter’s being it has witnessed an ever growing user base from various realms of the world, one of them being politics. In the political domain, Twitter is used as a vital tool for communication purposes, running effective e-campaigns, and mining and affecting public opinions to name a few. We study the problem of automatically detecting whether a tweet posted by a state’s Senate’s twitter handle in the US has a reference to policy agenda(s). Such a capability can help detect the policy agendas that a state focuses on and also capture the inception of ideas leading to framing of bill/law. Furthermore, analyzing the spatial and temporal dynamics of tweets carrying policy agendas can facilitate study of policy diffusion among states, and help in comprehending the changing aspects of states learning policy-making from each other.</p> <p>Currently, no study has been carried out that analyzes Twitter data to detect whether or not a tweet refers to a policy agenda. We present our analysis on 122,965 tweets collected from verified Twitter handles of the US state’s upper house – Senate. We present our high-level analysis on (a) how much Twitter has penetrated into state politics and (b) how states use the medium differently in terms of the messages they broadcast. Our proposed approach aims to automate classification of a tweet based on having a reference to policy agenda (Has Agenda) or not (No Agenda). We accomplish this by leveraging existing text classification methodology and achieve a recall of 89.1% and precision of 77.2% for the “Has Agenda” class. We investigate several machine learning algorithms to determine the best performing one for our binary classification problem. We conclude that support vector machine using linear kernel was the most efficient algorithm to use for our dataset. Lastly, we propose a set of hand-crafted features that together with feature selection and stemming improved our classifier’s performance. Prior to including these features the classifier was developed using, basic preprocessing techniques, and term occurrence (for feature extraction). An overall improvement of 5.187 % at a significance level of α=0.05 was achieved.</p>
dc.format.mimetype application/pdf
dc.identifier archive/lib.dr.iastate.edu/etd/14553/
dc.identifier.articleid 5560
dc.identifier.contextkey 7988763
dc.identifier.doi https://doi.org/10.31274/etd-180810-4101
dc.identifier.s3bucket isulib-bepress-aws-west
dc.identifier.submissionpath etd/14553
dc.identifier.uri https://dr.lib.iastate.edu/handle/20.500.12876/28738
dc.language.iso en
dc.source.bitstream archive/lib.dr.iastate.edu/etd/14553/Kaul_iastate_0097M_15013.pdf|||Fri Jan 14 20:22:14 UTC 2022
dc.subject.disciplines Computer Sciences
dc.subject.disciplines Political Science
dc.subject.keywords Computer Science
dc.subject.keywords State Legislature
dc.subject.keywords Support Vector Machine
dc.subject.keywords Twitter
dc.title Agenda detector: labeling tweets with political policy agenda
dc.type article
dc.type.genre thesis
dspace.entity.type Publication
relation.isOrgUnitOfPublication f7be4eb9-d1d0-4081-859b-b15cee251456
thesis.degree.level thesis
thesis.degree.name Master of Science
File
Original bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Kaul_iastate_0097M_15013.pdf
Size:
1.1 MB
Format:
Adobe Portable Document Format
Description: