O'Hare, Neil, Davy, Michael, Bermingham, Adam, Ferguson, Paul, Sheridan, Páraic, Gurrin, Cathal ORCID: 0000-0003-4395-7702 and Smeaton, Alan F. ORCID: 0000-0003-1028-8389 (2009) Topic-dependent sentiment analysis of financial blogs. In: TSA 2009 - 1st International CIKM Workshop on Topic-Sentiment Analysis for Mass Opinion Measurement, 6 November 2009, Hong Kong, China. ISBN 978-1-60558-805-6
Abstract
While most work in sentiment analysis in the financial domain has focused on the use of content from traditional finance news, in this work we concentrate on a more subjective source of information: blogs. We aim to automatically determine the sentiment of financial bloggers towards companies and their stocks. To do this we develop a corpus of financial blogs, annotated with polarity of sentiment with respect to a number of companies. We conduct an analysis of the annotated corpus, from which we show there is a significant level of topic shift within this collection, and also illustrate the difficulty that human annotators have when annotating certain sentiment categories. To deal with the problem of topic shift within blog articles, we propose text extraction techniques to create topic-specific sub-documents, which we use to train a sentiment classifier. We show that such approaches provide a substantial improvement over full document classification and that word-based approaches perform better than sentence-based or paragraph-based approaches.
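The abstract does not spell out how the topic-specific sub-documents are extracted. As a rough illustration only, one possible word-based approach keeps the words that fall within a fixed window around each mention of the company. The function name `word_window_subdocument`, the `window` parameter, and the whitespace tokenisation are all assumptions made for this sketch, not details taken from the paper.

```python
import string


def word_window_subdocument(text, topic_terms, window=5):
    """Return the words lying within `window` tokens of any topic mention.

    Hypothetical sketch: tokenisation is plain whitespace splitting, and a
    token counts as a mention if it equals a topic term once surrounding
    punctuation is stripped and case is ignored.
    """
    tokens = text.split()
    terms = {term.lower() for term in topic_terms}
    keep = set()
    for i, token in enumerate(tokens):
        if token.strip(string.punctuation).lower() in terms:
            start = max(0, i - window)
            end = min(len(tokens), i + window + 1)
            keep.update(range(start, end))
    # Preserve the original word order in the extracted sub-document.
    return " ".join(tokens[i] for i in sorted(keep))


post = ("Oil prices rose today. I think Acme will struggle next quarter. "
        "Meanwhile the weather was lovely.")
sub_document = word_window_subdocument(post, ["Acme"], window=2)
```

The resulting sub-document, rather than the full post, would then be passed to whatever sentiment classifier is being trained. Sentence-based or paragraph-based variants would keep whole sentences or paragraphs containing a mention instead of a word window.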
Metadata
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Event Type: | Workshop |
Refereed: | Yes |
Uncontrolled Keywords: | sentiment analysis; opinion mining; financial blogs |
Subjects: | Computer Science > Machine learning; Computer Science > Information storage and retrieval systems |
DCU Faculties and Centres: | Research Institutes and Centres > National Centre for Language Technology (NCLT); DCU Faculties and Schools > Faculty of Engineering and Computing > School of Computing; Research Institutes and Centres > CLARITY: The Centre for Sensor Web Technologies |
Published in: | CIKM 2009 (The 18th ACM Conference on Information and Knowledge Management). Association for Computing Machinery. ISBN 978-1-60558-805-6 |
Publisher: | Association for Computing Machinery |
Official URL: | http://dx.doi.org/10.1145/1651461.1651464 |
Use License: | This item is licensed under a Creative Commons Attribution-NonCommercial-Share Alike 3.0 License. |
Funders: | Science Foundation Ireland, SFI 07/CE/I1147, Enterprise Ireland, EI IP/2008/0549 |
ID Code: | 14830 |
Deposited On: | 08 Sep 2009 15:12 by Neil OHare. Last Modified 02 Nov 2018 15:11 |
Documents
Full text available as: PDF (211kB)