Abstract
Social media has recently become a rich resource for mining user sentiment. In this paper, Twitter is chosen as a platform for opinion mining in trading strategy with the products of Mubasher, a leading stock analysis software provider in the Gulf region. The experiment proposes a model for sentiment analysis of Saudi Arabic (standard and Arabian Gulf dialect) tweets to extract feedback on Mubasher products. A hybrid of natural language processing and machine learning approaches is used to build models that classify tweets by sentiment polarity into one of three classes: positive, negative, or neutral. Firstly, document pre-processing is explored on the dataset. Secondly, Naive Bayes and Support Vector Machines (SVMs) are applied with different feature selection schemes, such as TF-IDF (Term Frequency-Inverse Document Frequency) and BTO (Binary-Term Occurrence). Thirdly, the proposed model is extended to obtain results for N-grams of tokens. Finally, the data were labelled by humans, so the labelling process may contain some mistakes. At present, including the neutral class when generalising our classification changes the resulting classification accuracy.
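The two feature schemes named in the abstract can be illustrated with a minimal, standard-library sketch. The abstract does not state the exact weighting formulas or tooling, so the TF-IDF variant below (term count divided by document length, times `log(N / df)`) is an assumption, and the function names and toy tweets are hypothetical English placeholders rather than the paper's data.

```python
import math
from collections import Counter


def ngrams(tokens, n_max=2):
    """Return all contiguous n-grams of length 1..n_max as space-joined strings."""
    return [
        " ".join(tokens[i:i + n])
        for n in range(1, n_max + 1)
        for i in range(len(tokens) - n + 1)
    ]


def bto_vector(terms):
    """Binary-Term Occurrence: 1 if the term occurs in the document, whatever its count."""
    return {t: 1 for t in set(terms)}


def tfidf_vectors(docs_terms):
    """TF-IDF, assuming tf = count / document length and idf = log(N / df)."""
    n_docs = len(docs_terms)
    # Document frequency: number of documents in which each term appears.
    df = Counter(t for terms in docs_terms for t in set(terms))
    vectors = []
    for terms in docs_terms:
        counts = Counter(terms)
        total = len(terms)
        vectors.append(
            {t: (c / total) * math.log(n_docs / df[t]) for t, c in counts.items()}
        )
    return vectors


# Hypothetical, already-tokenised toy tweets (not the paper's dataset).
tweets = [
    ["great", "app", "great", "charts"],
    ["app", "keeps", "crashing"],
    ["charts", "update", "today"],
]
docs = [ngrams(t, n_max=2) for t in tweets]  # unigrams plus bigrams
bto = [bto_vector(d) for d in docs]
tfidf = tfidf_vectors(docs)
```

BTO records only whether a term is present, while TF-IDF gives more weight to terms that are frequent in one tweet and rare across the collection. Either dictionary could then be fed to a classifier such as Naive Bayes or an SVM.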
| Original language | English |
|---|---|
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Print) | 9781467387439 |
| Publication status | Published - 2 May 2016 |
| Event | International Conference on Industrial Informatics and Computer Systems (CIICS), Sharjah. Duration: 13 Mar 2016 → 15 Mar 2016 |
Conference
| Conference | International Conference on Industrial Informatics and Computer Systems (CIICS) |
|---|---|
| City | Sharjah |
| Period | 13/03/16 → 15/03/16 |
Keywords
- Mubasher
- Pre-Processing
- Saudi Arabia
- data mining
- sentiment analysis
Title
Identifying Mubasher software products through sentiment analysis of Arabic tweets