Amazon Product Sentiment Analysis using RapidMiner

Authors

  • Nur Hasifah A Razak Faculty of Computer and Mathematical Sciences, Universiti Teknologi MARA Cawangan Kelantan, Bukit Ilmu, 18500 Machang, Kelantan, Malaysia

Abstract

Nowadays, online reviews from customers have created significance for any business especially when it comes to Amazon website. This research predicts the customer reviews based on three main categories; health and beauty, toys and games and electronics. The reviews are classified whether as positive, negative, or neutral. Sentiment Analysis is a data analysis concept in which a collection of reviews is considered, and those reviews are analyzed, processed, and recommended to the user. The dataset use in this research is collected from the Dataworld website. The research presented in this paper was carried out initially; the reviews must be pre- processed in order to remove the unwanted data before being converted from text to vector representation using a range of feature extraction techniques such as TF-IDF. After that, the dataset is classified using Naive Bayes, Decision Tree and Random Forest algorithms. The accuracy, precision and recall were implemented as performance measures in order to evaluate the performance sentiment classification for the given reviews. The result shows that Decision Tree is the best classifier with the highest accuracy for the health and beauty, and electronic categories. For the toys and games category, the best classifier with the highest accuracy is Random Forest.

Keywords:

Decision Tree, Naive Bayes, Random Forest, Sentiment Analysis

Downloads

Published

2022-12-31

How to Cite

Nur Hasifah A Razak. (2022). Amazon Product Sentiment Analysis using RapidMiner. Applied Mathematics and Computational Intelligence (AMCI), 11(2), 336–349. Retrieved from https://ejournal.unimap.edu.my/index.php/amci/article/view/123