The exponential growth of social media platforms has generated massive amounts of unstructured text data, offering valuable insights into public opinion, trends, and behaviors. Text mining, when integrated with big data analytics, provides a powerful approach to process, analyze, and extract meaningful information from social media comments. This review paper focuses on various text mining techniques—such as sentiment analysis, topic modeling, and opinion extraction—applied to large-scale social media datasets. The study explores the role of big data frameworks like Hadoop and Spark in handling high-volume, high-velocity data efficiently. Additionally, it highlights preprocessing methods including tokenization, stop-word removal, and feature selection, which are critical for improving model accuracy. The paper also discusses challenges such as data noise, language ambiguity, scalability, and privacy concerns. Overall, this review emphasizes how big data-driven text mining can support decision-making in domains such as marketing, politics, and public health by uncovering patterns and sentiments hidden within social media interactions.
100
75
43
100
75
43
100
75
43
Copyright © 2019, This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC-BY-NY-SA). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Corresponding Author:
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Conflict of interest: The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Or share your Opinion
The rapid growth of digital education has led to increased demand for scalable and flexible learning solutions. This...
Cryogenics is the branch of science that deals with the production, behavior, and application of materials at extremely...
The rapid advancement of the Internet of Things (IoT) has revolutionized traditional agricultural practices by enabling real-time monitoring,...
Medical image segmentation is a crucial step in computer-aided diagnosis, treatment planning, and biomedical image analysis. This paper...
Wireless Sensor Networks (WSNs) are widely used in environmental monitoring, military surveillance, healthcare, and smart cities. However, due...
The Virtual Mouse Assistant is an innovative human-computer interaction system designed to provide an intuitive and hands-free alternative...
The Virtual Mouse Assistant is an innovative human-computer interaction system designed to provide an intuitive and hands-free alternative...
An The central theme of this design is to reduce the space between the natural world and the...
One of the most common cancers in the world, thyroid cancer is becoming more common. Improving patient outcomes...
Comments(0)