Abstract
Urban opinion from crowdsourced data often leads to imbalanced datasets due to the diversity of issues related to urban social, economic, and environmental topics. This study presents a novel hybrid approach that combines Random Over-Sampling and Random Forest (ROS-RF) to effectively classify such imbalanced data. Using crowdsourced urban opinion data from Jakarta, experimental results show that the ROS-RF method outperforms other approaches. The ROS-RF classifier achieved an impressive F1-score, recall, precision, and accuracy of 98%. These findings highlight the superior effectiveness of the ROS-RF method in classifying urban opinions, especially those related to social, economic, and environmental issues in urban settings. This hybrid approach provides a robust solution for managing imbalanced datasets, ensuring more accurate and reliable classification outcomes. The study underscores the potential of ROS-RF in enhancing urban data analysis and decision-making processes
Published Version
Talk to us
Join us for a 30 min session where you can share your feedback and ask us any queries you have
Disclaimer: All third-party content on this website/platform is and will remain the property of their respective owners and is provided on "as is" basis without any warranties, express or implied. Use of third-party content does not indicate any affiliation, sponsorship with or endorsement by them. Any references to third-party content is to identify the corresponding services and shall be considered fair use under The CopyrightLaw.