Confident Learning on IndoBERT: Enhancing Sentiment Classification Performance
DOI:
https://doi.org/10.33022/ijcs.v13i5.4401Keywords:
IndoBERT, Confidentt Learning, Sentiment Analysis, Label QualityAbstract
In the rapidly evolving field of artificial intelligence (AI), label uncertainty in datasets has become a significant challenge threatening the sustainability of AI. This study investigates the enhancement of IndoBERT's performance in Indonesian sentiment analysis by integrating the Confident Learning (CL) method. IndoBERT, an adaptation of BERT for Indonesian, shows strong performance but is affected by label uncertainty. CL is applied to correct mislabeled data and improve model accuracy. The results indicate that IndoBERT + CL achieves an accuracy improvement from 85.15% to 86.03%, with enhancements in precision, recall, and F1 score to 87.93%, 85.00%, and 86.44%, respectively. The confusion matrix results also show that IndoBERT + CL is more accurate in identifying positive labels. This research highlights the importance of applying CL to enhance label quality and model performance in NLP sentiment analysis.
Downloads
Published
Issue
Section
License
Copyright (c) 2024 Daffa Al Akhdaan, Taufik Edy Sutanto, Muhaza Liebenlito

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.