Sentiment Analysis on Indonesia-English Code-Mixed Data

HILAL RAMADHAN UTOMO

Basic Information

20 times
23.04.2659
006.35
Scientific Work - Undergraduate Thesis (S1) - Reference

Social media users today often express their opinions in code-mixed language. As the number of social media users has risen exponentially in countries such as Indonesia, large volumes of code-mixed data have emerged, in which users combine more than one language in a single text. Code-mixed data is often noisy and, most importantly, monolingual models usually perform poorly on it. This poses a challenge for Natural Language Processing (NLP) in processing and analyzing such data. In this work, we conduct sentiment analysis experiments on English-Indonesian code-mixed data. Our approach uses a multilingual pre-trained model, mBERT. By analyzing the sentiment analysis models' predictions, we assess how effectively the model adapts to the implicit noise inherent in code-mixed data. We evaluated the classification model under different batch size and epoch settings to find the configuration with the highest accuracy. The experimental results show that the highest accuracy obtained by the mBERT model trained on our dataset was 76%, with a batch size of 16 and 7 epochs.
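
The search over batch size and epoch settings described in the abstract can be pictured roughly as follows. This is only a minimal sketch, not the thesis's actual code: the names grid_search and toy_score are hypothetical, the candidate grid values are illustrative assumptions (the abstract does not list the values tried), and toy_score returns an arbitrary synthetic number rather than a measured accuracy. In a real run, the evaluator would fine-tune mBERT on the code-mixed training split with the given settings and return accuracy on a held-out split.

```python
from itertools import product
from typing import Callable, Iterable, Optional, Tuple


def grid_search(
    train_and_evaluate: Callable[[int, int], float],
    batch_sizes: Iterable[int],
    epoch_counts: Iterable[int],
) -> Tuple[Optional[Tuple[int, int]], float]:
    """Try every (batch size, epochs) pair and return the best pair and its score."""
    best_config, best_score = None, float("-inf")
    for batch_size, epochs in product(batch_sizes, epoch_counts):
        score = train_and_evaluate(batch_size, epochs)
        # Strict comparison: on a tie, the first configuration tried is kept.
        if score > best_score:
            best_config, best_score = (batch_size, epochs), score
    return best_config, best_score


def toy_score(batch_size: int, epochs: int) -> float:
    """Hypothetical stand-in evaluator. The value is synthetic and arbitrary,
    NOT an accuracy from the thesis. It peaks at (32, 5) only so that the
    demo has a known answer."""
    return float(-abs(batch_size - 32) - abs(epochs - 5))


if __name__ == "__main__":
    # Illustrative grid only; the values tried in the thesis are not given here.
    config, score = grid_search(toy_score, [8, 16, 32], [3, 5, 7])
    print("best (batch size, epochs):", config)
```

Passing the evaluator in as a callable keeps the search loop independent of any particular training library, so the same loop would work whether the model is trained with a custom loop or a higher-level trainer.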

Subject

Natural language processing
Language - modern - study and teaching

Catalog

Sentiment Analysis on Indonesia-English Code-Mixed Data
 
 
Indonesia

Circulation

Rp. 0
Rp. 0
No

Author

HILAL RAMADHAN UTOMO
Individual
Ade Romadhony
 

Publisher

Universitas Telkom, S1 Informatika
Bandung
2023

Collection

Competency

  • CII4E4 - Final Project
