Informasi Umum

Kode

23.04.2524

Klasifikasi

006.35 - Natural Language Processing, Computer Science

Jenis

Karya Ilmiah - Skripsi (S1) - Reference

Subjek

Natural Language Processing, Data Analysis,

Dilihat

309 kali

Informasi Lainnya

Abstraksi

<p>In this work, we conduct sentiment analysis on Indonesian-Sundanese code-mixed tweets. Sundanese is one of Indonesia’s regional languages with over 42.000.000 speakers. We use a pre-trained language model, IndoBERT, to tackle the sentiment analysis task. Our evaluation result shows that the best accuracy is 81%. We analyze the errors and find that most mislabeled tweets are because the words on the wrongly predicted tweet contain many words from other labels. It is also possible that it happens since the sentence in the tweet is ambiguous, the words used in the tweet are unavailable in the training data set, or the use of abbreviated words in the tweet.</p>

  • CII4G3 - PEMROSESAN BAHASA ALAMI

Koleksi & Sirkulasi

Tersedia 1 dari total 1 Koleksi

Anda harus log in untuk mengakses flippingbook

Pengarang

Nama HAJAROT NAJIHA
Jenis Perorangan
Penyunting Ade Romadhony
Penerjemah

Penerbit

Nama Universitas Telkom, S1 Informatika
Kota Bandung
Tahun 2023

Sirkulasi

Harga sewa IDR 0,00
Denda harian IDR 0,00
Jenis Non-Sirkulasi