IMPROVING FINAL PREDICTION FROM INTERNAL ENSEMBLE LEARNING TRANSFORMER FOR OXFORD IIIT-PET DATASET - Dalam bentuk buku karya ilmiah

WESLI YEREMI VALENTINO RAMBI

Informasi Dasar

36 kali
25.05.670
000
Karya Ilmiah - Thesis (S2) - Reference

Research in image classification has attracted considerable interest, par ticularly in Fine-Grained Visual Classification (FGVC), specializing in the com plex task of differentiating objects and subtle variations within species, such as those observed in different animal types. Various techniques have been developed to tackle these challenges, including feature coding, part-based approaches, and attention-based approaches. Within these approaches, the Vision Transformer (ViT) has shown remarkable success in image recognition tasks. The Internal Ensemble Learning Transformer (IELT) builds on ViT as its foundation, achieving impressive results. To enhance the effectiveness of IELT, we propose an innovative approach that focuses on refining its feature representation. This is achieved by incorporating a softmax activation function and a Radial Basis Function (RBF) layer to enhance the final prediction accuracy. Experimental findings demonstrate that our proposed method significantly boosts accuracy on fine-grained datasets, such as Oxford-IIIT Pet, Surpassing current cutting-edge methods. Keywords: Fine-Grained Visual Classification, Vision Transformer, Radial Basis Function

Subjek

ARTIFICIAL INTELEGENCE
 

Katalog

IMPROVING FINAL PREDICTION FROM INTERNAL ENSEMBLE LEARNING TRANSFORMER FOR OXFORD IIIT-PET DATASET - Dalam bentuk buku karya ilmiah
 
xii, 25p.: il,; pdf file
Indonesia

Sirkulasi

Rp. 0
Rp. 0
Tidak

Pengarang

WESLI YEREMI VALENTINO RAMBI
Perorangan
Suryo Adhi Wibowo, Unang Sunarya
 

Penerbit

Universitas Telkom, S2 Teknik Elektro
Bandung
2025

Koleksi

Kompetensi

  • TEI6A3 - SISTEM CERDAS

Download / Flippingbook

 

Ulasan

Belum ada ulasan yang diberikan
anda harus sign-in untuk memberikan ulasan ke katalog ini