Pengenalan komponen imbuhan dan kata dasa pada isyarat kata berimbuhan dalam sibi (sistem isyarat bahasa indonesia dengan menggunakan probabilistic graphical models) = Recognition of affixes and root words in inflectional word gestures in sibi (sign language system for Indonesian language using probabilistic graphical models)

Erdefi Rakun, author

Pengenalan komponen imbuhan dan kata dasa pada isyarat kata berimbuhan dalam sibi (sistem isyarat bahasa indonesia dengan menggunakan probabilistic graphical models) = Recognition of affixes and root words in inflectional word gestures in sibi (sign language system for Indonesian language using probabilistic graphical models)

Erdefi Rakun; Aniati Murni Arymurthy, promotor; Lim Yohanes Stefanus, co-promotor; Denny, examiner; Harry Budi Santoso, examiner; Mirna Adriani, examiner; Mohamad Ivan Fanany, examiner; Agus Buono, examiner ([Publisher not identified] , 2016)

Abstrak

ABSTRAK

SIBI merupakan bahasa isyarat resmi bagi penyandang tunarungu di Indonesia. Dalam pembentukan isyarat, SIBI mengikuti aturan tata bahasa Indonesia. Untuk membentuk isyarat kata berimbuhan, maka isyarat imbuhan awalan, akhiran dan partikel ditambahkan ke isyarat kata dasar. Karena banyak isyarat SIBI merupakan isyarat kata berimbuhan dan belum ada penelitian tentang kata tersebut, maka penelitian ini fokus pada membangun sistem penerjemah kata berimbuhan SIBI ke teks. Gerakan isyarat ditangkap oleh kamera Kinect yang menghasilkan data color, depth dan skeleton. Data Kinect ini diolah menjadi fitur yang dipakai oleh model untuk mengenali gerakan. Sistem penerjemah memerlukan teknik ekstraksi fitur, yang dapat menghasilkan sebuah feature vector set dengan ukuran yang minimal. Penelitian ini berusaha untuk dapat memisahkan isyarat imbuhan dan kata dasar pada isyarat kata berimbuhan. Dengan kemampuan ini, sistem penerjemah menghasilkan 3 feature vector set: kata dasar, awalan dan akhiran. Tanpa pemisahan, feature vector set yang harus disediakan adalah sebanyak perkalian cartesian dari ketiga feature vector set tersebut. Perkalian ketiga set ini tentunya akan menghasilkan feature vector set total yang berukuran sangat besar. Model yang dicoba pada penelitian ini adalah Conditional Random Fields, Hidden Markov Model, Long Short-Term Memory Neural Networks LSTM dan Gated Recurrent Unit. Akurasi yang terbaik yang dicapai oleh untuk LSTM 2-layer 77.04 . Keunggulan dari LSTM terletak pada inputnya yang berupa sequence-of-frames dan setiap frame direpresentasi oleh fitur lengkap, bukan fitur hasil clustering. Model sequence-of-frames lebih cocok untuk SIBI, karena gerakan isyarat SIBI memiliki long-term temporal dependencies. Error hasil prediksi banyak terjadi pada kelompok awalan dan akhiran. Hal ini karena miripnya gerakan pada isyarat-isyarat imbuhan SIBI tersebut. LSTM 2-layer yang dipakai untuk mengenali kata dasar saja memberikan akurasi yang tertinggi 95.4 .

ABSTRACT

SIBI is the official sign language system for the Indonesian language. The formation of SIBI gestures follow Indonesian grammar rules, including inflectional words. Inflectional words are root words with prefixes, infixes, and suffixes, or a mix of the three. Inflectional gestures are made from root word gestures, with prefix, suffix and particle gestures added in the order in which they appear, all of which is unique to SIBI. This research aims to find a suitable model that can quickly and reliably perform SIBI to text translation on inflectional word gestures. The hand movement of the signer is captured by a Kinect camera. The Kinect data was then processed to yield features for the models to use recognize the gestures. Extant research have been able to translate the alphabet, root words, and numbers from SIBI to text, but none has been able to translate SIBI inflectional word gestures. In order for the translation system to work as efficiently as possible, this research developed a new method that splits an inflectional word into three feature vector sets root, prefix, suffix . This ensures that a minimally descriptive feature sets are used. Without using this, the feature sets would otherwise be as big as the Cartesian product of the prefixes, suffixes and root words feature sets of the inflectional word gestures. Four types of machine learning models were tested Conditional Random Fields, Hidden Markov Model, Long Short Term Memory Net, dan Gated Recurrent Unit. The 2 layer LSTM, with an accuracy of 77.04 , has been proven to be the most suitable. This model 39 s performance is due to the fact that it can take entire sequences as input and doesn 39 t rely on pre clustered per frame data. The 2 layer LSTM performed the best, being 95.4 accurate with root words. The lower accuracy with inflectional words is due to difficulties in recognizing prefix and suffix gestures.

File Digital: 1

Shelf

D2244-Erdefi Rakun.pdf :: Unduh

LOGIN required

Kata Kunci

conditional random fields

hidden markov model

gated recurrent unit

long short-term memory

sibi

Metadata

Jenis Koleksi :	UI - Disertasi Membership
No. Panggil :	D2244
Entri utama-Nama orang :	Erdefi Rakun, author


Entri tambahan-Nama orang :	Aniati Murni Arymurthy, promotor Lim Yohanes Stefanus, co-promotor Denny, examiner Harry Budi Santoso, examiner Mirna Adriani, examiner Mohamad Ivan Fanany, examiner Agus Buono, examiner
Entri tambahan-Nama badan :	Universitas Indonesia. Fakultas Ilmu Komputer

Program Studi :	Ilmu Komputer
Subjek :	Sign Language Probabilistic databases
Penerbitan :	[Place of publication not identified]: [Publisher not identified], 2016

Bahasa :	ind
Sumber Pengatalogan :	LibUI ind rda
Tipe Konten :	text
Tipe Media :	unmediated ; computer
Tipe Carrier :	volume ; online resource
Deskripsi Fisik :	xviii, 107 pages : illustration ; 28 cm + appendix
Naskah Ringkas :
Lembaga Pemilik :	Universitas Indonesia
Lokasi :	Perpustakaan UI, Lantai 3

Ketersediaan
Ulasan
Sampul

No. Panggil	No. Barkod	Ketersediaan
D2244	07-19-225825794	TERSEDIA

Ulasan:

Tidak ada ulasan pada koleksi ini: 20446734