Dynamic and incremental exploration strategy in fusion adaptive resonance theory for online reinforcement learning./

Budhitama Subagdja, author

Dynamic and incremental exploration strategy in fusion adaptive resonance theory for online reinforcement learning./

Budhitama Subagdja; (Nanyang Technological University, Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly, 2016)

Abstrak

One of the fundamental challenges in reinforcement learning is to setup a proper balance between ex-ploration and exploitation to obtain the maximum cummulative reward in the long run. Most proto-cols for exploration bound the overall values to a convergent level of performance. If new knowledge is inserted or the environment is suddenly changed, the issue becomes more intricate as the explo-ration must compromise the pre-existing knowledge. This paper presents a type of multi-channel adaptive resonance theory (ART) neural network model called fusion ART which serves as a fuzzy approximator for reinforcement learning with inherent features that can regulate the exploration stra-tegy. This intrinsic regulation is driven by the condition of the knowledge learnt so far by the agent. The model offers a stable but incremental reinforcement learning that can involve prior rules as boot-strap knowledge for guiding the agent to select the right action. Experiments in obstacle avoidance and navigation tasks demonstrate that in the configuration of learning wherein the agent learns from scratch, the inherent exploration model in fusion ART model is comparable to the basic E-greedy po-licy. On the other hand, the model is demonstrated to deal with prior knowledge and strike a balance between exploration and exploitation.

Salah satu permasalahan mendasar dari Reinforcement Learning adalah mengatur keseimbangan anta-ra eksplorasi dan eksploitasi untuk mendapatkan ganjaran (reward) maksimal secara kumulatif dalam jangka waktu yang lama. Ketika pengetahuan awal diikutsertakan, masalah muncul karena eksplorasi yang dilakukan harus dikompromikan dengan pengetahuan sebelumnya yang telah dipelajari. Maka-lah ini menampilkan salah satu jenis jaringan saraf tiruan adaptive resonance theory (ART) berkanal ganda yang dikenal juga dengan sebutan fusion ART yang juga merupakan aproksimator Fuzzy untuk reinforcement learning dengan kemampuan meregulasi strategi eksplorasi sebagai sifat dasarnya. Mo-del ini menawarkan proses pembelajaran yang stabil tetapi inkremental serta mampu melibatkan pe-ngetahuan awal yang memilih aksi yang benar. Eksperimen menggunakan navigasi dan menghindari rintangan sebagai domain masalah menunjukkan bahwa konfigurasi pembelajaran menggunakan sifat dasar untuk meregulasi eksplorasi sebanding dengan metoda umum yang menggunakan aturan E-greedy. Di lain pihak, model yang diusulkan ini juga menunjukkan kemampuan dalam menggunakan pengetahuan awal serta mencapai keseimbangan dalam eksplorasi dan eksploitasi pengetahuan

Kata Kunci

Reinforcement learning

exploration strategy

adaptive resonance theory

Metadata

No. Panggil :	PDF
Entri utama-Nama orang :	Budhitama Subagdja, author





Subjek :	Strategy in fusion
Penerbitan :	[Place of publication not identified]: Nanyang Technological University, Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly, 2016

Sumber Pengatalogan :	LibUI eng rda
ISSN :	25029274
Majalah/Jurnal :	Jurnal Ilmu Komputer dan Informamsi
Volume :	Vol. 9, No. 2 2016: Hal. 59-69
Tipe Konten :	text
Tipe Media :	computer
Tipe Carrier :	online resource
Akses Elektronik :	http://jiki.cs.ui.ac.id/index.php/jiki/article/view/380
Institusi Pemilik :	Universitas Indonesia
Lokasi :

Ketersediaan
Ulasan

No. Panggil	No. Barkod	Ketersediaan
PDF		TERSEDIA

Ulasan:

Tidak ada ulasan pada koleksi ini: 20447868

:: Artikel Jurnal :: Kembali

Artikel Jurnal :: Kembali

Dynamic and incremental exploration strategy in fusion adaptive resonance theory for online reinforcement learning./

Abstrak

Kata Kunci

Metadata