Hasil Pencarian

Ditemukan 13464 dokumen yang sesuai dengan query

Peter Pangestu

Histogram equalization implementation in the preprocessing phase on optical character recognition

"A 2014 report from Digital Marketing Philippines stated that the number of web applications with visual content as their main product has increased significantly. Image processing technology has also undergone significant growth. One example of this is optical character recognition (OCR), which can convert the text on an image to plain text. However, a problem occurs when the image has low contrast and low exposure, which potentially results in information being hidden in the image. To address this problem, histogram equalization is used to enhance the image’s contrast so the hidden information can be shown. Similar to X-ray scanning used in the medical field, histogram equalization processes scanned images that have low brightness and low contrast. In this study, histogram equalization was successfully implemented using OCR preprocessing. The test was done with a dataset that contains dark background images with low light text; the successful outcome resulted in the ability to show 74.95% of the information hidden in the image."

Depok: Faculty of Engineering, Universitas Indonesia, 2017

UI-IJTECH 8:5 (2017)

Artikel Jurnal Universitas Indonesia Library

Puja Romulus

Analisis implementasi optical character recognition pada aksara batak dengan menggunakan prinsip K-nearest neighbour = Analysis of optical character recognition implementation for batak characters using K-nearest neighbour / Puja Romulus

"ABSTRAK

Skripsi ini bertujuan untuk mendukung pemeliharaan aset budaya bangsa, terkhusus dalam hal sistem penulisan atau aksara kuno. Implementasinya akan membahas aspek teknologi yaitu pengolahan citra. Pada penelitian kali ini objek yang dikhususkan adalah aksara kuno dari suku Batak. Implementasi dari ide ini akan berbentuk program yang dapat mendeteksi karakter-karakter pada citra dari sebuah dokumen aksara Batak yang bebas dari noise. Program akan memproses citra dari tahapan segmentasi, preprocessing, ekstraksi fitur hingga tahapan klasifikasi. Secara khusus pada ekstraksi fitur dan juga klasifikasi akan ada dua metode yang digunakan yaitu Geometric Moment Invariant dan juga K-Nearest Neighbor. Hasil dari uji coba terdiri dari dua yaitu akurasi atau ketepatan pembacaan, dan juga waktu pemrosesan. Jangkauan hasil pada akurasi berada pada 42% - 96% sementara waktu pemrosesan berada pada 1.9 – 34 detik.

ABSTRACT

This undergraduate thesis is intended to support the preservation of national cultural asset, especially for the ancient characters. The implentation uses technological approach in image processing field. The researched object for this thesis is Batak ancient character. The implementation of the idea will result an application program that will detect the characters in a sample image of a Batak’s document which is still free from any noise. The application program will process the image through several phases. The phases are segmentation, preprocessing, feature extraction, and classification. There is a special method used in each feature extraction and classification. Feature extraction uses Geometric Moment Invariant whereas classification phases uses K-Nearest Neighbour. There will be two results for this test, the first is accuration of the detection and second is the procesing time. The range for the accuration is 42% - 96% and the processing time ranged from 1.9 – 34 seconds."

Fakultas Teknik Universitas Indonesia, 2014

S56323

UI - Skripsi Membership Universitas Indonesia Library

Aplikasi optical character recognition pada sistem pendataan pengunjung perpustakaan

"Telah dibuat sistem pendataan pengunjung perpustakaan dengan cara membaca Nomor Pokok Mahasiswa (NPM) yang tertera pada image kartu mahasiswa (KTM). Penangkapan image KTM dilakukan dengan menggunakan webcam Prolink PCC 900. Sistem ini meliputi modul image processing untuk mengekstraksi NPM dari image KTM dan mengsegmentasi untuk setiap karakternya; dan modul artificial neural network untuk pengenalan karakter. Modul image processing melibatkan profil proyeksi, operator dilatasi, graying, adaptive thresholding, dan filter median. Neural network feed-forward dua layer digunakan untuk mengenali pola piksel satu dan nol pada image karakter yang telah disegmentasi dari image NPM. Network ini dilatih dengan menggunakan metode dasar back-propagation. Beberapa ukuran network diuji dan kemudian dipilih salah satu yang memberikan nilai error terkecil pada sampel pengujian."

Universitas Indonesia, 2006

S28855

UI - Skripsi Membership Universitas Indonesia Library

Parluhutan, Matthew Tumbur

Ekstraksi Key Moments Otomatis pada Video Perkuliahan di Fasilkom UI Menggunakan Optical Character Recognition dan Topic Modelling = Automatic Key Moments Extraction on Lecture Videos in Fasilkom UI with Optical Character Recognition and Topic Modelling

"Pandemi COVID-19 mengubah pola kehidupan manusia, termasuk sistem perkuliahan yang berubah ke metode daring. Video perkuliahan dengan salindia menjadi salah satu pilihan sarana penyampaian materi kuliah secara daring. Penelitian ini bermaksud menguji keabsahan rancangan sistem yang mampu melakukan segmentasi temporal sesuai topik secara otomatis pada video perkuliahan. Sistem yang diajukan dibagi menjadi tiga sub-sistem yang memanfaatkan teknologi keyframe extraction, optical character recognition (OCR), dan topic modelling. Pertama, video perkuliahan akan diubah menjadi kumpulan keyframe dengan memanfaatkan metode Slide Detector yang dimodifikasi. Selanjutnya, akan dilakukan ekstraksi teks dari frame-frame tersebut menggunakan Tesseract OCR dengan preprocessing tambahan. Akhirnya, BERTopic dengan beragam algoritma clustering dan LDA diuji kemampuannya dalam topic modelling yang berguna untuk mengambil topik yang koheren dari teks tersebut. Penelitian pada tahap keyframe extraction menunjukkan bahwa terdapat peningkatan recall sebesar 0,235-025 dari 0 dan precision sebesar 0,619-0,75 dari 0 pada beberapa video pada Slide Detector termodifikasi. Sebaliknya, penelitian pada tahap OCR menunjukkan bahwa tambahan preprocessing belum bisa membantu meningkatkan performa Tesseract OCR. Pada tahap terakhir, ditemukan bahwa BERTopic lebih unggul daripada LDA dalam menarik topik yang koheren untuk use case penelitian ini. Agglomerative dan KMeans clustering ditemukan lebih optimal untuk kasus video perkuliahan jika dibandingkan dengan metode density-based. Augmentasi data dengan takaran yang sesuai diperlukan untuk mendapatkan hasil sedemikian rupa pada tahap ini. Secara umum, sistem dengan tiga bagian yang diusulkan pada penelitian ini sudah mampu melakukan segmentasi video perkuliahan sesuai tujuan, namun, video perkuliahan bersalindia merupakan dataset yang sangat heterogen dan merancang sebuah sistem yang mampu memanfaatkan dataset tersebut adalah tantangan tersendiri.

The COVID-19 pandemic changed the lifestyle of many people, including university lectures that moved to online delivery. Lecture videos with slides became an option to deliver lecture materials online. This work attempts to show a proof of concept for a system design that is able to automatically segment a lecture video temporally based on the topic. The proposed system is divided into three subsystems that make use of keyframe extraction, optical character recognition (OCR), and topic modelling techniques. First, a lecture video will be converted to a collection of keyframes using a modified Slide Detector technique. Next, those frames will be processed using Tesseract OCR with some additional preprocessing steps to extract text. Lastly, BERTopic with various clustering techniques and LDA will be used for topic modelling to obtain a coherent topic from the text extracted earlier. The research in the keyframe extraction step shows that there is an increase of 0.235-0,5 points from 0 for recall and 0,619-0,75 points from 0 for precision for certain videos using the modified Slide Detector. On the other hand, the research in the OCR step shows that the additional preprocessing is not yet able to help increase the performance of Tesseract OCR. At the last step, BERTopic proves to be better than LDA to obtain the coherent topic for this system's use case. Agglomerative and KMeans clustering is better for lecture videos compared to density-based methods. Appropriate amounts of data augmentation is needed to obtain the best results at this step. Overall, the three-part system in this research is able to segment lecture videos as intended, however, lecture videos with slides is a dataset that is very heterogeneous and designing a system to handle all types of videos is a large challenge."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2023

TA-pdf

UI - Tugas Akhir Universitas Indonesia Library