Hasil Pencarian

Ditemukan 103664 dokumen yang sesuai dengan query

Jason Andreas Sudana

Implementasi Image Loop Control Menggunakan YOLOv8 pada Quadrotor Dji Tello = Implementation of Image Loop Control Using YOLOv8 on Dji Tello Quadrotor

Pengembangan algoritma untuk kendali quadrotor semakin masif dilakukan oleh peneliti diseluruh dunia. Sama seperti manusia yang melihat dan kemudian dapat mendekati dan menyentuh suatu obyek, penelitian ini juga diarahkan untuk menciptakan prinsip yang sama yang kami sebut sebagai Image Loop Control (ILC). Proses pendeteksian objek memanfaatkan kecerdasan buatan YOLOv8 (AI deep learning) sebagai state-of-the-art pada dunia pendeteksian objek kecil membawa performa pendeteksian objek kecil ke tingkat yang lebih tinggi dengan inovasinya yang revolusioner. Penerapannya di quadrotor diharapkan dapat memungkinkan tingkat otonomi pada otomasi quadrotor melalui image loop control tersebut. Di dalam ILC tetap digunakan kendali Proporsional dan Differensial (PD) untuk mengendalikan gerak pada tiap sumbu gerakan. Skripsi ini melaporkan gerak yaw yang dilakukan oleh quadrotor sebagai respon dari deteksi obyek oleh YOLOv8. Pada proses validasi hasil pelatihan dataset, sebesar 96% gambar pintu tertutup terdeteksi sebagai close, 94% gambar pintu terbuka terdeteksi sebagai open, dan 87% gambar pintu setengah terbuka terdeteksi sebagai semi. Hasil proses image loop control respon kontroler PD di sumbu yaw, memiliki rata-rata time delay sebesar 0,98 detik, rata-rata rise time sebesar 1,26 detik, dan rata-rata settling time sebesar 8,62 detik menggunakan nilai Kp = 1,2 dan Kd = 0,5.

The development of quadrotor control algorithm has been extensively pursued by numerous researchers around the world. Similar to how humans can look, move around, and interact with an object, this research aims to achieve the same through a principle we define as the Image Loop Control (ILC). The process of object detection using the artificial intelligence YOLOv8 (deep learning AI) as the state-of-the-art in the small object detection world has brought the performance of small object detection algorithms to a higher level thanks to its revolutionary innovation. Its implementation in a quadrotor may enhance the degree of autonomy on automated quadrotors by using an image loop control. Within the ILC framework, we use a Proportional and Differential (PD) controller to control quadrotor movements along each axis. This thesis presents the performance of yawing movements executed by the quadrotor in response to object detections identified by the YOLOv8. During the validation process of the trained dataset, the system detected 96% of closed doors accurately, 94% of open doors accurately, and 87% of semi opened doors accurately. The response of the image loop control response using a PD controller on the yaw axis resulted in an average time delay of 0.98 seconds, average rise time of 1.26 seconds, and average settling time of 8.62 seconds with the values Kp = 1.2 and Kd = 0.5."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Research developments in computer vision and image processing : methodologies and applications

""This book brings together various research methodologies and trends in emerging areas of application of computer vision and image processing for those interested in the research developments of this rapidly growing field"--"

Hershey, PA: Information Science Reference, 2014

006.6 SRI r (1)

Buku Teks SO Universitas Indonesia Library

Edison Kurniawan

Optimalisasi visual pada programable translator seri 26700 dengan menggunakan piranti lunak lab-view

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2000

S28563

UI - Skripsi Membership Universitas Indonesia Library

Aldy Raja

Sistem Pengenal Aksi Spatio-Temporal pada Video Multi-Objek dengan Dynamic Positioning Aerial Camera = Spatio-Temporal Action Recognition System on Multi-Object Video with Dynamic Positioning Aerial Camera

"Klasifikasi aksi multi-objek berdasarkan video RGB aerial merupakan tantangan kompleks yang dapat berguna untuk pengembangan sistem keamanan. Terdapat dua pendekatan jaringan saraf tiruan yang umum digunakan dalam sistem pengenal berbasis kerangka, Convolutional Neural Network (CNN) dan Graph Convolutional Network (GCN). Pendekatan CNN lebih efektif dalam mempelajari fitur spatio-temporal, lebih kuat terhadap noise dalam estimasi pose, dan dapat menangani skenario multi-objek dengan komputasi yang lebih ringan. Penelitian ini meliputi pengembangan pengenal aksi manusia dengan pendeteksi spatio-temporal berbasis kerangka menggunakan pendekatan 3D Convolutional Neural Network (3D-CNN). Pendeteksi spatio-temporal memungkinkan sistem untuk mengenali tiap-tiap aksi yang simultan dilakukan oleh multi-objek dalam satu rekaman video. Percobaan dilakukan menggunakan sejumlah pre-trained dataset dan menggunakan dataset video RGB aerial primer yang dilatih terhadap model pengenal aksi berbasis video frontal, dengan menerapkan metode transfer learning. Proses tranfer learning dilakukan dengan dataset khusus untuk menghasilkan model pelatihan yang memiliki akurasi tinggi. Pelatihan memberi keluaran berupa model jaringan saraf tiruan dengan nilai akurasinya. Pengujian dilakukan menggunakan data video untuk mengetahui ketepatan model. Dari model yang diperoleh, akan dilakukan analisis terhadap keberhasilan dan keakuratan metode dalam mengenali aksi manusia.

Multi-object action recognition based on aerial RGB video is a complex challenge that can be useful for security system development. There are two commonly used artificial neural network approaches in skeleton-based recognition systems, Convolutional Neural Network (CNN) and Graph Convolutional Network (GCN). CNN approach is more effective in learning spatio-temporal features, more robust to noise in pose estimation, and can handle multi-object scenarios with lighter computation. This research involves developing a human action recognition with skeleton-based spatio-temporal detection using a 3D Convolutional Neural Network (3D-CNN) approach. Spatio-temporal detection allows the system to recognize each simultaneous action performed by multiple objects in a single video footage. Experiments were conducted using a number of pre-trained datasets and using a primary aerial RGB video dataset trained on a frontal video-based action recognition model, by applying the transfer learning method. The transfer learning process is performed with a specific dataset to produce a high-accuracy training model. The training outputs an artificial neural network model with its accuracy value. Testing is done using video data to determine the accuracy of the model. From the model obtained, the success and accuracy of the method in recognizing human actions will be analyzed."

Depok: Fakultas Teknik Universitas Indonesia, 2024

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Low, Adrian, 1956-

Introduction computer vision and image processing

London: McGraw-Hill, 1991

621.399 LOW i

Buku Teks Universitas Indonesia Library

Ramandika Garindra Putra

Pengembangan Teknologi Computer Vision Berbasis Image Processing pada Robotic Arm Manipulator Untuk Pemantauan Keausan pada Mata Pahat Micromilling = Initial Development of Image Processing Based Computer Vision Technology on Robotic Arm Manipulator for Tool Wear Monitoring on Micromilling

"ABSTRAK

Proses pemantauan tool wear pada micromilling membutuhkan ketelitian yang tinggi, proses ini dapat dilakukan menggunakan mikroskop digital Dino-Lite dan mikroskop elektron. Namun penggunaan mikroskop elektron membutuhkan waktu yang lama karena harus datang ke laboratorium mikroskop elektron. Maka dari itu dibuatlah perancangan teknologi computer vision berbasis image processing untuk mendeteksi luas wear pada tool micromilling. Proses pengembangan menggunakan fitur OpenCV program Python. Proses dimulai dengan mengambil gambar tool baru dan tool rusak menggunakan Dino-Lite dengan spesifikasi gambar yang sama persis. Kemudian gambar diimpor ke program Python dan dikonversi menjadi bentuk HSV (Hue, Saturation, Value). Gambar HSV kemudian diberikan fitur noise reduction menggunakan gaussian blur untuk mengurangi noise pada gambar. Gambar HSV yang sudah diberikan fitur noise reduction kemudian diberi fitur color detection untuk mendapatkan thresholding dari hasil pengaturan variabel masking HSV. Hasil thresholding kemudian diberikan fitur image Canny sebagai fitur pendeteksian luas berdasarkan kontur gambar hasil thresholding. Kemudian nilai luas permukaan tool baru dan tool rusak akan muncul. Kedua nilai ini akan dibandingkan dan menghasilkan persentase tool wear. Pengujian yang penulis lakukan adalah dengan membuat variasi variabel noise reduction menggunakan gaussian blur, nilai gaussian blur yang diberikan sebesar 0, 1, 3, 5, 7, 9, 11, 13, 15 dan 17 (nilai gaussian blur hanya bisa 0 dan bilangan ganjil). Data yang diperoleh ada yang tidak lengkap karena keadaan gambar yang tidak mendukung, namun dengan keberadaan gaussian blur, dapat membantu perekaman luas. Hasil menunjukkan bahwa semakin tinggi nilai gaussian blur, maka meningkatkan potensi gambar kontur tool dapat dideteksi luasnya.

ABSTRACT
Tool wear monitoring on micromilling needs a high value of accuracy, this process can be done using Dino-Lite digital microscope and electron microscope. However, the usage of electron microscope needs a long period of time since we have to go to the electron microscope laboratory. Therefore, the design of image processing-based computer vision for tool wear monitoring on micromilling was developed. The development process uses OpenCV feature on Python. The process begins with gathering the images of the new tool and the broken tool using Dino-Lite with exactly the same image properties. The images are then imported to Python and converted to HSV format (Hue, Saturation, Value). The HSV images are then given a noise reduction feature using Gaussian Blur to reduce the noise of the images. The HSV images that have been given the noise reduction feature are then given a color detection feature to obtain thresholding of the results of the HSV masking variable adjustment. The thresholding results are the given an image Canny feature as the contour area detection from the thresholding results. Afterwards, the face area value of the new tool and the broken tool will be displayed. These two values will be compared and generate the tool wear percentage. The experiment that the authors has done is to make variations in noise reduction variables using gaussian blur, the given gaussian blur values are 0, 1, 3, 5, 7, 9, 11, 13, 15 dan 17 (only 0 and odd numbers of the gaussian blur value can only be added). The data that be obtained are not complete due to the unsupported image condition, however, in the presence of the gaussian blur, could support the documentation process. At the end, the results show that the tool area on the images are more potential to be detected due to the increasing number of gaussian blur value."

Depok: Fakultas Teknik Universitas Indonesia, 2020

S-pdf

UI - Skripsi Membership Universitas Indonesia Library

Cari yang mirip

Tambahkan ke Favorit

Metadata PDF

Abstrak PDF

Abstrak

Galbiati, Louis J.

Machine vision and digital image processing fundamentals / Louis J. Galbiati

Englewood Cliffs, New Jersey: Prentice-Hall, 1990

621.367 GAL m

Buku Teks SO  Universitas Indonesia Library

Cari yang mirip

Tambahkan ke Favorit

Metadata PDF

Moeslund, Thomas B.

Introduction to video and image processing: building real systems and applications

London: Springer-Verlag, 2012

e20407902

eBooks  Universitas Indonesia Library

Cari yang mirip

Tambahkan ke Favorit

Metadata PDF

Ferdi Fadillah

Restorasi Citra Dengan Tetesan Air Hujan Menggunakan Zero-Shot Denoising Diffusion Null Model = Raindrop Removal with Zero-Shot Denoising Diffusion Null Model

"Dalam pengambilan citra, dapat terjadi penurunan kualitas akibat kondisi lingkungan sekitar. Salah satu kondisi penyebab penurunan kualitas tersebut adalah kondisi hujan, yang menyebabkan citra dengan tetesan air hujan. Dewasa ini, pendekatan deep learning dapat menjadi solusi, dengan banyaknya model yang mampu melakukan restorasi citra dengan tetesan air hujan (raindrop removal). Akan tetapi, banyak model yang hanya mampu menyelesaikan kasus spesifik dan tergantung pada data melalui metode supervised. Sebagai alternatif, terdapat model yang berpotensi dalam kasus ini adalah Zero-Shot Denoising Diffusion Null Model. Zero-Shot Denoising Diffusion Null Model adalah model yang bisa menyelesaikan kasus umum dengan tetap berperforma baik dengan pendekatan zero-shot, yaitu tanpa optimisasi dan pelatihan data. Sayangnya, sejauh ini Zero-Shot DDNM masih terbatas pada masalah linier, sementara itu masalah raindrop removal adalah masalah non-linier. Penelitian ini bertujuan untuk mengukur kemampuan metode Zero-Shot Denoising Diffusion Null Model dalam menyelesaikan masalah non-linier seperti raindrop removal. Untuk membantu memodelkan masalah raindrop removal ini pada Zero-Shot Denoising Diffusion Null Model, dibutuhkan input tambahan berupa raindrop mask. Pada penelitian ini, raindrop mask diperoleh menggunakan Attentive Generative Adversarial Network kemudian dilakukan thresholding dengan nilai 0.5 yang digunakan bersama dengan operator degradasi untuk melakukan deblurring with mask untuk mencari hasil restorasi terbaik. Data citra yang digunakan dalam penelitian ini adalah dataset citra dengan tetesan air hujan di luar ruang. Selain itu diterapkan juga beberapa strategi tambahan yaitu time-travel trick. Hasil restorasi citra dengan tetesan air hujan menunjukkan hasil paling baik dalam metrik Structural Similarity Index Measure yaitu simplified denoising with mask dengan time-travel trick, sedangkan hasil paling baik dalam metrik Peak Signal-to-Noise Ratio yaitu menggunakan simplified denoising with mask.
When taking an image, there can be a decrease in quality due to the surrounding conditions. One of the causes of this decrease in quality is rain, resulting in raindrops in the image. These days, there are many deep learning approached to solve raindrop removal. However, many of the models that perform well for raindrop removal are only able to solve specific cases based on the available data through supervised methods. A potential model capable of more general cases is the Zero-Shot Denoising Diffusion Null Model. Zero-Shot Denoising Diffusion Null Model has the capacity to solve common cases while still performing well in a zero-shot manner, that is, without data optimization and training. However, the model is limited to linear problems, while raindrop removal is a non-linear problem. This study aims to measure the ability of the zero-shot method in solving the non-linear problems of raindrop removal. To help model the raindrop removal task, it is necessary to generate raindrop masks that indicate the location of raindrops in the image. In this research, the raindrop masks are generated using Attentive Generative Adversarial Network then are thresholded with 0.5 value that are used with an adjusted degradation operator for deblurring with masks to find the best restoration results. Additionally, some strategies are implemented such as time-travel trick. The image data used in this study was an image dataset with raindrops outside the room. The result of raindrop removal shows the best results in the Structural Similarity Index Measure metric is simplified denoising with mask and time-travel trick, while the best result in the Peak Signal-to-Noise Ratio metric is using simplified denoising with mask."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership  Universitas Indonesia Library

Cari yang mirip

Tambahkan ke Favorit

Metadata PDF

Abstrak PDF

Abstrak

Zaky Nuryasin

Perancangan Sistem Penyensoran Rokok Pada Video Real Time Menggunakan Algoritma YOLOv8 = Development of Cigarette Censorship in Realtime Video Using YOLOv8 Algorithm

"Kecerdasan buatan (artificial intelligence, AI) merupakan teknologi yang sedang berkembang dengan cepat pada masa ini. Adanya teknologi AI membuat banyak permasalahan sederhana dan kompleks dapat diatasi dengan program komputer. Salah satu penerapan dari teknologi AI yang memiliki perkembangan yang besar adalah pada computer vision, yang mana dapat dibuat program yang dapat mendeteksi dan mengklasifikasi objek pada suatu gambar. Pada bidang ini, computer vision dapat digunakan untuk mendeteksi rokok. Algoritma dapat dibuat untuk mengetahui jika ada objek rokok dan lokasi dari rokok tersebut pada gambar. Hal ini dapat berguna untuk menyensor rokok pada media video yang dikonsumsi oleh anak-anak. Pada media video, biasanya sensor dilakukan dengan cara manual dan dengan bantuan tracking. Cara ini dapat melelahkan karena walaupun dengan tracking, harus ada orang sebagai pendeteksi yang menunjukkan lokasi objek rokok secara berkala. Terdapat banyak arsitektur dan model algoritma untuk deteksi objek, salah satunya adalah YOLOv8 (You Only Look Once version 8). YOLOv8 adalah versi terbaru dari algoritma YOLO, yang mana merupakan salah satu algoritma state-of-the-art dalam deteksi objek. YOLO merupakan model dari Convolutional Neural Network (CNN) yang melakukan deteksi dengan konsep single stage detector, yaitu algoritma ini melakukan deteksi objek dengan menggunakan keseluruhan gambar sekaligus untuk menjadi masukan input neural network-nya. Cara ini membuat YOLO memiliki tingkat kecepatan yang tinggi mendekati real-time. Selain deteksi objek, diterapkan juga algoritma tracking yang berfungsi untuk menandai pergerakan objek rokok pada video. Sehingga objek rokok akan tetap disensor walaupun terjadi perubahan cahaya, terhalang objek lain, dan gangguan visual lainnya pada video. Algoritma tracking yang digunakan pada penelitian ini adalah ByteTrack. ByteTrack adalah algoritma tracking yang menggunakan komputasi yang minim karena dapat melakukan tracking dengan hanya memproses lokasi bounding box tiap frame video. Perbedaan algoritma ini dibandingkan yang lain adalah ByteTrack akan memanfaatkan semua hasil deteksi objek walaupun terdapat nilai confidence yang kecil. Pada penelitian ini didapatkan model training terbaik dari YOLOv8 dengan nilai presisi sebesar 86,5%, nilai recall sebesar 86,1%, nilai mAP 50 sebesar 88,1%, dan nilai mAP 50:95 sebesar 58,3%. Lalu pada konfigurasi confidence ByteTrack didapatkan hasil terbaik dengan pada confidence tahap pertama sebesar 0,247 dan tahap kedua sebesar 0,01. Hasil tracking ini mendapatkan nilai presisi sebesar 62,3%, nilai recall sebesar 62,7%, nilai akurasi sebesar 45,5%, dan nilai F1 sebesar 62,5%.
Artificial intelligence (AI) is a technology that is developing rapidly and popular in this era. AI technology creates the possibility to solve and overcome many simple complex problems. One example of the application of AI technology that has great development is computer vision, which is a concept that can make a computer program to detect and classify objects in an image. Using computer vision, this technology can be used to detect cigarette. From image or video media, the algorithm can check if there is any cigarette and then locate the object in the image. This is useful to censor cigarette from media that consumed by children. On video medium, censorship usually done manually with the help of object tracking. This method can be tiring because even if object tracking is used, there must be a person as a detector that locate the cigarette every few frames. There are many architectures and models for object detection, YOLOv8 (You Only Look Once version 8) is one of them. YOLOv8 is the latest version of YOLO algorithm. YOLOv8 is considered as one of the state-of-the-art algorithm for object detection. YOLO model is based from Convolutional Neural Network (CNN). The concept of this algorithm to detect object is called single stage detector, which means that it takes the whole image as input for its neural network thus only single image process needed. This concept makes YOLO fast to detect objects. Object tracking algorithm is also used to keep track detected cigarette even if there is a change in light, occlusion from other object, and other visual changes in the video. ByteTrack is used for the tracking algorithm in this study. ByteTrack works by processing bounding box location of each frame in video, making it use little computation. The main difference of this algorithm is that it process all bounding boxes from the object detection, including detected object with low confidence score. In this study, the YOLOv8 model managed to obtain the best performance with precision value of 86.5%, recall value of 86.1%, mAP 50 value of 88.1%, and mAp 50:95 value of 58.3%. For the confidence configuration of ByteTrack, best performance is achieved with 0.247 confidence score for the first association and 0.01 confidence score for the second association. The result of this configuration is a precision value of 62.5%, a recall value of 62.7%, an accuracy value of 45.5%, and a F1 score of 62.5%."

Depok: Fakultas Teknik Universitas Indonesia, 2023

S-pdf

UI - Skripsi Membership  Universitas Indonesia Library

Cari yang mirip

Tambahkan ke Favorit

Metadata PDF

Abstrak PDF

Abstrak

<< 1 2 3 4 5 6 7 8 9 10 >>

Hasil Pencarian :: Simpan CSV :: Kembali

Hasil Pencarian