Search Results

Found 47 documents matching the query

Winter, Patricia de

Starting out in statistics : an introduction for students of human health, disease and psychology

Chichester: Wiley Blackwell, 2014

610.21 WIN s

Textbook SO Universitas Indonesia Library

Tuwaji

A Study of bibliometric and topic distribution of special collection for academic society of STAIN Al-Fatah Jayapura 2007-2017 (IAIN Fattahul Muluk Papua)

"This study of the bibliometric and topic distribution of the special collection of the STAIN Al-Fatah Jayapura library covered scientific books written by lecturers, research reports, and articles from the Jabal Hikmah journal."

Jakarta: Pusat Jasa Perpustakaan dan Informasi, 2019

020 VIS 21:1 (2019)

Journal Article Universitas Indonesia Library

Maxwell, Robert L.

FRBR: a guide for the perplexed

"FRBR (Functional Requirements for Bibliographic Records) is an evolving conceptual model designed to help users easily navigate catalogs and find the material they want in the form they want, be that print, DVD, audio, or adaptations. Developed by the International Federation of Library Associations and Institutions Cataloging Section, FRBR is now being integrated into cataloging theory and implemented in systems and practice."

Chicago: [American Management Association], 2008

e20437556

eBooks Universitas Indonesia Library

Chaluemwut Noyunsan

A social network newsworthiness filter based on topic analysis / Chaluemwut Noyunsan, Tatpong Katanyukul, Yuqing Wu, Kanda Runapongsa Saikaew

"Assessing the trustworthiness of social media posts is increasingly important as the number of online users and activities grows. Currently deployed assessment systems measure post trustworthiness as credibility. However, they measure the credibility of all posts indiscriminately. The credibility concept was intended for news-type posts, and labeling other types of posts with credibility scores may confuse users. Previous notable works envisioned filtering out non-newsworthy posts before credibility assessment as a key factor towards a more efficient credibility system. Thus, we propose a topic-based supervised learning approach that uses Term Frequency-Inverse Document Frequency (TF-IDF) and cosine similarity to filter out the posts that do not need credibility assessment. Our experimental results show that users agree with about 70% of the proposed filtering suggestions. Such results support the notion of newsworthiness introduced in the pioneering work on credibility assessment. The topic-based supervised learning approach is shown to provide a viable social network filter."

2016

J-Pdf

Journal Article Universitas Indonesia Library

Raden Trivan Sutrisman

Analisis metode inisialisasi pada algoritma eigenspace based fuzzy c-means untuk pendeteksian topik berita online Indonesia = Analysis of initialization methods on eigenspace based fuzzy c-means algorithm for Indonesian online news topic detection

"ABSTRACT

The rapid growth of online news in Indonesia creates the need for news data analysis to extract the essence of the information accurately and quickly. Topics are basic components that are often used to analyze textual data such as news articles. With topic modeling, topics can be detected automatically in very large collections of news documents, a task that is difficult to perform manually. One topic modeling approach is the clustering-based method Eigenspace-based Fuzzy C-Means (EFCM). EFCM commonly uses random initialization. In this research, Non-Negative Double Singular Value Decomposition (NNDSVD) and Fuzzy C-Means++ (FCM++) are implemented as alternative initialization methods for the EFCM algorithm. The simulations show that NNDSVD and FCM++ initialization give better accuracy in terms of topic interpretability than random initialization."

Depok: Universitas Indonesia, 2018

T50041

UI - Thesis Membership Universitas Indonesia Library

Kellar, Stacey Plichta

Munro's statistical methods for health care research

"This text provides students with a solid foundation for understanding data analysis and specific statistical techniques. Focusing on the most current and frequently used statistical methods in today's health care literature, the book covers essential material for a variety of program levels, including in-depth courses beyond the basic statistics course. Well-organized, clear text discussions and great learning tools help students overcome the complexities and fully comprehend the concepts of this often intimidating area of study."

Philadelphia: Wolters Kluwer Health/Lippincott Williams & Wilkins, 2013

610.727 KEL m

Textbook SO Universitas Indonesia Library

Chaluemwut Noyunsan

A social network newsworthiness filter based on topic analysis

Depok: Faculty of Engineering, Universitas Indonesia, 2016

UI-IJTECH 7:7 (2016)

Journal Article Universitas Indonesia Library

Julizar Isya Pandu Wangsa

Studi Perbandingan Metode Clustering K-Means, DBSCAN, dan HDBSCAN pada BERTopic untuk Pendeteksian Topik = Comparative Study of K-Means, DBSCAN, and HDBSCAN Clustering Methods on BERTopic for Topic Detection

"Topic detection is the process of identifying the central themes in a large, unorganized collection of documents. It is simple enough to do manually when there is only a small amount of data. For large amounts of data, proper processing is needed to obtain the topic representation of each document quickly and accurately, so machine learning is required. BERTopic is a topic modeling method that applies clustering techniques, using pre-trained Bidirectional Encoder Representations from Transformers (BERT) models for text representation and Class-based Term Frequency-Inverse Document Frequency (c-TF-IDF) for topic extraction. The clustering methods used in this research are K-Means, Density-Based Spatial Clustering of Applications with Noise (DBSCAN), and Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN). BERT was chosen as the text representation method because it represents a sentence as a sequence of words and takes into account the context of each word in the sentence. The resulting text representation is a high-dimensional numeric vector, so its dimensionality is reduced with Uniform Manifold Approximation and Projection (UMAP) before clustering. The performance of the BERTopic model with the three clustering methods is analyzed using the coherence, diversity, and quality score metrics. The quality score is the product of the coherence value and the diversity value. The simulation results show that, on quality score, the BERTopic model with K-Means clustering outperforms the other two clustering methods on 2 of the 3 datasets."

Depok: Fakultas Matematika dan Ilmu Pengetahuan Alam Universitas Indonesia, 2023

S-pdf

UI - Undergraduate Thesis Membership Universitas Indonesia Library

Dian Isnaeni Nurul Afra

Analisis Sentimen dan Pemodelan Topik dengan Data Media Sosial Twitter: Studi Kasus Komisi Pemberantasan Korupsi = Sentiment Analysis and Topic Modelling using Twitter Social Media Data: A Case Study of the Corruption Eradication Commission

"The Corruption Eradication Commission (KPK) has the authority to register and examine State Officials' Wealth Reports (LHKPN). This reporting serves to monitor honesty and integrity and to detect possible illegal enrichment by public officials. Publication of LHKPN often creates negative prejudice and public suspicion of officials' wealth reports, which makes officials reluctant to report their assets completely and correctly. If not addressed quickly, this perception becomes counterproductive to the KPK's corruption prevention efforts. This study aims to build a sentiment analysis model and a topic model that can explore topics from Twitter social media data. Indonesia has the sixth-largest number of active users in the world, with 15.7 million users, dominated by the 25-34 age group. A dataset of 881 records was collected from Twitter with the keywords "lhkpn" and "harta kekayaan pejabat" (officials' assets) for the period August 1 to November 5, 2021. This study explores several classification algorithms; unigram, bigram, and trigram feature representations with CountVectorizer and TFIDF; and the SMOTE oversampling method. The best-performing classification algorithm in this study is the Multilayer Perceptron with unigram CountVectorizer features and oversampling, with 76.60% accuracy, 78.19% precision, 76.60% recall, and a 76.95% F1 score. The topic modelling results using Latent Dirichlet Allocation for the 'negative' category are dominated by expressions of public disappointment and anger at the increase in officials' wealth during the Covid-19 pandemic, in contrast to the rising state debt and the hardships faced by the public during the pandemic. The topics generated for the 'positive' category are quite diverse, ranging from rules on reversing the burden of proof and proposals on reporting obligations and sanctions to requests to disclose wealth reports to the public and discussions of whether increases in wealth caused by the rising value of immovable assets are reasonable."

Depok: Fakultas Ilmu Komputer Universitas Indonesia, 2022

TA-pdf

UI - Final Project Universitas Indonesia Library

Hanif Sudira

Pembuatan model analisis sentimen untuk perhitungan brand reputation serta pemanfaatan topic modelling pada layanan Indihome menggunakan data Twitter dan Instagram = Creating a sentiment analysis model for calculation of brand reputation and utilization of topic modelling on Indihome services using Twitter and Instagram data

"The role of the internet is increasingly important in various aspects of people's lives. The need for internet access is an opportunity for internet providers, one of which is Telkom with IndiHome. As a state-owned enterprise (BUMN), Telkom acts as an internet service provider to meet the needs of the community. Based on customer satisfaction surveys in 2019 and 2020, IndiHome's NPS did not reach the target. Against a target of at least 5, IndiHome's NPS was -1.67 in 2019 and 2.87 in 2020. This is because problem handling is still driven by reports, there is no way to find out about problems as they occur, and social media opinion is not yet used because the company still relies on surveys. This study builds a sentiment analysis model and a topic model for IndiHome on Twitter and Instagram. The data covers March 2019 to April 2021. The resulting models use the SVM method, with 70.13% accuracy on Twitter and 73.55% accuracy on Instagram. The majority sentiment is negative, with an NPS of -79.49 on Twitter and -56.12 on Instagram. On both Twitter and Instagram, the response to IndiHome has a negative index, meaning that people are not satisfied with IndiHome. The negative discussion topics are: IndiHome internet going down suddenly, IndiHome internet being slow, IndiHome internet going down when it rains, IndiHome being expensive, IndiHome service being unresponsive, IndiHome service not providing solutions, internet being cut off despite payment, technician appointments not being kept on time, and wanting to unsubscribe or switch providers."

Jakarta: Fakultas Ilmu Komputer Universitas Indonesia, 2022

TA-pdf

UI - Final Project Universitas Indonesia Library