IMPLEMENTATION OF TEXT INDEXING SYSTEM IN WEB-BASED DOCUMENT SEARCH APPLICATION USING MONGODB

  • Frankie Informatics Engineering, Faculty of Information Technology, Universitas Kristen Satya Wacana, Indonesia
  • Yeremia Alfa Susetyo Informatics Engineering, Faculty of Information Technology, Universitas Kristen Satya Wacana, Indonesia
Keywords: Data Search, Fulltext Search, MongoDB, Python, Text Indexing

Abstract

The rapid growth of information technology has led to an increase in the amount of data stored in databases every day. Relational databases (SQL) that have been in use for a long time are now being developed with the emergence of NoSQL databases such as MongoDB. MongoDB stores data in BSON format and has a Text Indexes feature that is useful for speeding up text search on string content. This feature is particularly useful in searching for data in the form of texts or strings in large quantities. MongoDB's Text Indexes have a flexible schema that does not require a strict schema structure to index text data, unlike SQL databases that require columns with the appropriate data type to perform indexing. MongoDB's Text Indexes support more languages than SQL because they use an open-source text search engine called Apache Lucene. In this study, the researcher will implement Text Indexing on document data (PDF) that has been converted into text, then inserted into MongoDB before indexing. Afterward, the researcher will compare the performance of search queries between indexed and non-indexed data in MongoDB in terms of speed. The comparison results will be presented in tables and graphs to facilitate understanding. Based on the research conducted, it can be concluded that the use of the text indexing feature in MongoDB can speed up keyword or string search time. In the experiment conducted using 5000 data records, the results showed that the use of text indexing for searching 1 keyword resulted in a search speed improvement of 11705,88%, for searching 2 keywords it was 60833,33%, and for searching 3 keywords it was 44320%.

Downloads

Download data is not yet available.

References

S. Akbar, F. Fadhila, V. A. Saputro, E. Utami & Khusnawi, “Perbandingan Performa SQL dan NoSQL Dengan PHP Pada 5 Juta Data,” Indonesian Journal on Computer and Information Technology, vol. 6, no.1, pp. 38-42, 2021.

M. F. Abdi, A. Susanto & Kusnawi, “Perbandingan Kecepatan Pencarian Data Sql Dan NoSQL,” Jurnal Teknologi Informasi, vol.5, no.1, 2021.

M. Silalahi & D. Wahyudi, “Perbandingan Performansi Database Mongodb Dan Mysql Dalam Aplikasi File Multimedia Berbasis Web,” Computer Based Information System Journal, vol. 6, no. 1, pp. 38-42, 2018.

A. Sonita & M. Sari, “Implementasi Algoritma Sequential Searching Untuk Pencarian Nomor Surat Pada Sistem Arsip Elektronik,” Jurnal Pseudocode, vol. 5, no. 1 , 2018.

MongoDB, “Welcome to the MongoDB Documentation - Text Indexes,” 2022, https://www.mongodb.com/docs/manual/core/index-text/#text-indexes (accessed Okt. 18, 2022).

MongoDB, ” Text Search Languages,” 2023, https://www.mongodb.com/docs/manual/reference/text-search-languages/#std-label-text-search-languages (accessed 27/03/2023)

T. M. Kadarina & M. H. I. Hajar, “Pengenalan Bahasa Pemrograman Python Menggunakan Aplikasi Games Untuk Siswa/I Di Wilayah Kembangan Utara,” Jurnal Abdi Masyarakat,vol. 5, no. 1, 2019.

Novindri, G. F., & Saian, P. O. N., “Implementasi Flask pada Sistem Penentuan Minimal Order untuk Tiap Item Barang di Distribution Center pada PT XYZ Berbasis Website,” Jurnal Mnemonic, vol. 5, no. 2, pp. 80-85, 2022.

R. Irsyad, “Penggunaan Python Web Framework Flask Untuk Pemula,” 2018, doi: 10.31219/osf.io/t7u5r.

Renaldi, B. C. Santoso, Y. Natasya, S. Willian, & F. Alfando, “Tinjauan Pustaka Sistematis terhadap Basis Data MongoDB,” JII: Jurnal Inovasi Informatika Universitas Pradita, vol 5, no. 2, pp. 132-142, 2020.

Putra, M. Y., Safitri, N., Fauziah, N. F., Safei, A., Rayhan, & Lolly, W. R., “Desain Web Bagi Pemula Menggunakan Framework Bootstrap pada SMK Taruna Bangsa Bekasi,” Jurnal Buana Pengabdian, vol 3, no. 1,pp. 134-148, 2021.

Putra, M. Y., “Responsive Web Design Menggunakan Bootstrap Dalam Merancang Layout Website,” Information System For Educators And Professionals, vol. 5, no. 1, pp. 61-70, 2020.

PyMongo, “PyMongo 4.3.3 Documentation,” 2023,https://pymongo.readthedocs.io/en/stable/ (accessed Feb. 02, 2023).

I. Asyahri & M. S. Mauludin, “Implementasi Full Text Search Pada Sistem Informasi Perpustakaan Menggunakan Laravel,” Informatika dan RPL, vol. 1, no. 1, pp. 1-9, 2019

A. Hajar, E. Utami, & H. A. Fatta, “Penggunaan Fulltext Indexing Untuk Meningkatkan Efisiensi Pencarian Data Pada Basis Data MYSQL,” Jurnal SISFOTENIKA, vol. 12, no. 2, pp. 213-222, 2022

E. Selviyanti, H. Ajie & Widodo, “Pengembangan Sistem Pencarian Karya Akhir Berdasarkan Abstrak Menggunakan Full-Text Searching Di Sistem Informasi Perpustakaan Jurusan Teknik Elektro Universitas Negeri Jakarta,” JTIM : Jurnal Teknologi Informasi dan Multimedia, vol. 1, no. 2, pp 85- 95, 2019.

Published
2023-10-03
How to Cite
[1]
F. Frankie and Y. A. Susetyo, “IMPLEMENTATION OF TEXT INDEXING SYSTEM IN WEB-BASED DOCUMENT SEARCH APPLICATION USING MONGODB”, J. Tek. Inform. (JUTIF), vol. 4, no. 5, pp. 1081-1087, Oct. 2023.