CLASSIFICATION OF CAT SOUNDS USING CONVOLUTIONAL NEURAL NETWORK (CNN) AND LONG SHORT-TERM MEMORY (LSTM) METHODS

  • Fadhilah Gusti Safinatunnajah Program Studi Teknik Informatika, Fakultas Informatika, Institut Teknologi Telkom Purwokerto, Indonesia
  • Agi Prasetiadi Program Studi Teknik Informatika, Fakultas Informatika, Institut Teknologi Telkom Purwokerto, Indonesia
  • Merlinda Wibowo Program Studi Teknik Informatika, Fakultas Informatika, Institut Teknologi Telkom Purwokerto, Indonesia
Keywords: Cat, CNN, LSTM, Voice

Abstract

Cats become pets who are very close to humans, and they convey messages by producing identical sounds. Therefore, analysis of pet voices is important for a better relationship between cats and human. Animal communication through sound, especially in cats, depends on the situation or context in which the sound is made such as in a state of danger. Based on these problems, a classification method is needed to classify the similarity of characteristics in the resulting sound pattern. The classification methods used are Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) which can remember information for a long time and are used for a long time period. This study aimed to determine feelings or moods based on the sound produced into 4 categories: The Purr, The Meow, The Mating Call, and The Howl. The result of this study is that the best architectural model is to use 4 CNN convolution layers measuring 8-8-8-8 and 2 LSTM layers measuring 8-8. The precision value in this architecture is 0.68, the recall value is 1.00, the accurary value is 0.5625 and the f1-score value is 0.77. The small value of the confusion matrix is ​​caused by the lack of dataset duration in the training process, resulting in underfitting.

Downloads

Download data is not yet available.

References

P. L. Bernstein, “The Human-Cat Relationship,” pp. 47–89, 2007, doi: 10.1007/978-1-4020-3227-1_3.

Y. R. Pandeya, D. Kim, and J. Lee, “Domestic cat sound classification using learned features from deep neural nets,” Appl. Sci., vol. 8, no. 10, pp. 1–17, 2018, doi: 10.3390/app8101949.

S. Schötz, The Secret Language of Cats: How to Understand Your Cat for a Better, Happier Relationship. Hanover Square Press, 2018.

C. Y. Koh, J. Y. Chang, C. L. Tai, D. Y. Huang, H. H. Hsieh, and Y. W. Liu, “Bird sound classification using convolutional neural networks,” CEUR Workshop Proc., vol. 2380, pp. 9–12, 2019.

S. D. H. Permana, G. Saputra, B. Arifitama, Yaddarabullah, W. Caesarendra, and R. Rahim, “Classification of bird sounds as an early warning method of forest fires using Convolutional Neural Network (CNN) algorithm,” J. King Saud Univ. - Comput. Inf. Sci., no. xxxx, 2021, doi: 10.1016/j.jksuci.2021.04.013.

Y. Luan and S. Lin, “Research on Text Classification Based on CNN and LSTM,” Proc. 2019 IEEE Int. Conf. Artif. Intell. Comput. Appl. ICAICA 2019, pp. 352–355, 2019, doi: 10.1109/ICAICA.2019.8873454.

C. Chen and F. Qi, “Single Image Super-Resolution Using Deep CNN with Dense Skip Connections and Inception-ResNet,” Proc. - 9th Int. Conf. Inf. Technol. Med. Educ. ITME 2018, pp. 999–1003, 2018, doi: 10.1109/ITME.2018.00222.

V. Guedes, A. Junior, J. Fernandes, F. Teixeira, and J. P. Teixeira, “Long short term memory on chronic laryngitis classification,” Procedia Comput. Sci., vol. 138, pp. 250–257, 2018, doi: 10.1016/j.procs.2018.10.036.

Y. Chen, J. Lv, Y. Sun, and B. Jia, “Heart sound segmentation via Duration Long–Short Term Memory neural network,” Appl. Soft Comput. J., vol. 95, p. 106540, 2020, doi: 10.1016/j.asoc.2020.106540.

Ridho Aji Pangestu, Basuki Rahmat, and Fetty Tri Anggraeny, “Implementasi Algoritma Cnn Untuk Klasifikasi Citra Lahan Dan Perhitungan Luas,” J. Inform. dan Sist. Inf. , vol. 1, no. 1, pp. 166–174, 2020.

X. H. Le, H. V. Ho, G. Lee, and S. Jung, “Application of Long Short-Term Memory (LSTM) neural network for flood forecasting,” Water (Switzerland), vol. 11, no. 7, 2019, doi: 10.3390/w11071387.

P. O. Sgd, D. Irfan, R. Rosnelly, M. Wahyuni, J. T. Samudra, and A. Rangga, “MENGGUNAKAN CNN,” vol. 4307, no. June, pp. 244–253, 2022.

N. D. Miranda, L. Novamizanti, and S. Rizal, “Convolutional Neural Network Pada Klasifikasi Sidik Jari Menggunakan Resnet-50,” J. Tek. Inform., vol. 1, no. 2, pp. 61–68, 2020, doi: 10.20884/1.jutif.2020.1.2.18.

A. Amrin and H. Saiyar, “Aplikasi Diagnosa Penyakit Tuberculosis Menggunakan Algoritma Naive Bayes,” Jurikom, vol. 5, no. 5, pp. 498–502, 2018.

J. Yang, X. Huang, H. Wu, and X. Yang, “EEG-based emotion classification based on Bidirectional Long Short-Term Memory Network,” Procedia Comput. Sci., vol. 174, no. 2019, pp. 491–504, 2020, doi: 10.1016/j.procs.2020.06.117.

C. Y. Koh, J. Y. Chang, C. L. Tai, D. Y. Huang, H. H. Hsieh, and Y. W. Liu, “Bird sound classification using convolutional neural networks,” CEUR Workshop Proc., vol. 2380, no. September, 2019.

W. Zhang, J. Han, and S. Deng, “Abnormal heart sound detection using temporal quasi-periodic features and long short-term memory without segmentation,” Biomed. Signal Process. Control, vol. 53, p. 101560, 2019, doi: 10.1016/j.bspc.2019.101560.

Published
2022-10-24
How to Cite
[1]
F. G. Safinatunnajah, A. Prasetiadi, and M. Wibowo, “CLASSIFICATION OF CAT SOUNDS USING CONVOLUTIONAL NEURAL NETWORK (CNN) AND LONG SHORT-TERM MEMORY (LSTM) METHODS”, J. Tek. Inform. (JUTIF), vol. 3, no. 5, pp. 1349-1353, Oct. 2022.