Development of video-based emotion recognition using deep learning with Google Colab

Gunawan, Teddy Surya; Ashraf, Arselan; Riza, Bob Subhan; Haryanto, Edy Victor; Rosnelly, Rika; Kartiwi, Mira; Janin, Zuriati

doi:10.12928/telkomnika.v18i5.16717

Cited by 46 publications

(18 citation statements)

References 23 publications

Supporting

Mentioning

Contrasting

Unclassified

Order By: Relevance

“…Hampir setiap aktivitas manusia selalu mengharapkan hasil yang maksimal sehingga diperlukan pernan teknologi [7]. Peningkatan luar biasa dalam perkembangan teknologi interaksi manusia-komputer yang canggih memungkinkan persoalan yang berhubungan dengan biometrik dapat diselesaikan bukan hanya sekedar mendeteksi jenis kelamin melainkan situasi mood seseorang juga dapat dipredisksi [8].…”

Section: Pendahuluanunclassified

Analysis of EfficientNetV2 Model Usage in Predicting Gender on the Face of Mask Users

Sinaga¹

2022

JATISI

View full text Add to dashboard Cite

Biometrik merupakan metode untuk mengenali karakteristik fisik atau perilaku manusia yang digunakan sebagai input untuk pengenalan pola. Setiap bentuk biometrik tentunya menggunakan teknologi yang berbeda dalam mengidentifikasikannya. Sebuah gallery atau pertujukan seperti bioskop, pusat perbelanjaan, pameran membutuhkan informasi pengunjung dari acara tersebut untuk dilakukan sebuah kajian dalam menawarkan atau menjual produk sesuai dengan jenis kelamin dari pengunjung. Model EfficientNetV2 merupakan Family Baru dalam kelompok Covolution Neural Network (CNN) yang memiliki kecepatan pelatihan lebih cepat dan efisiensi parameter yang lebih baik daripada model sebelumnya. Dalam uji coba menunjukkan bahwa model EfficientNetV2 berlatih jauh lebih cepat daripada model tercanggih dengan ukuran hingga 6,8x lebih kecil. Hasil dengan memanfaatkan model EfficientNetV2 yang dilakukan 25 epoch dan terdapat 2 class yaitu laki-laki dan perempuan dimana masing-masing terdiri dari 72.318 data training dan 16.813 data testing. Didapatkan nilai akurasi untuk training 0.9455 (94.5 %) dan untuk data testing nilai akurasinya didapatkan 0.9475 (94.7 %). Untuk Niilai loss untuk training 0.1375 (13.75 %) dan untuk data testing nilai loss nya didapatkan 0.1277 (12.7 %).

show abstract

Section: Pendahuluanunclassified

Analysis of EfficientNetV2 Model Usage in Predicting Gender on the Face of Mask Users

Sinaga¹

2022

JATISI

View full text Add to dashboard Cite

show abstract

“…In our experiments, we perform our implementation using an open-source library called Keras which was developed in 2018 by Chollet et al [33]. The training process is done on the Google Colaboratory platform through a Tesla K80 GPU [34], [35] for 100 epochs. We selected the RMSprop as the main optimizer to train our model.…”

Section: Training Data Processmentioning

confidence: 99%

End-to-end deep auto-encoder for segmenting a moving object with limited training data

Kebir¹,

Taibi²

2022

IJECE

View full text Add to dashboard Cite

<span lang="EN-US">Deep learning-based approaches have been widely used in various applications, including segmentation and classification. However, a large amount of data is required to train such techniques. Indeed, in the surveillance video domain, there are few accessible data due to acquisition and experiment complexity. In this paper, we propose an end-to-end deep auto-encoder system for object segmenting from surveillance videos. Our main purpose is to enhance the process of distinguishing the foreground object when only limited data are available. To this end, we propose two approaches based on transfer learning and multi-depth auto-encoders to avoid over-fitting by combining classical data augmentation and principal component analysis (PCA) techniques to improve the quality of training data. Our approach achieves good results outperforming other popular models, which used the same principle of training with limited data. In addition, a detailed explanation of these techniques and some recommendations are provided. Our methodology constitutes a useful strategy for increasing samples in the deep learning domain and can be applied to improve segmentation accuracy. We believe that our strategy has a considerable interest in various applications such as medical and biological fields, especially in the early stages of experiments where there are few samples.</span>

show abstract

“…The third challenge is the creation of the app ui, we used flutter to create an android app using an android emulator [7]. Flutter is constantly used for creation of android and IOS app and doesn't supply much back end resources for audio or musical uses [8].We had to create the back end that wasn't linked with flutter and is on the google collab that is able to pull online python packages capable of overcoming the challenges needed [9]. Since flutter and google collab are not linked by any direct path or package, we had to create our own way to send audio information from the app into the converter and from the converter back into the app.…”

Section: Connecting Our Powerful Musical Backend To a Comfortable Fro...mentioning

confidence: 99%

MusicApp, A Music Sheet Transcribing Moblie Platform using Machine Learning and Nature Language Processing

Wang¹,

Sun²

2022

Signal &Amp; Image Processing Trends

View full text Add to dashboard Cite

As technology advances, we have found more practical uses for it. This ranges from such things as cleaning the house using machines to serving restaurants with robots. Using technology, what if we can use machines to automatically write sheet music for us, transcribing it from audio [1]. This paper designs an application to do exactly that. We used Java to write a program and app that would be able to transcribe audio into sheet music and store it on an app. We applied our application to multiple cases and conducted a qualitative evaluation of the approach. The results show that it is possible with some fine tuning and may be usable in the near future.

show abstract

Development of video-based emotion recognition using deep learning with Google Colab

Cited by 46 publications

References 23 publications

Analysis of EfficientNetV2 Model Usage in Predicting Gender on the Face of Mask Users

Analysis of EfficientNetV2 Model Usage in Predicting Gender on the Face of Mask Users

End-to-end deep auto-encoder for segmenting a moving object with limited training data

MusicApp, A Music Sheet Transcribing Moblie Platform using Machine Learning and Nature Language Processing

Contact Info

Product

Resources

About