IJCVSP: Vol. 14

Direction-of-arrival estimation for conventional co-prime arrays using probabilistic Bayesian neural networks

Wael Elshennawy, Orange Business Services, Egypt
Abstract:
The paper investigates the direction-of-arrival (DOA) estimation of narrow band signals with conventional co-prime arrays by using efficient probabilistic Bayesian neural networks (PBNN). A super resolution DOA estimation method based on Bayesian neural networks and a spatially overcomplete array output formulation overcomes the pre-assumption dependencies of the model-driven DOA estimation methods. The proposed DOA estimation method utilizes a PBNN model to capture both data and model uncertainty. The developed PBNN model is trained to do the mapping from the pseudo-spectrum to the super resolution spectrum. This learning-based method enhances the generalization of untrained scenarios, and it provides robustness to non-ideal conditions, e.g., small angle separation, data scarcity, and imperfect arrays, etc. Simulation results demonstrate the root mean square error (RMSE) and loss curves of the PBNN model in comparison with deterministic model and spatial-smoothing MUSIC (SS-MUSIC) method. The proposed Bayesian estimator improves the DOA estimation performance for the case of low signal-to-noise ratio (SNR) or with a limited number of model trainable variables or spatially adjacent signals.
Full Paper (in PDF)

Hybrid Deep Learning for Assembly Action Recognition in Smart Manufacturing

Abdul Matin, University of Technology Sydney, Australia
Md Rafiqul Islam, University of Technology Sydney, Australia
Yeqian Zhu, University of Technology Sydney, Australia
Xianzhi Wang, University of Technology Sydney, Australia
Huan Huo, University of Technology Sydney, Australia
Guandong Xu, University of Technology Sydney, Australia
Abstract:
Deep learning algorithms have become essential in assembly action recognition (AAR) for driving advancements in intelligent manufacturing. While numerous sensor systems and algorithms are developing, their real-world applicability and robustness within the manufacturing sector need validation. Artificial intelligence (AI) applications in manufacturing have gained significant attraction in both academic and industrial circles. One key aspect of future smart manufacturing is identifying the actions of manufacturing workers, particularly monitoring repetitive assembly tasks, to guide them and improve efficiency. This recognition facilitates real-time efficiency measurement and evaluation of workers while providing augmented reality instructions to enhance their performance on the job. This paper introduces a hybrid deep-learning approach combining 3D CNN and ConvLSTM2D models to monitor assembly tasks to recognize human actions within the manufacturing context. The model’s performance is evaluated through simulations conducted on the HA4M dataset, comprising diverse multimodal data-capturing actions executed by various individuals constructing an epicyclic gear train (EGT). The proposed hybrid model demonstrated superior performance on the HA4M dataset relative to baselines.
Full Paper (in PDF)

Automated Fracture Detection from Pelvic X-ray: The Impact of Appropriate Labeling on the Performance of Deep Convolutional Neural Network

Rashedur Rahman, University of Hyogo, Japan
Naomi Yagi, University of Hyogo, Japan
Keigo Hayashi, Hyogo Prefectural Harima-Himeji General Medical Center, Japan
Akihiro Maruo, Hyogo Prefectural Harima-Himeji General Medical Center, Japan
Hirotsugu Muratsu, Hyogo Prefectural Harima-Himeji General Medical Center, Japan
Sayoji Kobashi, University of Hyogo, Japan
Abstract:
Pelvic X-rays (PXRs) are essential diagnostic tools used to visualize the pelvic region and assess pelvic fractures. The rising incidence of pelvic fractures leads to increased radiologist workload and initial misdiagnoses. As a result, there is a growing need for automated tools to assist doctors in pelvic fracture detection. Artificial intelligence has advanced recently, resulting in several methods for diagnosing PXRs for fractures. However, concerns regarding annotation accuracy and the limitations of PXRs due to constrained viewing angles persist. Some fractures are only visible in 3D computed tomography (CT) images, and it is difficult to understand their visibility in PXR. This study proposes a method for using annotations from pelvic CT to label PXRs, focusing on fracture visibility. Additionally, the impact of labeling PXRs based on visibility to fracture detection performance in PXR images is examined. First, all fractures in CT images are annotated using a 3D surface annotation approach. Next, annotated pseudo PXRs are synthesized from CT images utilizing digitally reconstructed radiographs (DRRs). The annotated pseudo PXRs serve as references for accurately labeling fractures in corresponding PXRs. By training a Resnet-101-based deep convolutional neural network (DCNN) with the labeled datasets considering fracture visibility, the proposed method significantly improved fracture detection performance, achieving an Area Under the Receiver Operating Characteristic (AUROC) of 0.9114. The AUROC of the conventional annotation method was 0.8202.
Full Paper (in PDF)

Exploring Oversampling Techniques for Fraud Detection with Imbalanced Classes

Sultan Alharbi, University of Technology Sydney, Australia
Abdulrhman Alorini, University of Technology Sydney, Australia
Khaled Alahmadi, University of Technology Sydney, Australia
Hadeel Alhosaini, University of Technology Sydney, Australia
Yeqian Zhu, University of Technology Sydney, Australia
Xianzhi Wang, University of Technology Sydney, Australia
Abstract:
Each year, credit card fraud has caused significant losses for financial institutions and individuals worldwide. Financial institutions must detect credit card fraud to prevent customers from being charged for products they did not order. Class imbalance has been a standing challenge for credit card transactions, as the number of fraudulent transactions is significantly lower than that of non-fraudulent transactions. In this paper, we comprehensively evaluate five oversampling techniques, namely Synthetic Minority Oversampling Technique (SMOTE), Adaptive Synthetic Sampling (ADASYN), Borderline SMOTE, Random Oversampling, and SMOTE Support Vector Machine (SMOTE SVM), in combination with seven machine learning techniques (namely XGBoost, Random Forest, K-Nearest Neighbor, Naive Bayes, Support Vector Machine, LightGBM, and Convolution Neural Network). Our results show oversampling generally improves fraud detection performance and SMOTE SVM is the better oversampling method than other methods under test. Notably, it achieved an accuracy of 76.47% when used with KNN on the smaller dataset and 99.93% with CNN on the larger dataset used in our experiments.
Full Paper (in PDF)

Generation of Clothing Items with Jamdani Motif Elements Using Automated Generative Adversarial Networks

Hujaifa Islam, Samiur Rahman Abir, Md. Sakibur Rahman, Hasan Mahmud, Mohammad Shafiul Alam, Ahsanullah University of Science and Technology, Bangladesh
Abstract:
Clothing serves as an artistic medium for humans to express their preferences, thoughts, and cultural heritage, while the application of machine learning, particularly Generative Adversarial Networks (GANs), remains largely unexplored in the realm of clothing production and design, with designers currently relying on their imaginative skills to create diverse styles. In this article, Conditional Generative Adversarial Networks (cGAN) are used to suggest an automated approach. Neural style transfer and cGAN algorithms are employed. to create traditional clothing with distinctive patterns and a variety of styles. For this study, the Fashion MNIST and Jamdani Motif Dataset datasets were both employed. The conditional GAN model was used to produce several styles of apparel using the MNIST dataset. The Neural Style Transfer model is then used to combine the created picture with the Jamdani Motif pattern from the Jamdani Motif dataset. Using Otsu's image segmentation technique, the foreground, and background of the resulting picture are separated. Performance scores of this model are as follows: Inception Score is 1.3573909, Frechet inception distance is 1272.222597, Kernel Inception Distance is 636200.667, Coverage Metric is 33.79799. We polled several people on our work output, and the results are detailed in a later section. Generate Jamdani clothing using single pattern and remove extra regions using image segmentation.
Full Paper (in PDF)

In-Depth Analysis of Automated Acne Disease Recognition and Classification

Afsana Ahsan Jeny, Masum Shah Junayed, University of Connecticut, Storrs, USA
Md Robel Mia, Daffodil International University, Bangladesh
Md Baharul Islam Florida Gulf Coast University, USA
Abstract:
Facial acne is a common disease, especially among adolescents, negatively affecting individuals both physically and psychologically. Classifying acne is vital for providing appropriate treatment. Traditional visual inspection or expert scanning is time-consuming and challenging to differentiate acne types. This paper introduces an automated expert system for acne recognition and classification. The proposed method employs a machine learning-based technique to classify and evaluate six types of acne diseases, facilitating the diagnosis process for dermatologists. The preprocessing phase includes contrast improvement, smoothing filter application, and RGB to Lab color conversion to eliminate noise and improve classification accuracy. Next, a clustering-based segmentation method, k-means clustering, is applied to segment the disease-affected regions, which then proceed to the feature extraction step. Characteristics of these disease-affected regions are extracted using a combination of gray-level co-occurrence matrix (GLCM) and statistical features. Finally, five different machine learning classifiers are employed to classify acne diseases. The experimental results show that Random Forest (RF) achieves the highest accuracy of 98.50%, which is promising compared to state-of-the-art methods.
Full Paper (in PDF)

A Real-Time Anti-Aliasing Approach for 3D Applications Using Deep Convolutional Neural Network

F. M. Jamius Siam, Zahidul Islam Prince, Ahmed Nafisul Bari, BRAC University, Bangladesh
Jia Uddin, Woosong University, South Korea
Abstract:
In real-time 3D applications, delivering smooth edges in the output images is essential, mainly due to limitations in resolution, memory, and processing power. This paper proposes a deep convolutional neural network-based model designed to address this aliasing issue. Aliasing in an image is characterized by hard, jagged edges that are present especially when the edges do not line up with the pixel grid of the output device. Our approach leverages a deep convolutional neural network to learn these jagged patterns in images from a training dataset and generates anti-aliased output images. The model's architecture includes several layers of convolutional neural networks, max-pooling layers, and convolutional transpose layers. During the experimental analysis, we used a dataset comprising demo 3D scenes created with both the Unity and Unreal game engines. This dataset contains raw and super-sampled images along with images processed with various other anti-aliasing techniques. To assess performance, we used both SSIM and PSNR scores as metrics to analyze the model’s accuracy. The experimental results show that our proposed model not only competes with but often surpasses other state-of-the-art methods like MSAA, FXAA, TAA, and SMAA, by achieving higher SSIM and PSNR scores.
Full Paper (in PDF)

Investigation of Emotional Effects on Brain Network Stimulation through EEG Signals

Mahfuza Akter Maria, M. A. H. Akhand, Khulna University of Engineering and Technology, Bangladesh
Md Abdus Samad Kamal, Gunma University, Japan
Abstract:
There has been growing evidence in recent years which supports that different brain areas are involved in processing emotions. As a result, research on emotion from the perspective of brain networks is becoming popular. The connectivity strength of this network can be changed with different mental states, which can be identified through different frequency bands of the brain signal. In this study, brain functional and effective connectivity networks have been constructed from DEAP emotional EEG data to study how emotion influences patterns of this connectivity. According to the investigation results, more direct correlations are found under positive emotions than negative ones. The brain regions operate more synchronously, and there is less directed flow of information between brain regions during negative emotions. The correlation between brain regions, whether direct or inverse, is higher in the lower frequency band than in the higher frequency band. The flow of information from one brain region to another brain region increases with higher frequency, and there is more synchrony between brain regions in the Gamma frequency band. The findings of this study have substantial implications for the practical application of EEG-based emotion analysis, as well as prospective avenues for future research in this field.
Full Paper (in PDF)

A Low-cost IoT-based Meteorological System Using LoRaWAN and Embedded Technologies: Architecture and Future Trends

Norbert Dajnowski, Andrew Guest, York St John University, UK
Aminu Bello Usman, University of Sunderland, UK
Abdulrazaq Abba, University of East London, UK
Saifur Rahman Sabuj, BRAC University, Bangladesh
Abstract:
The field of meteorological station development is undergoing continuous advancements, driven by the pursuit of more precise data acquisition while also maintaining cost-effectiveness. To achieve this objective, governments and businesses are increasingly harnessing Internet of Things (IoT) platforms to deploy hyperlocal and highly sophisticated meteorological stations. These stations are designed to offer real-time analysis of weather conditions and forecasts with unparalleled precision. In this study, we have designed and implemented a robust yet affordable meteorological system capable of collecting various weather parameters. This system is integrated with a low-cost, long-range data transmission technology, enabling multiple nodes to access the internet through the LoRaWAN network server. Additionally, we have developed a user-friendly graphical user interface (GUI) application for visualizing meteorological data. Our proposed solution demonstrated impressive capabilities. The system consistently recorded and transmitted essential meteorological parameters, such as temperature, humidity, pressure, and wind speed, with high accuracy and reliability. Additionally, our GUI application facilitated user-friendly access to this data, offering clear visual representations of weather conditions and station performance.
Full Paper (in PDF)

The Integrity of Source Code Commenting: Benchmark Dataset and Empirical Analysis

Maksuda Islam, Md Safayat Hossen, Ahsanul Haque, Md. Nazmul Haque, Lutfun Nahar Lota, Islamic University of Technology, Bangladesh
Abstract:
Code comments are a vital software feature for program cognition & software maintainability. For a long time, researchers have been trying to find ways to ensure the consistency of code-comment. While doing that, two of the raised problems have been dataset scarcity and language dependency. To address both problems in this paper, we created a dataset using C# projects; there are no annotated datasets yet on C#. 9,310 code-comment pairs of different C# projects were extracted from a data pool. 4,922 code-comment pairs were annotated after removing NULL, constructor, and variable. Both method-comment and class-comment were considered in this study. We employed two evaluation metrics for the dataset, one is Krippendorff’s Alpha which showed 95.67% similarity among the rating of three annotators for all the pairs & other is Bilingual Evaluation Understudy (BLEU) to validate our human-curated dataset. An ensemble machine learning model with topic modeling is also proposed, which obtained 96.2% using the performance metric AUC-ROC after fitting the model to our proposed dataset.
Full Paper (in PDF)

Hit Counter (since 1 Nov. 2012 for papers)

Vol. 14, No. 1: