International Journal of Data Science
https://ijods.org/index.php/ds
<p><img src="/public/site/images/ijodsadmin/WebsiteHeader-Rev.jpg" width="100%"></p> <div class="well" style="text-align: justify;"> <p style="text-align: center;"><img style="padding: 10px 15px; float: left;" src="/public/site/images/ijodsadmin/CoverWebsite-Rev.jpg" height="250"></p> <p>Data science combines data inferences, algorithm developments, and technology to solve analytically complex problems. Data is the core of discussions. Advanced capabilities can be built with it.</p> <p>The International Journal of Data Science (IJoDS) is an open-access periodical that <a href="https://ijods.org/index.php/ds/fs" target="_blank" rel="noopener">focuses</a> its discussions on the aspects of data capture, data maintenance, data processing, and how to communicate and analyze the data. The journal is an open-access, <a href="https://ijods.org/index.php/ds/prp">peer-reviewed</a> periodical published biannually. Authors should read <a href="https://ijods.org/index.php/ds/ag">the author's guidelines</a> and agree to the <a href="https://ijods.org/index.php/ds/copyright" target="_blank" rel="noopener">copyright and licensing</a> terms prior to <a href="https://ijods.org/index.php/ds/about/submissions">submitting the articles</a>.</p> <p>The Indonesian Society for Knowledge and Human Development (INSIGHT) is a community of scientists. The community office is at the <a href="https://www.pnp.ac.id/" target="_blank" rel="noopener">Padang State Polytechnic</a>, West Sumatra, Indonesia. Its members are professionals and researchers in science, engineering, and technology. The society agreed with EBSCO Information Services to maintain our publication dissemination and license. Click on the EBSCO logo at the right menu or <a href="https://www.ijods.org/publicdoc/INSIGHT-EBSCO.pdf" target="_blank" rel="noopener">this link to read the agreement</a>.</p> </div>en-US<p><a href="https://ijods.org/index.php/ds/copyright" rel="noopener"><button class="btn btn-primary btn-md btn-block" type="button">Click for the Copyright and License Terms</button></a></p>[email protected] (Rahmat Hidayat)[email protected] (Ikhwan Arief)Sun, 30 Jun 2024 00:00:00 +0000OJS 3.1.2.1http://blogs.law.harvard.edu/tech/rss60Identification of Influential Nodes in Social Network: Big Data - Hadoop
https://ijods.org/index.php/ds/article/view/72
<p>Software development and associated data is the most critical factor these days. Currently, people are living in an internet world where data and related artifacts are major sets of information these days. The data is correlated with real-world data. The analysis of large datasets was done as part of the experimental analysis. The dataset for online social media like Facebook and Twitter was taken for the identification of influential nodes. The analysis of the dataset provides an overview and observation of the dataset for Facebook or Twitter. Here, in the current activity, an overview of cloud computing and big data technologies are discussed along with effective methods and approaches to resolve the problem statement. Particularly, big data technologies such as Hadoop provided by Apache for processing and analysis of Gigabyte(GB) or petabyte(PB) scale datasets are discussed for processing data in distributed and parallel data fashion. Here, the processing of large datasets is done by big data technology by implementing Apache Hadoop in online social media. </p>Rajnish Kumar Kumar, Laxmi Ahuja, Suman Mann
Copyright (c) 2024 International Journal of Data Science
https://creativecommons.org/licenses/by-sa/4.0
https://ijods.org/index.php/ds/article/view/72Sun, 30 Jun 2024 00:00:00 +0000The Performance of Drought Indices on Maize Production in Northern Nigeria Using Artificial Neural Network Model
https://ijods.org/index.php/ds/article/view/79
<p>Drought is widely known to put the ecosystem at risk. It ensues when there is a major rainfall shortage that causes hydrological discrepancies and alters the land productive structures. The degree of rainfall influences the growth and harvests of maize, particularly where irrigation is not practicable. In some parts of northern Nigeria, rainfall is unpredictable and often lower than the quantity needed for a viable crop. For the detection, classification, and control of drought conditions, drought indices are used. There has been notable progress in the last few years in terms of modelling droughts by utilizing statistical or physical models. Despite the successes documented by most of these approaches; a plain, effective, and well-built statistical model is the artificial neural network (ANN). The use of artificial neural networks (ANN) to evaluate the impact of drought indices on maize output in the 17 northern Nigerian states is presented in this research. For a 25-year period from 1993 to 2018, observed annual data of drought indices, RDI, and the Palmer drought indices, which comprise SCPDSI, SCPHDI, and SCWLPM, as well as maize yield (measured in tonnes) in Northern states of Nigeria. The ANN model was evaluated using several activation functions (sigmoid, hyperbolic tangent, and rectified linear unit), hidden layers (1, 2, and 3), and training sets (70%, 80%, and 90%). The Mean Square Error (MSE) was employed to evaluate each ANN model's performance. In summary, most of the states' lowest mean square errors (MSEs) were generated via RELU. Also, as the training percentage increases, the mean square error increases.</p>Adedayo A. Adepoju, Tayo P. Ogundunmade, Grace O Adenuga
Copyright (c) 2024 International Journal of Data Science
https://creativecommons.org/licenses/by-sa/4.0
https://ijods.org/index.php/ds/article/view/79Sat, 01 Jun 2024 00:00:00 +0000Improving Acute Leukemia Classification through Recursive Feature Elimination and Multilayer Perceptron Analysis of Gene Expression Data
https://ijods.org/index.php/ds/article/view/81
<p>This study presents an approach to improving the classification of acute leukemia subtypes using gene expression data analysis. Leveraging Recursive Feature Elimination (RFE) as a feature selection technique and Multilayer Perceptron (MLP) as the predictive modeling framework, this research aims to identify the most influential genes for distinguishing between Acute Lymphoblastic Leukemia (ALL) and Acute Myeloid Leukemia (AML) cases. RFE systematically ranks and selects the most discriminative gene attributes, while MLP constructs a predictive model based on these attributes. The results demonstrate the effectiveness of this combined approach, achieving precision, accuracy, F1-Score, and recall rates of approximately 99% for leukemia subtype classification. Furthermore, specific genes contributing most to the model's predictive power and shedding light on potential biomarkers for leukemia diagnosis were identified. This research underscores the significance of RFE and MLP in the analysis of gene expression data and their potential impact on clinical decision-making in the field of oncology.</p>Temitope Ogunbiyi, Michael Adegoke, Adebisi Oluwatosin, Bamidele Aremo, Olufemi Adekunle, Emmanuel Ayoariyo, Austin Udemba
Copyright (c) 2024 International Journal of Data Science
https://creativecommons.org/licenses/by-sa/4.0
https://ijods.org/index.php/ds/article/view/81Sat, 01 Jun 2024 00:00:00 +0000Analyzing of Student Alcohol Consumption and Consequences
https://ijods.org/index.php/ds/article/view/51
<p>Student alcohol consumption is very interesting topic today. It is known that average number of students who consume alcohol today is rapidly increasing. Here is idea to analyze how it affect their studying and grades. To predict the grades of students we have to take a lot of aspect in consideration. This research uses a range of different parameters and machine learning algorithms, such as Exploratory Data Analysis and XGBoost. By using this methodologies and algorithms I have to admit that I didn’t develop the best model for prediction.</p>Aid Semić
Copyright (c) 2024 International Journal of Data Science
https://creativecommons.org/licenses/by-sa/4.0
https://ijods.org/index.php/ds/article/view/51Sat, 01 Jun 2024 00:00:00 +0000Phishing URLs Detection Using Naives Baiyes, Random Forest and LightGBM Algorithms
https://ijods.org/index.php/ds/article/view/88
<p>In response to the increasing complexity of phishing attacks, particularly in Malaysia, this study aims to compare the accuracy and precision effectiveness of three machine learning algorithms Naive Bayes, Random Forest and LightGBM in detecting URL (Uniform Resource Locator) phishing. This research employs a comprehensive four-stages methodology including data collection, preprocessing, feature selection, and classification to analyze data for URL phishing attacks classification. The objectives are to identify phishing attack features based on dataset using and machine learning algorithms, to compare between three classification algorithms of Naïve Bayes, Random Forest, and Light Gradient Boosting Model (LightGBM), and to evaluate the model in terms of accuracy, and precision using machine learning algorithms. Through this comparative analysis, the study seeks to develop a phishing detection model, to identify the suitable features and classification algorithms for the datasets. The result accuracy, precision for NB, Random & LightGBM. The Accuracy result of Naives Baiyes is 94.24%, the result of Random Forest is 94.80% and the result of LightGBM is 95.00%.</p>Cik Feresa Mohd Foozy, Muhammad Amir Izaan Anuar, Andi Maslan, Husaini Aza Mohd Adam, Hairulnizam Mahdin
Copyright (c) 2024 International Journal of Data Science
https://creativecommons.org/licenses/by-sa/4.0
https://ijods.org/index.php/ds/article/view/88Sun, 30 Jun 2024 00:00:00 +0000