DOI: https://doi.org/10.18517/ijods.1.1.1-13.2020
A Data Mining Approach for Parameter Optimization in Weather Prediction
Abstract
More than two decades, there is a number of weather-related websites are available which approximately predict the weather and climate. By extracting essential data from the websites, a predictive data pattern can be produced to show the next day’s weather is with rain or not. By applying different types of web mining and analyzing techniques those extracted weather-related data can be visualized to a typical pattern for weather forecasting with the main deciding factors of weather. With the use of these approaches, reasonably precise forecasts can be made up to about four to five days in advance. For the weather prediction analysis, we need to discover deciding factors of the next day’s weather. Particularly, common weather dependent factors and the relationship of the prediction to the particular phenomenon. The solution proposed by this research can be used to analyze a large amount of weather data which are in different forms in each source. By using predictive mining task our solution allows us to make predictions for future instances according to the model what we have created. Evaluation measurements for the selected data mining technique such as accuracy percentage, TP & FP Rate, Precision, F-Measure, ROC area, SSE, and loglikelihood for classification and clustering leads to create a high-quality model of prediction. Knowledge flow interface provides the data flow to show the processing and analyzing data with precise association rules. In order to evaluate the model, SSE values and time to build the model, are considered in an effective manner.
Article Details
References
Kamlesh Patidar, Preetesh Purohit and Kapil Sharma, “Web Content Mining Using Database Approach and Multilevel Data Tracking Methodology for Digital Library,” Medicaps Institute of Technology & Management IJCST, Vol. 2, Issue 1, March 2011.
M. Baglioni; U. Ferrara; A. Romei; S. Ruggieri; F. Turini, “Preprocessing and Mining Web Log Data for Web Personalization,” LNCS, vol. 2829, pp. 237–249. Springer, Heidelberg (2003).
Michael Azmy, “Web Content Mining Research: A Survey”, DRAFT Version 1, - Nov. 2005.
Menahem Friedman, Mark Last, Yaniv Makover and Abraham Kandel, “Anomaly Detection in web documents using Crisp and fuzzy-based cosine clustering methodology,” Information sciences 177, pp. 467-475 (2007).
Qinbao Song and Martin Shepperd, “Mining web browsing patterns for E-commerce,”, Computers in Industry 57, pp. 622-630 (2006).
Xiaofeng He, Hongyuan Zha, Chris H.Q. Ding and Horst D. Simon, “Web Document Clustering Using Hyperlink Structures,” Computational Statistics & Data Analysis 41, pp. 19-45 (2002).
M. Shamim Khan and Sebastian W. Khor, “Web clustering Using a hybrid neural network,” Applied Soft Computing 4, pp.423-432 (2004).
K.M. Hammouda and M.S. Kamel, “Effective Pharse-Based Document Indexing for Web Document Clustering,” IEEE Trans.Knowledge and Data Eng., vol. 16, no. 10, pp. 1279-1296, Oct. 2004.
O.Zamir and O.Etziono, “Web Document Clustering: A feasibility Demonstration,” Proc. Third Int’l Conf. Research and Development in Information Retrieval (SIGIR),1998.
S. K. Sahu and S. Srivastava, “Review of Web Document Clustering Algorithms,” Third Int’l Conf. IEEE Computing for Sustainable Global Development (INDIACom), 2016.
Y. W. Dou, L. Lu, X. Liu and Daiping Zhang, “Meteorological Data Storage and Management System”, Computer Systems & Applications, vol. 20, no.7, (2011) July, pp. 116-12.
Seif, H. (2016). Naïve Bayes and J48 Classification Algorithms on Swahili Tweets: Perfomance Evaluation. International Journal of Computer Science and Information Security (IJCSIS), 14(1).
Kamber, M. and Pei, J. (2006). Data Mining:concepts and techniques. 2nd ed. heidellberg london: Morgan Kaufmann.
Bharati, M. and Ramageri, A. (2013). Data Mining techniques and applications. Indian Journal of Computer Science and Engineering, 1(4), pp.301-305.
Vanitha, K. and Roch Libia Rani, G. (2010). Analysis of Classification and Clustering Algorithms using Weka For Banking Data. International Journal of Advanced Research in Computer Science, 1(4).