Pre-Processing Techniques applied on mutual funds
DOI:
https://doi.org/10.22399/ijcesen.2843Keywords:
Data Pre-Processing, Mutual Funds, Data Mining, NormalizationAbstract
The raw data obtained from the logs may be noisy, incomplete, and inconsistent that’s why data pre-processing is an essential step in data mining. The quality of data plays a vital role during the evaluation process. The results of the evaluation process primarily depend upon the quality of the data input. So, data pre-processing is the primary and most crucial step before knowledge discovery. This paper is based on two main steps- data pre-processing techniques and results after applied data pre-processing on mutual funds’ data. Data pre-processing transforms the raw data into a structured, understandable format. Moreover, data pre-processing performs not only the transformation of data but also makes it understandable according to need. It is mainly divided into four steps, i.e., data integration, data cleaning, data transformation, and data reduction. This paper takes the fifteen-year NAV data of twenty mutual funds for analysis propose. This paper explains several techniques of data pre-processing to transform the raw data into an understandable format.
References
[1] Gupta, G., Aggarwal, H., & Rani, R. (2016). Segmentation of retail customers based on cluster analysis in building successful CRM. International Journal of Business Information Systems, 23(2), 212–228.
[2] Qamar, H., & Singh, S. (2016). Mutual fund performance prediction. 2016 IEEE International Conference on Distributed Computing, VLSI, Electrical Circuits and Robotics, DISCOVER 2016 - Proceedings, 1, 185–189.
[3] Anish, C. M., & Majhi, B. (2016). Prediction of mutual fund net asset value using low complexity feedback neural network. 2016 IEEE International Conference on Current Trends in Advanced Computing, ICCTAC 2016, 0–4.
[4] Abasova, J., Janosik, J., Simoncicova, V., & Tanuska, P. (2018). Proposal of Effective Pre-processing Techniques of Financial Data. INES 2018 - IEEE 22nd International Conference on Intelligent Engineering Systems, Proceedings, 293–298.
[5] Anish, C. M., & Majhi, B. (2015). An ensemble model for Net asset value prediction. 2015 IEEE Power, Communication and Information Technology Conference, PCITC 2015 - Proceedings, 392–396.
[6] Samsani, S. (2016). An RST based efficient pre-processing technique for handling inconsistent data. 2016 IEEE International Conference on Computational Intelligence and Computing Research, ICCIC 2016.
[7] Gupta, S., & Gupta, A. (2019). Dealing with noise problem in machine learning data-sets: A systematic review. Procedia Computer Science, 161, 466–474.
[8] Aljuaid, T., & Sasi, S. (2016). Proper imputation techniques for missing values in data sets. Proceedings of the 2016 International Conference on Data Science and Engineering, ICDSE 2016.
[9] Wang, X., Ma, Y., & Li, X. (2010). Data mining in inconsistent data. International Conference on Internet Technology and Applications, ITAP 2010 - Proceedings.
[10] Kumar, M., & Kalia, A. (2012). Pre-processing and symbolic representation of stock data. Proceedings - 2012 2nd International Conference on Advanced Computing and Communication Technologies, ACCT 2012, 83–88.
[11] Pan, J., Zhuang, Y., & Fong, S. (2016). The impact of data normalization on stock market prediction: Using SVM and technical indicators. Communications in Computer and Information Science, 652, 72–88.
[12] Nayak, S. C., Misra, B. B., & Behera, H. S. (2014). Impact of Data Normalization on Stock Index Forecasting. International Journal of Computer Information Systems and Industrial Management Applications, 6, 257–269.
[13] Wei, J. (2010). Research on data pre-processing in supermarket customers data mining. 2nd International Conference on Information Engineering and Computer Science - Proceedings, ICIECS 2010.
[14] Chen, A., Liu, F. H., & Wang, S. De. (2019). Data reduction for real-time bridge vibration data on edge. Proceedings - 2019 IEEE International Conference on Data Science and Advanced Analytics, DSAA 2019, 602–603.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 International Journal of Computational and Experimental Science and Engineering

This work is licensed under a Creative Commons Attribution 4.0 International License.