Energy Consumption Prediction in Iran: A Hybrid Machine Learning and Genetic Algorithm Method with Sustainable Development Considerations

Document Type : Research Article


1 ICT Center of Yazd University, Yazd, Iran

2 Department of Economics, Meybod University, Meybod, Iran

3 Faculty of Economics, Institute for Humanities and Cultural Studies, Tehran, Iran

4 Department of Management, Meybod University, Meybod, Iran

5 Department of Computer Engineering, Meybod University, Meybod, Iran


Ensuring energy security is a major concern of policymakers and economic planners. This objective could be achieved by managing the energy supply and its demand. The latter has received less attention, especially in developing countries. Neglect of energy consumption and its accurate forecasting leads to potential outages and also unsustainable development. Nonlinear methods that are consistent with the nature of energy consumption have led to better results. Therefore, in the present study, both aspects of sustainable development in the determinants of energy demand and the nonlinear hybrid method have been used. We introduced a model based on sustainable development indicators to forecast energy consumption in Iran in which the relevant indicators are specified by the determination phase. To forecast energy consumption, we provided a new standard dataset for energy consumption in Iran (IREC) based on the data extracted from the World Bank and Ministry of Energy dataset in Iran. The highlight of this research is that it provided the most efficient features from the dataset using the genetic algorithm and five forecasting approaches based on machine learning methods. The algorithm was able to select 14 features as the most effective indicators in predicting energy consumption from all the 104 ones in the IREC with 500 repetitions. The empirical results indicated that the model can provide important indicators for energy consumption forecasting. The experiment result of the model using the GA-Based feature selection indicates that the hybrid model has had better results and GA-SVM and GA-MLP have the best result respectively.


Banadkooki, F.B., Ehteram, M., Ahmed, A.N., Teo, F.Y., Ebrahimi, M., Fai, C. M., Huang, Y. F., and El-shafie, A. (2020). Suspended sediment load prediction using artificial neural network and ant lion optimization algorithm. Environ Sci Pollut Res, 27, 38094–38116.
Bishop, C. M. (2006). Pattern recognition and machine learning: Springer.
Boughorbel, S., Jarray, F. and El-Anbari, M. (2017). Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric. PloS one, 12(6), e0177678.
Chambers, L. D. (2019). Practical handbook of genetic algorithms: complex coding systems: CRC press.
Chandrashekar, G., and Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering, 40(1), 16–28.
Chen, Y. (2017). A feature-free 30-disease pathological brain detection system by linear regression classifier. CNS & Neurological Disorders-Drug Targets (Formerly Current Drug Targets-CNS & Neurological Disorders), 16(1), 5–10.
Daryaei, A., Bajelan, A., and Khodayeki, M. (2019). The Impact of Stocks Traded-Total Value, Foreign Direct Investment, Number of Students and Fossil Fuel Energy Consumption on NO2 Emissions in Iran. Environmental Energy and Economic Research, 3(4), 335-348.
Daut, M. A. M., Hassan, M., Abdullah,Y. H., Rahman, H. A., Abdullah, M. P., and Hussin, F. (2017). Building electrical energy consumption forecasting analysis using conventional and artificial intelligence methods: A review. Renewable and Sustainable Energy Reviews, 70, 1108–1118.
Deb, C., Zhang, F., Yang, J., Lee, S. E., and Shah, K. W. (2017). A review on time series forecasting techniques for building energy consumption. Renewable and Sustainable Energy Reviews, 74, 902–924.
Delmastro, C., Mutani, G., and Schranz, L. (2016). The evaluation of buildings energy consumption and the optimization of district heating networks: a GIS-based model. Int J Energy Environ Eng ,7, 343–351.
Dietterich, T. G. and Kong, E. B. (1995). Machine learning bias, statistical bias, and statistical variance of decision tree algorithms. Technical report, Department of Computer Science, Oregon State University.
Encyclopædia Britannica, Inc.
Fatemi Bushehri, S. M., and Sardari zarchi, M. (2017). A Proposal for a Model for Diagnosis and Classification of Exceptional Children with learning Disabilities by Using Intelligent Expert Systems. Middle Eastern Journal of Disability Studies,.7, 19.
Fazelpour, F., Tarashkar, N., and Rosen, M. A. (2016). Short-term wind speed forecasting using artificial neural networks for Tehran, Iran. International Journal of Energy and Environmental Engineering, 7(4), 377–390.
Gen, M., and Lin, L. (2007). Genetic algorithms. Wiley Encyclopedia of Computer Science and Engineering, 1–15.
Haouraji, C., Mounir, B., Mounir, I., and Farchi, A. (2020). A correlative approach, combining energy consumption, urbanization, and GDP, for modeling and forecasting Morocco's residential energy consumption. International journal of energy and environmental engineering, 11(1), 163-176.
Hu, H. Wang, L., Peng, L., and Zeng, Y. (2020). Effective energy consumption forecasting using enhanced bagged echo state network. Energy, 193, 116778.
Jamadi, M., Merrikh-Bayat, F. and Bigdeli, M. (2016). Very accurate parameter estimation of single-and double-diode solar cell models using a modified artificial bee colony algorithm. International Journal of Energy and Environmental Engineering, 7(1), 13–25.
Jaramillo, J., Velasquez, J. D., and Franco, C. J. (2017). Research in financial time series forecasting with SVM: Contributions from literature. IEEE Latin America Transactions, 15(1), 145–153.
Karegowda, A. G., Manjunath, A. S., and Jayaram, M. A. (2010). Comparative study of attribute selection using gain ratio and correlation-based feature selection. International Journal of Information Technology and Knowledge Management, 2(2), 271–277.
Kazemi, H., Hosseinzadeh, R. (2020). Decomposition analysis of Changes in Energy Consumption in Iran: Structural Decomposition Analysis. Environmental Energy and Economic Research, 4(3), 231-239.
Li, J. (2017). Feature selection: A data perspective. ACM Computing Surveys (CSUR), 50(6), 1–45.
McClendon, L., and Meghanathan, N. (2015). Using machine learning algorithms to analyze crime data. Machine Learning and Applications: An International Journal (MLAIJ), 2(1), 1–12.
Myttenaere, A., Golden, B., Le Grand, B., and Rossi, F. (2016). Mean absolute percentage error for regression models. Neurocomputing, 192, 38–48.
Niu, D., Wang,Y., and Wu, D. D. (2010). Power load forecasting using support vector machine and ant colony optimization. Expert systems with Applications, 37(3), 2531–2539.
Pino-Mejias, R., Pérez-Fargallo, A., Rubio-Bellido, C., and Pulido-Arcas, J. A. (2017). Comparison of linear regression and artificial neural networks models to predict heating and cooling energy demand, energy consumption and CO2 emissions. Energy, 118, 24–36.
Rostami, O., and Kaveh, M. (2021). Optimal feature selection for SAR image classification using biogeography-based optimization (BBO), artificial bee colony (ABC) and support vector machine (SVM): a combined approach of optimization and machine learning. Computational Geosciences, 25(3), 911–930.
Somu, N., MR, G. R., and Ramamritham, K. (2020). A hybrid model for building energy consumption forecasting using long short term memory networks. Applied Energy, 261, 114131.
Son, H., and Kim, C. (2015). Forecasting short-term electricity demand in residential sector based on support vector regression and fuzzy-rough feature selection with particle swarm optimization. Procedia Engineering, 118, 1162–1168.
Tabakhi, S., Moradi, P., and Akhlaghian, F. (2014). An unsupervised feature selection algorithm based on ant colony optimization. Engineering Applications of Artificial Intelligence, 32, 112–123.
Wei, N., Li, C. Peng, X., Zeng, F., and Lu, X. (2019). Conventional models and artificial intelligence-based models for energy consumption forecasting: A review.  Journal of Petroleum Science and Engineering, 181, 106187.
Vieira, A. C. (2021). Improving flood forecasting through feature selection by a genetic algorithm‐experiments based on real data from an Amazon rainforest river. Earth Science Informatics, 14(1), 37–50.
 Xiao, J.,  Li, Y.,  Xie, L.,  Liu, D., and Huang, J. (2018). A hybrid model based on selective ensemble for energy consumption forecasting in China. Energy, 159, 534–546.
Xue, B., Zhang, M., and Browne, W. N. (2012). Particle swarm optimization for feature selection in classification: A multi-objective approach. IEEE transactions on cybernetics, 43(6), 1656–1671.
Yujun, Y., Yimei, Y., and Jianping, L. (2016). Research on financial time series forecasting based on SVM. in 13th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP), 346–349.
Zhao, X., and Luo, D. (2018). Forecasting fossil energy consumption structure toward low-carbon and sustainable economy in China: Evidence and policy responses. Energy strategy reviews, 22, 303–312.