پیش بینی بارش‌های سنگین منطقه‌ای در جنوب غرب ایران: رویکردی مبتنی بر داده‌کاوی

پذیرفته شده برای ارائه شفاهی

کد مقاله : 1260-SCWMCCAN (R1)

نویسندگان

کوکب شاهقلیان ¹ ، جواد بذرافشان ² ، پرویز ایران نژاد ³ ، داریوش رعنازاده ⁴

¹فارغ التحصیل دکتری دانشگاه تهران

²استاد گروه آبیاری و آبادانی دانشگاه تهران

³استاد بازنشسته گروه فیزیک فضا دانشگاه تهران

⁴دانشجوی دکتری دانشگاه تهران

چکیده

پیش‌بینی دقیق و به‌موقع بارش‌های سنگین منطقه‌ای نقشی حیاتی در پیشگیری از سیل و به حداقل رساندن خسارات ناشی از آن ایفا می‌کند. این مطالعه، تعریفی نوین از بارش سنگین منطقه‌ای بر مبنای الگوی احتمالاتی رگبارها ارائه می‌دهد. برای این منظور، از داده‌های بارش روزانه (1987 تا 2018) مربوط به 12 ایستگاه همدید در جنوب غرب ایران استفاده شد. علاوه بر داده‌های بارش، شش متغیر همدیدی در ترازهای 1000 تا 200 هکتوپاسکال، مربوط به یک تا پنج روز قبل از وقوع بارش سنگین، به عنوان پیش‌بینی‌کننده در نظر گرفته شدند. این متغیرها گستره وسیعی فراتر از منطقه مطالعاتی را پوشش می‌دهند. این پژوهش با استفاده از دو روش انتخاب متغیر و هفت مدل یادگیری ماشین از نوع طبقه‌بندی‌کننده دودویی انجام شد. در سناریوی تعریف شده، از داده‌های روزهای پیش از رخداد بارش سنگین به صورت پیوسته استفاده شد. مطالعه حاضر نشان داد که برای تشخیص دقیق بارش‌های سنگین از بارش‌های معمولی، استفاده از داده‌های بارش تا چهار روز قبل از وقوع بارش در مدل Random Forest با روش انتخاب متغیر Chi-Square، به عنوان برترین روش عمل می‌کند. متغیرهای رطوبت نسبی در سه سطح فشاری 1000، 700 و 500 هکنوپاسکال و مولفه مداری باد در سطح فشاری 1000 هکتوپاسکال متغیرهای با بیشترین تکرار مورد استفاده در این مدل بودند.

کلیدواژه ها

بارش سنگین منطقه‌ای؛ پیش‏بینی؛ داده‏کاوی؛ متغیرهای همدیدی؛ ایران

Title

Predicting Regional Heavy Precipitation in Southwest Iran: A Data Mining Approach

Authors

Kokab Shahgholian, Javad Bazrafshan, Parviz Irannejad, Dariush Ranazadeh

Abstract

Accurate and timely forecasting of regional heavy precipitation plays a crucial role in preventing floods and minimizing the resulting damage. This study presents a novel definition of regional heavy precipitation based on the probabilistic pattern of thunderstorms. For this purpose, daily precipitation data (1987 to 2018) from 12 synoptic stations in southwest Iran were used. In addition to precipitation data, six synoptic variables at pressure levels of 1000 to 200 hPa, corresponding to one to five days before the occurrence of heavy precipitation, were considered as predictors. These variables cover a wide area beyond the study region. This research was conducted using two variable selection methods and seven machine learning models of the binary classification type. In the defined scenario, precipitation data from the days before the occurrence of heavy precipitation was used continuously. The present study showed that for accurate discrimination between heavy and normal precipitation, the use of precipitation data up to four days before the occurrence of precipitation in the Random Forest model with the Chi-Square variable selection method is the best method. The relative humidity variables at three pressure levels of 1000, 700, and 500 hPa and the meridional wind component at the 1000 hPa pressure level were the most frequently used variables in this model.

Keywords

Regional heavy precipitation, Prediction, Data mining, Synoptic variables, Iran