서울시 모기발생상황 지표 예측

서울시 모기발생상황 지표 예측#

데이터 출처 :
https://data.kma.go.kr/stcs/grnd/grndTaList.do?pgmNo=70 (기상청)
https://data.seoul.go.kr/dataList/16/literacyView.do (서울공공데이터포털)

Attention

2016년~ 2019년까지의 일별 모기지수 데이터를 온도,강수량 데이터를 통해 예측해본다.
평가지표는 r2 score

DataLoad

데이터 로드

import pandas as pd
train_x =pd.read_csv('https://raw.githubusercontent.com/Datamanim/mosquito/main/train_x.csv',encoding='euc-kr')
train_y =pd.read_csv('https://raw.githubusercontent.com/Datamanim/mosquito/main/train_y.csv',encoding='euc-kr')
test_x =pd.read_csv('https://raw.githubusercontent.com/Datamanim/mosquito/main/test_x.csv',encoding='euc-kr')
sub    =pd.read_csv('https://raw.githubusercontent.com/Datamanim/mosquito/main/sub.csv')

DATA

데이터셋 확인

train_x.head()

	date	강수량(mm)	평균기온(℃)	최저기온(℃)	최고기온(℃)
0	2019-12-31	0.0	-7.9	-10.9	-4.5
1	2019-12-30	0.4	2.7	-5.7	6.8
2	2019-12-29	1.4	3.8	1.1	6.2
3	2019-12-27	0.0	-1.7	-4.6	2.6
4	2019-12-25	0.0	2.0	-2.7	6.6

train_y.head()

	date	mosquito_ratio
0	2019-12-31	5.5
1	2019-12-30	5.5
2	2019-12-29	5.5
3	2019-12-27	5.5
4	2019-12-25	5.5

baseLine

베이스라인 코드입니다.

print(Ans)

randomforest r2 : 0.8477788464778293 
xgboost r2 : 0.8494664636000008

Tip

제출코드 결과확인

final_mse = FinalMseScore()

submission mse score :  0.8800627717083699