I have started learning machine learning in Python with Pandas and Sklearn.
I tried to use the LinearRegression().fit method:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
house_data = pd.read_csv(r"C:\Users\yassine\Desktop\ml\OC-tp-ML\house_data.csv")
y = house_data[["price"]]
x = house_data[["surface","arrondissement"]]
X = house_data.iloc[:, 1:3].values
x_train, x_test, y_train, y_test = train_test_split(x, y, test_size=0.25, random_state=1)
model = LinearRegression()
model.fit(x_train, y_train)
When I run the code, I get the following message:
ValueError: Input contains NaN, infinity or a value too large for dtype('float64').
Can you help me, please?
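Your data contains NaN values, infinite values, or extremely large values, which scikit cannot handle. Check your data for rows containing NaN and try removing them. - G. Anderson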
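
A minimal sketch of the check suggested in the comment, under a few assumptions: the small DataFrame below is a hypothetical stand-in for house_data.csv (the real file's contents are not shown in the question), and dropping incomplete rows is only one possible fix; filling the gaps with an imputer may suit the real data better.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split

# Hypothetical stand-in for house_data.csv; the values are made up for illustration.
house_data = pd.DataFrame({
    "price": [1330, 1400, 904, 955, 2545, 1560, np.nan, 1200],
    "surface": [37.0, 32.0, 26.0, 30.0, 70.0, np.nan, 40.0, 35.0],
    "arrondissement": [1.0, 1.0, 1.0, 2.0, 2.0, 3.0, np.nan, 4.0],
})

cols = ["price", "surface", "arrondissement"]

# Count the missing values in each column used by the model.
nan_counts = house_data[cols].isna().sum()
print(nan_counts)

# Treat +/-inf as missing too, then drop every row that has a missing value.
clean = house_data[cols].replace([np.inf, -np.inf], np.nan).dropna()

x = clean[["surface", "arrondissement"]]
y = clean[["price"]]
x_train, x_test, y_train, y_test = train_test_split(
    x, y, test_size=0.25, random_state=1
)

# With the incomplete rows removed, fit() no longer raises the ValueError.
model = LinearRegression()
model.fit(x_train, y_train)
```

Doing the cleaning on one DataFrame before splitting it into x and y keeps the features and the target aligned row for row.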