我正在努力理解在使用randomizedSearch调整xgbRegressor模型的超参数时,
具体来说,它如何与
以下是代码:
文件说明中说它是参数设置的数量。输出日志将
n_iter
参数的含义。具体来说,它如何与
cv
参数一起使用?以下是代码:
# parameter distributions
params = {
"colsample_bytree": uniform(0.7, 0.3), # fraction of cols to sample
"gamma": uniform(0, 0.5), # min loss reduction required for next split
"learning_rate": uniform(0.03, 0.3), # default 0.1
"max_depth": randint(2, 6), # default 6, controls model complexity and overfitting
"n_estimators": randint(100, 150), # default 100
"subsample": uniform(0.6, 0.4) # % of rows to use in training sample
}
rsearch = RandomizedSearchCV(model, param_distributions=params, random_state=42, n_iter=200, cv=3, verbose=1, n_jobs=1, return_train_score=True)
# Fit model
rsearch.fit(X_train, y_train)
Fitting 3 folds for each of 200 candidates, totalling 600 fits
文件说明中说它是参数设置的数量。输出日志将
n_iter
称为候选项。这到底是什么意思?
n_iter
=200,这意味着模型将根据指定的参数分布和cv
参数的指定值进行200次拟合。 - kms