哑变量的处理

版权声明:本文为博主原创文章,未经博主允许不得转载。 https://blog.csdn.net/G090909/article/details/54629821

library(caret)

将几个离散变量转换为因子,目的便于下面一次性处理哑变量

final_house cluster<factor(clust  cluster)
final_house floow<factor(final h ouse  floow)
final_house type.new<factor(final h ouse  type.new)

筛选出所有因子型变量

factors <- names(final_house)[sapply(final_house, class) == ‘factor’]

将因子型变量转换成公式formula的右半边形式

formula <- f <- as.formula(paste(‘~’, paste(factors, collapse = ‘+’)))
dummy <- dummyVars(formula = formula, data = final_house)
pred <- predict(dummy, newdata = final_house)
head(pred)

将哑变量规整到final_house数据集中

final_house2 <- cbind(final_house,pred)

猜你喜欢

转载自blog.csdn.net/G090909/article/details/54629821