Data Science Competition中的工具汇总

除了基础的pandas,scikit-learn,numpy,matplotlib,seaborn以外

( 1 ) category_encoders github

属于scikit-learn compatible projects之一,下面是Binary Encoding和One-hot Encoding的区别

-----------------------------------------------
|   Level   | "Decimal  | Binary   | One hot  |
|           | encoding" | encoding | encoding |
-----------------------------------------------
| No        |     0     |    000   |  000001  |
| Primary   |     1     |    001   |  000010  |
| Secondary |     2     |    010   |  000100  |
| BSc/BA    |     3     |    011   |  001000  |
| MSc/MA    |     4     |    100   |  010000  |
| PhD       |     5     |    101   |  100000  |
-----------------------------------------------

( 2 )

... to be continued

猜你喜欢

转载自www.cnblogs.com/wuweijiecug/p/10089639.html