Large data synchronization technology and large data DataX cleaned using the techniques of the Kettle

Teaching video: http:? //113.31.104.47/portal/#/course/courseDetail/b34d160db64624732ef152a1118af11a courseId = 1b7e84f4eb8552536e2267093dbd7972

I watched that cloud Rio de training portal, because I did not use experimental Rio de environment, so watching an instructional video that will inevitably encounter some errors

The first is the use of DataX,

Because there is no use Danastudio, so after the completion of the next DataX is running in the CMD

Problems encountered are:
1.DataX support is Python2, and I was python3, so remind me running print to add (), this is not in Python2, there is a Exception as e, and write DataX is Exception, e, this being given in the Python3, these changed after it

Writing 2.json format, because there is no use Danastudio, json to himself to write, so many mistakes encountered

The official gave written format, you can modify according to their needs: https: //github.com/alibaba/DataX

Kettle big data cleaned using the techniques of

This video did not use German Billiton software, and instructional videos so similar, but also encountered some pit

1.jdk version, I was initially carried out jdk10.0, leading to DB connection open, after it replaced jdk1.8

2. Video is in use PostgreSQL, but I want to use is MySQL

3. Because using a mysql, so the target pattern in the final output tables do not write, I want to write on video after being given a public

Guess you like

Origin www.cnblogs.com/liujinxin123/p/12380064.html