Here are a few links to facilitate learning:
What Java knowledge do you need to implement network data extraction : http://www.cnblogs.com/1996swg/p/7354661.html
A specific example of java implementing a web crawler: http://www.cnblogs.com/1996swg/p/7355577.html
About a simple web crawler Demo implemented in Java : https://www.cnblogs.com/huangwentian/p/6484534.html?utm_source=itdadao&utm_medium=referral