1. New sites are indexed more slowly.
2. The article quality is low.
Content that is hard to read, has chaotic typography, or is scraped from other sites is difficult to get indexed.
3. The site is being demoted (its weight has been reduced).
4. Spiders do not visit [site configuration].
Check whether the site blocks spider crawling [robots]; if it does not, look at the site's backlinks and logs.
5. Many pages have not been indexed recently.
The main factors: the site has been removed from the index or penalized, or it has too few backlinks and so lacks enough backlink support.
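For item 4, one way to see whether spiders visit at all is to count their requests in the server access log. The sketch below is only an illustration: the function name count_spider_hits and the sample log lines are made up, and it assumes each log line contains the visitor's user-agent string, which depends on how the server is configured.

```python
def count_spider_hits(log_lines, marker="Baiduspider"):
    """Count log lines whose text mentions the given spider name.

    Assumption: the access log records the user-agent string on each line.
    """
    needle = marker.lower()
    return sum(1 for line in log_lines if needle in line.lower())


# Made-up sample lines for illustration only; a real log will look different.
SAMPLE_LOG = [
    '1.2.3.4 - - "GET / HTTP/1.1" 200 "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"',
    '5.6.7.8 - - "GET /about HTTP/1.1" 200 "Mozilla/5.0 (Windows NT 10.0)"',
    '1.2.3.4 - - "GET /post/1 HTTP/1.1" 200 "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"',
]

hits = count_spider_hits(SAMPLE_LOG)
```

A count of zero over several days would suggest the spider is not reaching the site, which points back to the robots settings or to a lack of backlinks.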
Handling case 4: the site's layout and keywords are fine, the content quality is fine, the content is updated, and backlinks are continually published on some high-weight platforms. So why does Baiduspider still not index the pages regularly?
a. Check whether the site blocks Baiduspider from crawling.
1. Review the site's robots.txt file
User-agent: *
Disallow: /
=========== blocks all search engine spiders from crawling anything
User-agent: Baiduspider
Disallow: /
=========== blocks the Baidu search engine spider from crawling
---------------- Solution
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-content/
============== change Disallow: / so that only the specified directories are blocked
Allow: /
============== allows access to everything else
2. Check whether the page source, between <head> and </head>, contains <meta name="robots" content="noindex, follow"> or <meta name="Baiduspider" content="noindex, follow">. Either tag asks the spider not to index the page.
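Both checks under a. can be tried out with a short script. This is a minimal sketch using only Python's standard library; the helper names can_crawl and find_noindex and the example.com URLs are made up for illustration. It only reflects how Python's own robots.txt parser reads the rules, which may not match Baiduspider exactly, and the meta check scans the whole page rather than only the <head> section.

```python
from html.parser import HTMLParser
from urllib.robotparser import RobotFileParser


def can_crawl(robots_txt, user_agent, url):
    """Return True if the given robots.txt text lets user_agent fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


class RobotsMetaFinder(HTMLParser):
    """Collect <meta name="robots"/"Baiduspider"> tags that contain noindex."""

    def __init__(self):
        super().__init__()
        self.blocking = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {key: (value or "") for key, value in attrs}
        name = attr.get("name", "").strip().lower()
        content = attr.get("content", "")
        directives = [part.strip().lower() for part in content.split(",")]
        if name in ("robots", "baiduspider") and "noindex" in directives:
            self.blocking.append((name, content))


def find_noindex(html):
    """Return a list of (name, content) pairs for noindex meta tags."""
    finder = RobotsMetaFinder()
    finder.feed(html)
    return finder.blocking


# The robots.txt variants discussed above.
BLOCK_ALL = "User-agent: *\nDisallow: /\n"
BLOCK_BAIDU = "User-agent: Baiduspider\nDisallow: /\n"
FIXED = (
    "User-agent: *\n"
    "Disallow: /wp-admin/\n"
    "Disallow: /wp-content/\n"
    "Allow: /\n"
)

PAGE = '<html><head><meta name="Baiduspider" content="noindex, follow"></head></html>'

if __name__ == "__main__":
    url = "http://example.com/post/1"  # hypothetical page URL
    for label, text in (("block all", BLOCK_ALL), ("block Baidu", BLOCK_BAIDU), ("fixed", FIXED)):
        print(label, can_crawl(text, "Baiduspider", url))
    print(find_noindex(PAGE))
```

To check a live site, the robots.txt text and page source would first have to be downloaded; the sketch takes them as plain strings so the logic can be tested without network access.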