Python + Scrapy: analyzing 4,000 quality posts from the cnblogs home page (illustrated)

1. Takedown on request

This article is based on data for 4,000 posts collected from the cnblogs home page. To avoid putting pressure on the cnblogs servers, the source code is not being released; the official cnblogs team can rest assured on that point.
The data covers 2019-7-11 to 2019-9-12 and will not be made public.
Since the data involves a number of bloggers, any blogger who does not want their data shown publicly can contact me and I will remove it promptly.

2. The data analysis first

Top 6 bloggers by number of posts -> (the most industrious bloggers) ♪ ('∇` *)

Among Movies -> 54 posts
Zhou Tong -> 38 posts
should ICT -> 30 posts
cut slightly cold -> 28 posts
dean as if Yin -> 26 posts
urchin little world -> 26 posts
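
A ranking like the one above can be produced with a simple frequency count over the author field. The following is only a minimal sketch, not the author's code (which is not released): it assumes each scraped post is a dict with a hypothetical "author" key, and the sample records are made up.

```python
from collections import Counter


def top_authors(posts, n=6):
    """Return the n authors with the most posts as (author, count) pairs."""
    # "author" is an assumed field name, not taken from the original data
    counts = Counter(post["author"] for post in posts)
    return counts.most_common(n)


# Made-up sample records for illustration only
sample_posts = [
    {"author": "alice", "title": "a"},
    {"author": "bob", "title": "b"},
    {"author": "alice", "title": "c"},
]

print(top_authors(sample_posts, n=2))  # [('alice', 2), ('bob', 1)]
```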

Top 5 most recommended posts -> (the posts everyone in the community is recommending) (≧ ∇ ≦)

"Comic | 'Twelve Hours of a Programmer in the Imperial Capital'" -> 220 recommendations
"Microservices Architecture Explained in One Article" -> 188 recommendations
".NET Core Learning Materials Selection: Getting Started" -> 155 recommendations
".NET Core Learning Materials Selection: Advanced" -> 152 recommendations
"[Site Notice] Second attempt at releasing the .NET Core version of the blog site" -> 119 recommendations

Top 5 most viewed posts -> (what the community most likes to read) ╰ ( ° ‿ ° ) ╯

"Why is it time to embrace .NET Core?" -> 12,660 views
"[Outage Notice] Releasing the .NET Core version of the blog site caused a large number of 500 errors" -> 11,373 views
"I (full) in Beijing in recent years" -> 11,282 views
"Changing the chassis at highway speed: the Windows and Linux deployments both held up, but the repair work is arduous" -> 9,908 views
"[Site Notice] Second attempt at releasing the .NET Core version of the blog site" -> 9,813 views

Top 5 most commented posts -> (all of them are posts by the cnblogs team) ︿ (¯)¯) ︿

"Powered by .NET Core progress: verifying Docker Swarm as the suspect in the high-concurrency performance problem" -> 408 comments
"[Site Notice] Second attempt at releasing the .NET Core version of the blog site" -> 394 comments
"[Outage Notice] Failure after upgrading the Alibaba Cloud RDS SQL Server instance" -> 168 comments
"[Outage Notice] Releasing the .NET Core version of the blog site caused a large number of 500 errors" -> 153 comments
"Changing the chassis at highway speed: the Windows and Linux deployments both held up, but the repair work is arduous" -> 152 comments

Most frequent keywords in post titles -> (the topics everyone cares about most) (1 • ㅂ •) و✧

Keyword        Count    Keyword        Count    Keyword            Count
.net           341      java           292      spring             291
python         153      javascript     116      algorithm          112
sql            100      c#             90       data structure     73
view           71       architecture   69       interview          57
programmer     54       linux          52       machine learning   51
database       50       front end      49       mybatis            46
web crawler    38       mini programs  31       back end           27
react          26       window         24       css                21
mongodb        19       json           18       c++                18
html           18       big data       16       Alibaba            14
php            13       Baidu          11       angular            3
Tencent        3
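
How the keywords were matched is not stated in the article. One plausible approach, shown here purely as an assumption, is a case-insensitive substring check that counts the number of titles containing each keyword. The keyword list and sample titles below are illustrative, not the real data.

```python
from collections import Counter

# Illustrative subset of keywords; the full list used by the author is unknown
KEYWORDS = [".net", "java", "spring", "python", "javascript", "c#", "c++"]


def keyword_counts(titles, keywords=KEYWORDS):
    """Count how many titles contain each keyword (case-insensitive substring)."""
    counts = Counter()
    for title in titles:
        lowered = title.lower()
        for keyword in keywords:
            if keyword in lowered:
                counts[keyword] += 1
    return counts


# Made-up titles for illustration only
sample_titles = [
    ".NET Core notes",
    "Java and Spring basics",
    "JavaScript tips",
    "Python crawler",
]

counts = keyword_counts(sample_titles)
print(counts["java"], counts["javascript"])  # 2 1
```

Note the caveat with plain substring matching: a title containing "JavaScript" is also counted under "java". Whether the figures in the table include such overlaps cannot be determined from the article.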

Which day of the week do people like to post on? -> (noticeably fewer posts on Saturday and Sunday; it seems everyone slacks off at work like a professional) (° ー ° 〃)

Day of week    Total posts published
Monday         668
Tuesday        649
Wednesday      631
Thursday       630
Friday         570
Saturday       420
Sunday         430
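
A per-weekday tally like this can be derived from each post's publish time. The sketch below assumes timestamps are stored as strings in a "YYYY-MM-DD HH:MM" format; that format, the function name, and the sample values are all assumptions made for illustration.

```python
from collections import Counter
from datetime import datetime

WEEKDAYS = ["Monday", "Tuesday", "Wednesday", "Thursday",
            "Friday", "Saturday", "Sunday"]


def posts_per_weekday(timestamps, fmt="%Y-%m-%d %H:%M"):
    """Return a dict mapping weekday name to the number of posts on that day."""
    # datetime.weekday() returns 0 for Monday through 6 for Sunday
    counts = Counter(datetime.strptime(ts, fmt).weekday() for ts in timestamps)
    return {name: counts.get(i, 0) for i, name in enumerate(WEEKDAYS)}


# Made-up timestamps for illustration only
sample_times = ["2019-07-11 09:30", "2019-07-11 14:00", "2019-07-13 20:15"]

per_day = posts_per_weekday(sample_times)
print(per_day["Thursday"], per_day["Saturday"])  # 2 1
```

Grouping on `datetime.hour` instead of `weekday()` would give the hour-of-day distribution in the same way.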

Peak posting hours of the day -> (when the cnblogs servers are under the heaviest load) 9 (͡ ๏ ̯͡ ๏) 6

Origin www.cnblogs.com/Juaoie/p/11517134.html