You never imagined that these are the favorite English words of the Korean girl group

South Korea is really a magical country. A company with only three stars (Samsung), but named its own mobile phone "Galaxy" (meaning galaxy in English), with dozens of "five-star" hotels. Wang Jianlin said that Koreans are much taller than us when it comes to pretending to be forceful. We Chinese are a down-to-earth and hard-working nation, so we can achieve a small goal every day!
After hearing this, Koreans said that invisible force is the most deadly. IGNB.

The naming style of the Korean girl group also continues the same idea. Just a few little girls and a combination can be called "stellar". He is obviously on the earth, but he wants "hello venus" (Hello Venus). Obviously dressed in cool summer clothes, the name of the combination is called "red velvet" (red velvet). It’s not enough to have "girls generation", we need to form another "girls day". If the vocabulary is not big enough, star chasers will seem to lack confidence. It is also thanks to their names that are so literate that I can be confident, brazen and blush to explain to my mother, "Mom, I'm not chasing the girl group, I'm learning English".

Anyone who knows a little about Korean girl groups knows that their lyrics are always mixed with a few English words from time to time, and some of the songs are even English words, such as "lion heart", "troube maker", "vibrato" ", "genie" (genie is actually an English word? It really makes me feel my ignorance again).
Insert picture description here
So, what is the favorite English word of the Korean girl group? We can use the selenium framework, create a crawler tool to extract all their lyrics, and then perform word frequency analysis and statistics.

First, select the music platform to be crawled. For each girl group, QQ Music and NetEase Cloud Music can only display up to 60 songs, and Kugou Music can only display up to 30 songs. To see more content, To download the client, Xiami Music’s attitude towards crawling tools is almost strictly guarded against deadlocks, and there are many obstacles, even using proxy tools, it cannot be broken. Due to various considerations, Kuwo Music was finally selected as the music platform to be crawled. You can see all the songs of the girl group, and the quality of the lyrics is relatively high.

I would like to thank Kuwo Music for its tolerance and sharing spirit.

Let’s analyze the structure of the website first. It is a combination of http://www.kuwo.cn/artist/content?name= and the artist’s name. Replace stellar with your favorite artist’s name, and you can see his homepage That's it.
Insert picture description here
After knowing the URL of the singer's homepage, you can start a simulated browser, enter the URL, and click Enter to see his homepage. The code is the following lines:

driver=webdriver.Opera()
driver.get('http://www.kuwo.cn/artist/content?name=stellar')

Then, get the "Songs" tab, click to switch to the "Songs" page, otherwise, you can only see the basic information introduction of the singers you have watched countless times. This line of code will do:

driver.find_element_by_xpath('//*[@id="tab_music"]/span').click()

Then, it uses regular expressions to extract the id of all the songs on the playlist page, and you can directly access the song page with the song id. The URL example is http://www.kuwo.cn/yinyue/1162458, replace 1162458 with other songs The id can be transferred to the page of other songs, the code is as follows:

patten=re.compile(r'href="/yinyue/\d+"')
result=patten.findall(str(bs_obj))

Then, when we enter the song page, we can extract the lyrics, again using regular expressions:

pat=re.compile('class="lrcItem">.*</p>')
result=pat.findall(str(j))

Insert picture description here
Finally, just output the collected lyrics to a text file. Some results are as follows:
Insert picture description here

Next, we analyze and count word frequency. Read all lyrics files, extract the English words that appear in them, count the number of occurrences of these words, and the final statistical results are as follows:

Insert picture description here
Analyzing this statistical result, it can be seen that the favorite English words of Korean women’s groups are mainly divided into three categories: the
first category is modal particles, such as "hey", "oh", "yeah", "ah", "woo" ","Huh","whoo". As we all know, modal particles have a strong empathy ability, just like the Chinese "Fucking", they can instantly let your friends understand your emotions. In addition, when rehearsing, using more modal particles can also facilitate communication among women group members. Cheer each other up.
The second category is words that express the relationship between characters, such as "you", "me", "my", "baby", "boy", "love", "we", which are used between lovers who seem to be in contact The nickname is to build an imaginary private space where fans and young ladies live together, giving fans an unparalleled, climax immersive experience.
The third category is words that express actions, such as "know", "go", "come", "hear", "come", "let". It is possible that the English textbooks of the Republic of Korea and our country are from the beginning to the end. "Have fun", "have a rest", "go to cinema" are simple phrases that can't make points in college entrance examination English composition. But the advantages of this simple phrase are unparalleled, that is, easy to understand, absolutely easy to understand, and easy to understand at the highest level. Even elementary school students will not be unable to chase the Korean girl group because of insufficient English ability ( Let’s talk about the large-scale sultry moves in the girl group MV that are not suitable for elementary school students.)

All in all, the central idea of ​​this article is: There are quite a few Korean ladies who are good in foreign language.

Guess you like

Origin blog.csdn.net/esa72ya/article/details/89371441