PySpark spatial computing: learning summary

My previous work sometimes involved processing large volumes of spatial data. Recently, while isolated at home because of the novel coronavirus, I have been learning spatial computing with PySpark. I am very happy to have unlocked a new technology, so I am recording it here.

This series uses a Windows 10 environment. I want to record what I learn as a series of notes, and I hope I can stick with it!

One, Windows 10 big data environment installation

Two, Python big data environment installation

Three, developing a PySpark big data verification program on Windows 10 (in PyCharm and Jupyter Notebook)

 

Origin blog.csdn.net/Ocean111best/article/details/104282848