Spark Learning Road-1

1, Spark official website: http://spark.apache.org/

Apache Spark™ is a unified analytics engine for large-scale data processing.

2, Four features of Spark:

Speed

Run workloads up to 100x faster than Hadoop MapReduce.

Apache Spark achieves high performance for both batch and streaming data, using a state-of-the-art DAG scheduler, a query optimizer, and a physical execution engine.

Ease of Use

Write applications quickly in Java, Scala, Python, R, and SQL.

Spark offers over 80 high-level operators that make it easy to build parallel apps. And you can use it interactively from the Scala, Python, R, and SQL shells.
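For example, a classic word count takes only a few of those operators. Below is a minimal sketch in Scala for local experimentation; the input path "input.txt" is a placeholder, not something taken from the Spark documentation.

    import org.apache.spark.sql.SparkSession

    object WordCount {
      def main(args: Array[String]): Unit = {
        // Local SparkSession for experimenting on one machine.
        val spark = SparkSession.builder()
          .appName("WordCount")
          .master("local[*]")
          .getOrCreate()

        // A handful of high-level operators expresses a parallel word count.
        // "input.txt" is a placeholder path.
        val counts = spark.sparkContext.textFile("input.txt")
          .flatMap(line => line.split("\\s+"))
          .map(word => (word, 1))
          .reduceByKey(_ + _)

        counts.take(10).foreach(println)
        spark.stop()
      }
    }

The same few lines can also be typed directly into the interactive spark-shell, which already provides a SparkSession named spark.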

Generality

Combine SQL, streaming, and complex analytics.

Spark powers a stack of libraries including SQL and DataFrames, MLlib for machine learning, GraphX, and Spark Streaming. You can combine these libraries seamlessly in the same application.
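As a small illustration of that combination, here is a minimal sketch in which one application uses both the SQL interface and the DataFrame API on the same data (the data itself is made up for the example):

    import org.apache.spark.sql.SparkSession

    object CombinedExample {
      def main(args: Array[String]): Unit = {
        val spark = SparkSession.builder()
          .appName("CombinedExample")
          .master("local[*]")
          .getOrCreate()
        import spark.implicits._

        // A small in-memory DataFrame (made-up data for illustration).
        val events = Seq(("click", 3), ("view", 10), ("click", 7)).toDF("action", "amount")

        // Query it through the SQL interface...
        events.createOrReplaceTempView("events")
        val bySql = spark.sql("SELECT action, SUM(amount) AS total FROM events GROUP BY action")

        // ...and through the DataFrame API, in the same application.
        val byApi = events.groupBy("action").sum("amount")

        bySql.show()
        byApi.show()
        spark.stop()
      }
    }

The resulting DataFrames could just as easily be fed into MLlib or joined with a streaming source, since all of these libraries run on the same engine.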

Runs Everywhere

Spark runs on Hadoop, Apache Mesos, Kubernetes, standalone, or in the cloud. It can access diverse data sources.

You can run Spark using its standalone cluster mode, on EC2, on Hadoop YARN, on Mesos, or on Kubernetes. Access data in HDFS, Apache Cassandra, Apache HBase, Apache Hive, and hundreds of other data sources.
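In code, the choice of cluster manager comes down to the master URL. The sketch below shows the common forms; the host names and ports are placeholders:

    import org.apache.spark.sql.SparkSession

    // The same application can target different cluster managers by
    // changing only the master URL (host names here are placeholders).
    val spark = SparkSession.builder()
      .appName("RunsEverywhere")
      // .master("local[*]")                         // local mode, all cores
      // .master("spark://master-host:7077")         // standalone cluster
      // .master("yarn")                             // Hadoop YARN
      // .master("mesos://mesos-host:5050")          // Apache Mesos
      // .master("k8s://https://k8s-apiserver:443")  // Kubernetes
      .master("local[*]")
      .getOrCreate()

In practice the master is usually not hard-coded; it is passed at launch time through spark-submit's --master option, so the same jar can run unchanged on any of these managers.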

 

All Spark release packages, along with other Apache project packages, can be found in the Apache archive:

http://archive.apache.org/dist/

 

Spark runs on both Windows and UNIX-like systems (e.g. Linux, Mac OS). It is easy to run locally on one machine; all you need is to have java installed on your system PATH, or the JAVA_HOME environment variable pointing to a Java installation.

 

Spark runs on Java 8+, Python 2.7+/3.4+ and R 3.1+. For the Scala API, Spark 2.3.0 uses Scala 2.11. You will need to use a compatible Scala version (2.11.x).
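For a Scala project, that compatibility requirement shows up in the build definition. A minimal build.sbt sketch matching the versions above (the %% operator makes sbt append the _2.11 suffix to the artifact name):

    // build.sbt
    scalaVersion := "2.11.12"

    libraryDependencies += "org.apache.spark" %% "spark-sql" % "2.3.0"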

Note that support for Java 7, Python 2.6, and Hadoop versions before 2.6.5 was removed as of Spark 2.2.0. Support for Scala 2.10 was removed as of Spark 2.3.0.

 
