Revision as of 01:07, 7 December 2021

External

Internal

Overview

Spark is a third-generation unified analytics engine for large-scale data processing. It natively supports batch processing and stream processing. Stream processing is implemented as micro-batching: incoming records are grouped into small batches, and each batch is processed as a short batch job. Spark uses HDFS as its state backend.
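The micro-batching idea can be illustrated with a small plain-Python sketch. This is not Spark code and does not reflect Spark's internal implementation; the function name `micro_batches`, the `Event` tuple shape, and the fixed `interval` parameter are all assumptions made for illustration. It only shows how a time-ordered stream can be cut into consecutive fixed-length intervals so that each group can be handed to ordinary batch logic.

```python
from typing import Iterable, Iterator, List, Tuple

# Hypothetical event shape: (arrival time in seconds, payload).
Event = Tuple[float, str]


def micro_batches(events: Iterable[Event], interval: float) -> Iterator[List[Event]]:
    """Group time-ordered events into consecutive windows of `interval` seconds.

    Windows that contain no events are skipped in this sketch.
    """
    batch: List[Event] = []
    window_end = None
    for ts, payload in events:
        if window_end is None:
            # The first event opens the first window.
            window_end = ts + interval
        while ts >= window_end:
            # The event belongs to a later window: emit what was collected
            # so far (if anything) and advance to the next window.
            if batch:
                yield batch
                batch = []
            window_end += interval
        batch.append((ts, payload))
    if batch:
        # Emit the final, possibly partial, batch.
        yield batch


if __name__ == "__main__":
    stream = [(0.0, "a"), (0.4, "b"), (1.1, "c"), (3.2, "d")]
    for batch in micro_batches(stream, interval=1.0):
        # Each batch could now be processed with regular batch logic.
        print([payload for _, payload in batch])
```

In this sketch the latency of a record is bounded below by the batch interval, which is the usual trade-off of micro-batching compared with record-at-a-time stream processing.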

Subjects

Concepts

@@ Line 8: / Line 8: @@
 =Overview=
-Spark is a third generation unified analytics engine for large-scale data processing. It natively supports [[System_Design#Batch_Processing|batch processing]] and [[System_Design#Stream_Processing|stream processing]]. Stream processing is implemented as micro-batching.
+Spark is a third generation unified analytics engine for large-scale data processing. It natively supports [[System_Design#Batch_Processing|batch processing]] and [[System_Design#Stream_Processing|stream processing]]. Stream processing is implemented as micro-batching. It uses [[HDFS]] as state backend.
 =Subjects=
 * [[Spark Concepts|Concepts]]

Spark: Difference between revisions
