What are the new features of Apache Spark 2.4, which will be released in 2018?

This article is from the Apache Spark Meetup held at Adobe Systems Inc on September 19, 2018.

The upcoming Apache Spark 2.4 release is the fifth in the 2.x series. This article provides an overview of the key features and enhancements in Apache Spark 2.4.

  • The new scheduling model (Barrier Scheduling) enables users to properly embed distributed deep learning training into Spark stages to simplify the distributed training workflow.
  • Added 35 higher-order functions for array/map operations in Spark SQL.
  • Added a new native Avro data source based on Databricks' spark-avro module.
  • PySpark introduces an eager evaluation mode for all operations, intended for teaching and easier debugging.
  • Spark on Kubernetes now supports PySpark and R applications, as well as client mode.
  • Various enhancements to Structured Streaming, for example stateful operators in continuous processing.
  • Various performance improvements to built-in data sources, for example nested schema pruning for Parquet.
  • Support for Scala 2.12.
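
To make the barrier scheduling item more concrete, here is a minimal sketch of the idea in plain Python. It is not Spark code: threads stand in for the tasks of a stage, and `threading.Barrier` stands in for the synchronization point that PySpark 2.4 exposes as `BarrierTaskContext.get().barrier()` inside `rdd.barrier().mapPartitions(...)`. The task count and the "setup"/"train" phase names are assumptions made up for the illustration.

```python
import threading
from typing import List

NUM_TASKS = 4  # hypothetical number of tasks in the barrier stage

# All tasks share one barrier; nobody passes it until every task has arrived.
barrier = threading.Barrier(NUM_TASKS)
events: List[str] = []
events_lock = threading.Lock()


def task(task_id: int) -> None:
    # Phase 1: per-task setup (for example, loading a data partition).
    with events_lock:
        events.append(f"setup-{task_id}")

    # Synchronization point: block until all NUM_TASKS tasks reach this line.
    barrier.wait()

    # Phase 2: runs only after every task has finished phase 1,
    # which is what distributed training frameworks typically require.
    with events_lock:
        events.append(f"train-{task_id}")


threads = [threading.Thread(target=task, args=(i,)) for i in range(NUM_TASKS)]
for t in threads:
    t.start()
for t in threads:
    t.join()

setup_phase = events[:NUM_TASKS]
train_phase = events[NUM_TASKS:]
print(setup_phase)
print(train_phase)
```

The order of tasks within each phase can vary from run to run, but no "train" entry can appear before the last "setup" entry. That all-or-nothing behaviour is the property a barrier stage is meant to give to distributed training jobs.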
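
The higher-order functions for arrays are easier to picture with a small example. The sketch below models the semantics of four of them (`transform`, `filter`, `exists`, `aggregate`) in plain Python so it runs without a Spark installation. The helper names and sample data are illustrative only, and the model ignores SQL NULL handling. In Spark SQL the same operations are written with lambda syntax, for example `transform(xs, x -> x + 1)`.

```python
from functools import reduce
from typing import Callable, List, TypeVar

T = TypeVar("T")
U = TypeVar("U")


def transform(xs: List[T], f: Callable[[T], U]) -> List[U]:
    """Model of SQL `transform(xs, x -> f(x))`: apply f to every element."""
    return [f(x) for x in xs]


def filter_(xs: List[T], pred: Callable[[T], bool]) -> List[T]:
    """Model of SQL `filter(xs, x -> pred(x))`: keep elements matching pred."""
    return [x for x in xs if pred(x)]


def exists(xs: List[T], pred: Callable[[T], bool]) -> bool:
    """Model of SQL `exists(xs, x -> pred(x))`: true if any element matches."""
    return any(pred(x) for x in xs)


def aggregate(xs: List[T], zero: U, merge: Callable[[U, T], U]) -> U:
    """Model of SQL `aggregate(xs, zero, (acc, x) -> merge(acc, x))`."""
    return reduce(merge, xs, zero)


values = [1, 2, 3, 4]  # stands in for an array column value

incremented = transform(values, lambda x: x + 1)       # SQL: transform(xs, x -> x + 1)
evens = filter_(values, lambda x: x % 2 == 0)          # SQL: filter(xs, x -> x % 2 = 0)
has_big = exists(values, lambda x: x > 3)              # SQL: exists(xs, x -> x > 3)
total = aggregate(values, 0, lambda acc, x: acc + x)   # SQL: aggregate(xs, 0, (acc, x) -> acc + x)

print(incremented, evens, has_big, total)
```

The point of having these as built-in SQL functions is that nested data can be processed in place, without exploding the array into rows and grouping it back, and without writing a UDF.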
