Apache Flink
Apache Flink
Additionally, Flink has connectors for third-party data sources, such as the following:
Amazon EMR supports Flink as a YARN application so that you can manage resources along with other applications within a cluster. Flink-on-YARN allows you to submit transient Flink jobs, or you can create a long-running cluster that accepts multiple jobs and allocates resources according to the overall YARN reservation.
Flink is included in Amazon EMR release versions 5.1.0 and later.
Note
Support for the FlinkKinesisConsumer
class was added in Amazon EMR release
version 5.2.1.
The following table lists the version of Flink included in the latest release of the Amazon EMR 7.x series, along with the components that Amazon EMR installs with Flink.
For the version of components installed with Flink in this release, see Release 7.5.0 Component Versions.
Amazon EMR Release Label | Flink Version | Components Installed With Flink |
---|---|---|
emr-7.5.0 |
Flink 1.19.1 |
emrfs, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, flink-client, flink-jobmanager-config, hudi, delta |
The following table lists the version of Flink included in the latest release of the Amazon EMR 6.x series, along with the components that Amazon EMR installs with Flink.
For the version of components installed with Flink in this release, see Release 6.15.0 Component Versions.
Amazon EMR Release Label | Flink Version | Components Installed With Flink |
---|---|---|
emr-6.15.0 |
Flink 1.17.1 |
emrfs, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, flink-client, flink-jobmanager-config, hudi, delta-standalone-connectors |
The following table lists the version of Flink included in the latest release of the Amazon EMR 5.x series, along with the components that Amazon EMR installs with Flink.
For the version of components installed with Flink in this release, see Release 5.36.2 Component Versions.
Amazon EMR Release Label | Flink Version | Components Installed With Flink |
---|---|---|
emr-5.36.2 |
Flink 1.14.2 |
emrfs, hadoop-client, hadoop-mapred, hadoop-hdfs-datanode, hadoop-hdfs-library, hadoop-hdfs-namenode, hadoop-httpfs-server, hadoop-kms-server, hadoop-yarn-nodemanager, hadoop-yarn-resourcemanager, hadoop-yarn-timeline-server, flink-client, flink-jobmanager-config |
Topics
- Creating a cluster with Flink
- Configuring Flink in Amazon EMR
- Working with Flink jobs in Amazon EMR
- Use the Scala shell
- Finding the Flink web interface
- Flink autoscaler
- Optimizing job restart times for task recovery and scaling operations
- Working with Flink jobs from Zeppelin in Amazon EMR
- Flink release history