Get started with EFA and MPI for HPC workloads on Amazon EC2
This tutorial helps you to launch an EFA and MPI-enabled instance cluster for HPC workloads.
Tasks
- Step 1: Prepare an EFA-enabled security group
- Step 2: Launch a temporary instance
- Step 3: Install the EFA software
- Step 4: (Optional) Enable Open MPI 5
- Step 5: (Optional) Install Intel MPI
- Step 6: Disable ptrace protection
- Step 7: Confirm installation
- Step 8: Install your HPC application
- Step 9: Create an EFA-enabled AMI
- Step 10: Launch EFA-enabled instances into a cluster placement group
- Step 11: Terminate the temporary instance
- Step 12: Enable passwordless SSH
Step 1: Prepare an EFA-enabled security group
An EFA requires a security group that allows all inbound and outbound traffic to and from the security group itself. The following procedure creates a security group that allows all inbound and outbound traffic to and from itself, and that allows inbound SSH traffic from any IPv4 address for SSH connectivity.
Important
This security group is intended for testing purposes only. For your production environments, we recommend that you create an inbound SSH rule that allows traffic only from the IP address from which you are connecting, such as the IP address of your computer, or a range of IP addresses in your local network.
For other scenarios, see Security group rules for different use cases.
To create an EFA-enabled security group
-
Open the Amazon EC2 console at https://console.aws.amazon.com/ec2/.
-
In the navigation pane, choose Security Groups and then choose Create security group.
-
In the Create security group window, do the following:
-
For Security group name, enter a descriptive name for the security group, such as EFA-enabled security group.
-
(Optional) For Description, enter a brief description of the security group.
-
For VPC, select the VPC into which you intend to launch your EFA-enabled instances.
-
Choose Create security group.
-
Select the security group that you created, and on the Details tab, copy the Security group ID.
-
With the security group still selected, choose Actions, Edit inbound rules, and then do the following:
-
Choose Add rule.
-
For Type, choose All traffic.
-
For Source type, choose Custom and paste the security group ID that you copied into the field.
-
Choose Add rule.
-
For Type, choose SSH.
-
For Source type, choose Anywhere-IPv4.
-
Choose Save rules.
-
With the security group still selected, choose Actions, Edit outbound rules, and then do the following:
-
Choose Add rule.
-
For Type, choose All traffic.
-
For Destination type, choose Custom and paste the security group ID that you copied into the field.
-
Choose Save rules.
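The console procedure above can also be sketched with the AWS CLI. This is a minimal sketch, assuming the AWS CLI is installed and configured with permission to manage security groups. The function name create_efa_sg and its arguments are hypothetical and are not part of this tutorial.

```shell
#!/usr/bin/env bash
# Sketch only: create a security group that allows all traffic to and from itself,
# plus inbound SSH from any IPv4 address (testing only, as noted above).
# create_efa_sg is a hypothetical helper; the VPC ID and group name are supplied by you.
create_efa_sg() {
  local vpc_id="$1" name="${2:-EFA-enabled security group}"
  local sg_id

  sg_id=$(aws ec2 create-security-group \
    --group-name "$name" \
    --description "EFA-enabled security group" \
    --vpc-id "$vpc_id" \
    --query 'GroupId' --output text) || return 1

  # Inbound: all traffic from the security group itself.
  aws ec2 authorize-security-group-ingress --group-id "$sg_id" \
    --ip-permissions "IpProtocol=-1,UserIdGroupPairs=[{GroupId=$sg_id}]" >/dev/null || return 1

  # Inbound: SSH from any IPv4 address.
  aws ec2 authorize-security-group-ingress --group-id "$sg_id" \
    --ip-permissions "IpProtocol=tcp,FromPort=22,ToPort=22,IpRanges=[{CidrIp=0.0.0.0/0}]" >/dev/null || return 1

  # Outbound: all traffic to the security group itself.
  aws ec2 authorize-security-group-egress --group-id "$sg_id" \
    --ip-permissions "IpProtocol=-1,UserIdGroupPairs=[{GroupId=$sg_id}]" >/dev/null || return 1

  echo "$sg_id"
}

# Example (the VPC ID is a placeholder):
#   sg_id=$(create_efa_sg vpc-0123456789abcdef0)
```

The function prints the new security group ID so that later steps can reuse it. For production, replace the 0.0.0.0/0 range with the address range you connect from.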
Step 2: Launch a temporary instance
Launch a temporary instance that you can use to install and configure the EFA software components. You use this instance to create an EFA-enabled AMI from which you can launch your EFA-enabled instances.
To launch a temporary instance
-
Open the Amazon EC2 console at https://console.aws.amazon.com/ec2/.
-
In the navigation pane, choose Instances, and then choose Launch Instances to open the new launch instance wizard.
-
(Optional) In the Name and tags section, provide a name for the instance, such as EFA-instance. The name is assigned to the instance as a resource tag (Name=EFA-instance).
-
In the Application and OS Images section, select an AMI for one of the supported operating systems.
-
In the Instance type section, select a supported instance type.
-
In the Key pair section, select the key pair to use for the instance.
-
In the Network settings section, choose Edit, and then do the following:
-
For Subnet, choose the subnet in which to launch the instance. If you do not select a subnet, you can't enable the instance for EFA.
-
For Firewall (security groups), choose Select existing security group, and then select the security group that you created in the previous step.
-
Expand the Advanced network configuration section.
For Network interface 1, select Network card index = 0, Device index = 0, and Interface type = EFA with ENA.
(Optional) If you are using a multi-card instance type, such as p4d.24xlarge or p5.48xlarge, for each additional network interface required, choose Add network interface, for Network card index select the next unused index, and then select Device index = 1 and Interface type = EFA with ENA or EFA-only.
-
In the Storage section, configure the volumes as needed.
-
In the Summary panel on the right, choose Launch instance.
Note
Consider requiring the use of IMDSv2 for the temporary instance as well as the AMI that you will create in Step 9 unless you have already set IMDSv2 as the default for the account. For more information about IMDSv2 configuration steps, see Configure instance metadata options for new instances.
Step 3: Install the EFA software
Install the EFA-enabled kernel, EFA drivers, Libfabric, and Open MPI stack that are required to support EFA on your temporary instance.
The steps differ depending on whether you intend to use EFA with Open MPI, with Intel MPI, or with Open MPI and Intel MPI.
To install the EFA software
-
Connect to the instance you launched. For more information, see Connect to your Linux instance using SSH.
-
To ensure that all of your software packages are up to date, perform a quick software update on your instance. This process may take a few minutes.
-
Amazon Linux 2023, Amazon Linux 2, RHEL 8/9, Rocky Linux 8/9
$ sudo yum update -y
-
Ubuntu and Debian
$ sudo apt-get update && sudo apt-get upgrade -y
-
SUSE Linux Enterprise
$ sudo zypper update -y
-
Reboot the instance and reconnect to it.
-
Download the EFA software installation files. The software installation files are packaged into a compressed tarball (.tar.gz) file. To download the latest stable version, use the following command.
$ curl -O https://efa-installer.amazonaws.com/aws-efa-installer-1.37.0.tar.gz
You can also get the latest version by replacing the version number with latest in the preceding command.
-
(Optional) Verify the authenticity and integrity of the EFA tarball (.tar.gz) file.
We recommend that you do this to verify the identity of the software publisher and to check that the file has not been altered or corrupted since it was published. If you do not want to verify the tarball file, skip this step.
Note
Alternatively, if you prefer to verify the tarball file by using an MD5 or SHA256 checksum instead, see Verify the EFA installer using a checksum.
-
Download the public GPG key and import it into your keyring.
$ wget https://efa-installer.amazonaws.com/aws-efa-installer.key && gpg --import aws-efa-installer.key
The command should return a key value. Make a note of the key value, because you need it in the next step.
-
Verify the GPG key's fingerprint. Run the following command and specify the key value from the previous step.
$ gpg --fingerprint key_value
The command should return a fingerprint that is identical to 4E90 91BC BB97 A96B 26B1 5E59 A054 80B1 DD2D 3CCC. If the fingerprint does not match, don't run the EFA installation script, and contact AWS Support.
-
Download the signature file and verify the signature of the EFA tarball file.
$ wget https://efa-installer.amazonaws.com/aws-efa-installer-1.37.0.tar.gz.sig && gpg --verify ./aws-efa-installer-1.37.0.tar.gz.sig
The following shows example output.
gpg: Signature made Wed 29 Jul 2020 12:50:13 AM UTC using RSA key ID DD2D3CCC
gpg: Good signature from "Amazon EC2 EFA <ec2-efa-maintainers@amazon.com>"
gpg: WARNING: This key is not certified with a trusted signature!
gpg:          There is no indication that the signature belongs to the owner.
Primary key fingerprint: 4E90 91BC BB97 A96B 26B1 5E59 A054 80B1 DD2D 3CCC
If the result includes Good signature, and the fingerprint matches the fingerprint returned in the previous step, proceed to the next step. If not, don't run the EFA installation script, and contact AWS Support.
-
Extract the files from the compressed .tar.gz file and navigate into the extracted directory.
$ tar -xf aws-efa-installer-1.37.0.tar.gz && cd aws-efa-installer
-
Install the EFA software. Do one of the following depending on your use case.
Note
EFA does not support NVIDIA GPUDirect with SUSE Linux. If you are using SUSE Linux, you must additionally specify the --skip-kmod option to prevent kmod installation. By default, SUSE Linux does not allow out-of-tree kernel modules.
-
If the EFA installer prompts you to reboot the instance, do so and then reconnect to the instance. Otherwise, log out of the instance and then log back in to complete the installation.
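The fingerprint check and the choice of installer options in this step can be sketched as two small helpers. This is a sketch under stated assumptions, not the installer's own tooling: the helper names fingerprint_matches and efa_install_args are hypothetical, and the --minimal and --mpi=... flags are assumed from the EFA installer's usage as commonly documented, so confirm them against the installer version you downloaded. Only -y-style unattended installation and --skip-kmod are implied by this tutorial itself.

```shell
#!/usr/bin/env bash
# Sketch only. Both helpers are hypothetical names, not part of the EFA installer.

# Fingerprint quoted in this tutorial.
EXPECTED_FPR="4E90 91BC BB97 A96B 26B1 5E59 A054 80B1 DD2D 3CCC"

# Reads `gpg --with-colons --fingerprint KEY` output on stdin and succeeds if any
# "fpr" record (field 10) equals the expected fingerprint, ignoring spaces.
fingerprint_matches() {
  local expected="${1// /}"
  awk -F: -v want="$expected" \
    '$1 == "fpr" && $10 == want { found = 1 } END { exit !found }'
}

# Prints the installer arguments for a use case.
# Assumption: --minimal installs EFA without the bundled Open MPI, and
# --mpi=openmpi4 / --mpi=openmpi5 restrict which Open MPI version is installed.
efa_install_args() {
  case "$1" in
    openmpi-and-intel) echo "-y" ;;
    openmpi4-only)     echo "-y --mpi=openmpi4" ;;
    openmpi5-only)     echo "-y --mpi=openmpi5" ;;
    intel-only)        echo "-y --minimal" ;;
    *) echo "unknown use case: $1" >&2; return 1 ;;
  esac
}

# Example (key_value is the value noted when importing the key; the unquoted
# command substitution is intentional so the arguments are split into words):
#   gpg --with-colons --fingerprint key_value | fingerprint_matches "$EXPECTED_FPR" \
#     && sudo ./efa_installer.sh $(efa_install_args openmpi-and-intel)
```

On SUSE Linux, append --skip-kmod to the arguments, as described in the note above.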
Step 4: (Optional) Enable Open MPI 5
Note
Perform this step only if you intend to use Open MPI 5.
From EFA 1.30.0, both Open MPI 4 and Open MPI 5 are installed by default. Alternatively, you can choose to install only Open MPI 4 or Open MPI 5.
If you chose to install Open MPI 5 in Step 3: Install the EFA software, and you intend to use it, you must perform the following steps to enable it.
To enable Open MPI 5
-
Add Open MPI 5 to the PATH environment variable.
$ module load openmpi5
-
Verify that Open MPI 5 is enabled for use.
$ which mpicc
The command should return a path under the Open MPI 5 installation directory, /opt/amazon/openmpi5.
-
(Optional) To ensure that Open MPI 5 is added to the PATH environment variable each time the instance starts, add the module load openmpi5 command to your shell startup script.
If you need to remove Open MPI 5 from the PATH environment variable, run the following command and remove the command from the shell startup scripts.
$ module unload openmpi5
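Making the module load command persistent can be sketched as an idempotent append to a startup script. This is a minimal sketch, assuming bash as the login shell and ~/.bashrc as the startup script. The helper name ensure_line is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: ensure_line is a hypothetical helper.
# Appends a line to a file unless an identical line is already present,
# so running it repeatedly does not duplicate the entry.
ensure_line() {
  local line="$1" file="$2"
  touch "$file"
  grep -qxF -- "$line" "$file" || printf '%s\n' "$line" >> "$file"
}

# Example (assumes bash and ~/.bashrc):
#   ensure_line 'module load openmpi5' "$HOME/.bashrc"
```

To undo the change later, delete that line from the startup script and run module unload openmpi5 in the current session.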
Step 5: (Optional) Install Intel MPI
Important
Perform this step only if you intend to use Intel MPI. If you intend to only use Open MPI, skip this step.
Intel MPI requires an additional installation and environment variable configuration.
Prerequisite
Ensure that the user performing the following steps has sudo permissions.
To install Intel MPI
-
To download the Intel MPI installation script, do the following:
-
Visit the Intel website.
-
In the Intel MPI Library section of the webpage, choose the link for the Intel MPI Library for Linux Offline installer.
-
Run the installation script that you downloaded in the previous step.
$ sudo bash installation_script_name.sh
-
In the installer, choose Accept & install.
-
Read the Intel Improvement Program, choose the appropriate option, and then choose Begin Installation.
-
When the installation completes, choose Close.
-
By default, Intel MPI uses its embedded (internal) Libfabric. You can configure Intel MPI to use the Libfabric that ships with the EFA installer instead. Typically, the EFA installer ships with a later version of Libfabric than Intel MPI. In some cases, the Libfabric that ships with the EFA installer is more performant than that of Intel MPI. To configure Intel MPI to use the Libfabric that ships with the EFA installer, do one of the following depending on your shell.
-
Add the following source command to your shell script to source the vars.sh script from the installation directory to set up the compiler environment each time the instance starts. Do one of the following depending on your shell.
-
If EFA is not available due to a misconfiguration, Intel MPI falls back to the TCP/IP network stack by default, which might result in slower application performance. You can prevent this by setting I_MPI_OFI_PROVIDER to efa. This causes Intel MPI to fail with the following error if EFA is not available:
Abort (XXXXXX) on node 0 (rank 0 in comm 0): Fatal error in PMPI_Init: Other MPI error,
MPIR_Init_thread (XXX)........:
MPID_Init (XXXX)..............:
MPIDI_OFI_mpi_init_hook (XXXX):
open_fabric (XXXX)............:
find_provider (XXXX)..........:
OFI fi_getinfo() failed (ofi_init.c:2684:find_provider:
Do one of the following depending on your shell.
-
By default, Intel MPI doesn't print debugging information. You can specify different verbosity levels to control the debugging information. Possible values (in order of the amount of detail they provide) are: 0 (default), 1, 2, 3, 4, 5. Levels 1 and higher print the libfabric version and libfabric provider. Use libfabric version to check whether Intel MPI is using the internal Libfabric or the Libfabric that ships with the EFA installer. If it's using the internal Libfabric, the version is suffixed with impi. Use libfabric provider to check whether Intel MPI is using EFA or the TCP/IP network. If it's using EFA, the value is efa. If it's using TCP/IP, the value is tcp;ofi_rxm.
To enable debugging information, do one of the following depending on your shell.
-
By default, Intel MPI uses the operating system’s shared memory (shm) for intra-node communication, and it uses Libfabric (ofi) only for inter-node communication. Generally, this configuration provides the best performance. However, in some cases the Intel MPI shm fabric can cause certain applications to hang indefinitely.
To resolve this issue, you can force Intel MPI to use Libfabric for both intra-node and inter-node communication. To do this, do one of the following depending on your shell.
Note
The EFA Libfabric provider uses the operating system's shared memory for intra-node communication. This means that setting I_MPI_FABRICS to ofi yields similar performance to the default shm:ofi configuration.
-
Log out of the instance and then log back in.
If you no longer want to use Intel MPI, remove the environment variables from the shell startup scripts.
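The Intel MPI settings described in this step can be collected into one startup-script fragment. This is a sketch for bash only, under stated assumptions: the vars.sh path assumes a default oneAPI installation location, and the variable names I_MPI_OFI_LIBRARY_INTERNAL and I_MPI_DEBUG are assumed from Intel MPI's environment variable reference rather than taken from this tutorial, which names only I_MPI_OFI_PROVIDER and I_MPI_FABRICS.

```shell
# Sketch of ~/.bashrc additions for Intel MPI with EFA (bash syntax).

# Assumption: prefer the Libfabric that ships with the EFA installer over
# Intel MPI's internal copy. Set this before sourcing vars.sh so that the
# script can take it into account.
export I_MPI_OFI_LIBRARY_INTERNAL=0

# Assumption: default oneAPI install path; adjust to your installation directory.
INTEL_MPI_VARS="/opt/intel/oneapi/mpi/latest/env/vars.sh"
if [ -f "$INTEL_MPI_VARS" ]; then
  . "$INTEL_MPI_VARS"
fi

# Fail at startup instead of silently falling back to TCP/IP when EFA is unavailable.
export I_MPI_OFI_PROVIDER=efa

# Assumption: I_MPI_DEBUG selects the verbosity level (0-5) described above.
# Level 1 prints the libfabric version and provider.
export I_MPI_DEBUG=1

# Uncomment only if an application hangs with the default shm:ofi configuration.
# export I_MPI_FABRICS=ofi
```

Removing these lines from the startup script and logging in again reverts the instance to its previous environment.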
Step 6: Disable ptrace protection
To improve your HPC application's performance, Libfabric uses the instance's local memory for interprocess communications when the processes are running on the same instance.
The shared memory feature uses Cross Memory Attach (CMA), which is not supported with ptrace protection. If you are using a Linux distribution that has ptrace protection enabled by default, such as Ubuntu, you must disable it. If your Linux distribution does not have ptrace protection enabled by default, skip this step.
To disable ptrace protection
Do one of the following:
-
To temporarily disable ptrace protection for testing purposes, run the following command.
$ sudo sysctl -w kernel.yama.ptrace_scope=0
-
To permanently disable ptrace protection, add kernel.yama.ptrace_scope = 0 to /etc/sysctl.d/10-ptrace.conf and reboot the instance.
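Whether this step is needed on a given distribution can be checked before changing anything. This is a minimal sketch, assuming the current setting is exposed at /proc/sys/kernel/yama/ptrace_scope on kernels that include the Yama security module. The helper name ptrace_scope_ok is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: ptrace_scope_ok is a hypothetical helper.
# Succeeds when ptrace protection is already off (value 0) or when the
# setting file is absent, in which case there is nothing to disable.
ptrace_scope_ok() {
  local f="${1:-/proc/sys/kernel/yama/ptrace_scope}"
  if [ ! -r "$f" ]; then
    return 0
  fi
  [ "$(cat "$f")" = "0" ]
}

# Example:
#   ptrace_scope_ok || echo "ptrace protection is enabled; disable it as described above"
```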
Step 7: Confirm installation
To confirm successful installation
-
To confirm that MPI was successfully installed, run the following command:
$ which mpicc
-
For Open MPI, the returned path should include /opt/amazon/.
-
For Intel MPI, the returned path should include /opt/intel/. If you do not get the expected output, ensure that you have sourced the Intel MPI vars.sh script.
-
To confirm that the EFA software components and Libfabric were successfully installed, run the following command.
$ fi_info -p efa -t FI_EP_RDM
The command should return information about the Libfabric EFA interfaces. The following example shows the command output.
provider: efa
    fabric: EFA-fe80::94:3dff:fe89:1b70
    domain: efa_0-rdm
    version: 2.0
    type: FI_EP_RDM
    protocol: FI_PROTO_EFA
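For scripted checks, for example before creating the AMI, the fi_info output can be tested instead of read by eye. This is a minimal sketch, assuming the output keeps the "key: value" layout shown above. The helper name has_efa_rdm is hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: has_efa_rdm is a hypothetical helper that reads fi_info output on stdin.
# Succeeds if an "efa" provider block contains an FI_EP_RDM endpoint type.
has_efa_rdm() {
  awk '
    $1 == "provider:" { in_efa = ($2 == "efa") }
    in_efa && $1 == "type:" && $2 == "FI_EP_RDM" { ok = 1 }
    END { exit !ok }
  '
}

# Example:
#   fi_info -p efa -t FI_EP_RDM | has_efa_rdm && echo "EFA RDM endpoint found"
```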
Step 8: Install your HPC application
Install the HPC application on the temporary instance. The installation procedure varies depending on the specific HPC application. For more information, see Manage software on your AL2 instance in the Amazon Linux 2 User Guide.
Note
Refer to your HPC application’s documentation for installation instructions.
Step 9: Create an EFA-enabled AMI
After you have installed the required software components, you create an AMI that you can reuse to launch your EFA-enabled instances.
To create an AMI from your temporary instance
-
Open the Amazon EC2 console at https://console.aws.amazon.com/ec2/.
-
In the navigation pane, choose Instances.
-
Select the temporary instance that you created and choose Actions, Image, Create image.
-
For Create image, do the following:
-
For Image name, enter a descriptive name for the AMI.
-
(Optional) For Image description, enter a brief description of the purpose of the AMI.
-
Choose Create image.
-
In the navigation pane, choose AMIs.
-
Locate the AMI that you created in the list. Wait for the status to change from pending to available before continuing to the next step.
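The same AMI creation can be sketched with the AWS CLI, including the wait for the available state. This is a minimal sketch, assuming the AWS CLI is installed and configured. The function name create_efa_ami and its arguments are hypothetical.

```shell
#!/usr/bin/env bash
# Sketch only: create_efa_ami is a hypothetical helper.
# Creates an AMI from the temporary instance and blocks until it is available.
create_efa_ami() {
  local instance_id="$1" name="$2"
  local ami_id

  ami_id=$(aws ec2 create-image \
    --instance-id "$instance_id" \
    --name "$name" \
    --description "EFA-enabled AMI" \
    --query 'ImageId' --output text) || return 1

  # Polls until the image state changes from pending to available.
  aws ec2 wait image-available --image-ids "$ami_id" || return 1

  echo "$ami_id"
}

# Example (the instance ID is a placeholder):
#   ami_id=$(create_efa_ami i-0123456789abcdef0 efa-enabled-ami)
```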
Step 10: Launch EFA-enabled instances into a cluster placement group
Launch your EFA-enabled instances into a cluster placement group using the EFA-enabled AMI that you created in Step 9, and the EFA-enabled security group that you created in Step 1.
Note
-
It is not an absolute requirement to launch your EFA-enabled instances into a cluster placement group. However, we recommend running your EFA-enabled instances in a cluster placement group because it launches the instances into a low-latency group in a single Availability Zone.
-
To ensure that capacity is available as you scale your cluster’s instances, you can create a Capacity Reservation for your cluster placement group. For more information, see Create Capacity Reservations in cluster placement groups.
To launch an instance
-
Open the Amazon EC2 console at https://console.aws.amazon.com/ec2/.
-
In the navigation pane, choose Instances, and then choose Launch Instances to open the new launch instance wizard.
-
(Optional) In the Name and tags section, provide a name for the instance, such as EFA-instance. The name is assigned to the instance as a resource tag (Name=EFA-instance).
-
In the Application and OS Images section, choose My AMIs, and then select the AMI that you created in the previous step.
-
In the Instance type section, select a supported instance type.
-
In the Key pair section, select the key pair to use for the instance.
-
In the Network settings section, choose Edit, and then do the following:
-
For Subnet, choose the subnet in which to launch the instance. If you do not select a subnet, you can't enable the instance for EFA.
-
For Firewall (security groups), choose Select existing security group, and then select the security group that you created in Step 1.
-
Expand the Advanced network configuration section.
For Network interface 1, select Network card index = 0, Device index = 0, and Interface type = EFA with ENA.
(Optional) If you are using a multi-card instance type, such as p4d.24xlarge or p5.48xlarge, for each additional network interface required, choose Add network interface, for Network card index select the next unused index, and then select Device index = 1 and Interface type = EFA with ENA or EFA-only.
-
(Optional) In the Storage section, configure the volumes as needed.
-
In the Advanced details section, for Placement group name, select the cluster placement group into which to launch the instances. If you need to create a new cluster placement group, choose Create new placement group.
-
In the Summary panel on the right, for Number of instances, enter the number of EFA-enabled instances that you want to launch, and then choose Launch instance.
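The launch can also be sketched with the AWS CLI. This is a minimal sketch for single-card instance types only, assuming the AWS CLI is installed and configured and that the placement group does not exist yet. The function name launch_efa_cluster and its argument order are hypothetical. InterfaceType=efa is assumed to correspond to the console's EFA with ENA option.

```shell
#!/usr/bin/env bash
# Sketch only: launch_efa_cluster is a hypothetical helper.
# Creates a cluster placement group and launches EFA-enabled instances into it.
launch_efa_cluster() {
  local ami="$1" itype="$2" key="$3" subnet="$4" sg="$5" pg="$6" count="$7"

  # Fails if a placement group with this name already exists.
  aws ec2 create-placement-group --group-name "$pg" --strategy cluster >/dev/null || return 1

  # One EFA interface on network card 0, device index 0.
  aws ec2 run-instances \
    --image-id "$ami" \
    --instance-type "$itype" \
    --key-name "$key" \
    --count "$count" \
    --placement "GroupName=$pg" \
    --network-interfaces "NetworkCardIndex=0,DeviceIndex=0,InterfaceType=efa,SubnetId=$subnet,Groups=$sg" \
    --query 'Instances[].InstanceId' --output text
}

# Example (all IDs are placeholders; choose a supported instance type):
#   launch_efa_cluster ami-0abc supported_instance_type my-key subnet-0abc sg-0abc efa-pg 4
```

Multi-card instance types need one additional network interface entry per network card, following the index rules described in the procedure above.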
Step 11: Terminate the temporary instance
At this point, you no longer need the instance that you launched in Step 2. You can terminate the instance to stop incurring charges for it.
To terminate the temporary instance
-
Open the Amazon EC2 console at https://console.aws.amazon.com/ec2/.
-
In the navigation pane, choose Instances.
-
Select the temporary instance that you created and then choose Actions, Instance state, Terminate (delete) instance.
-
When prompted for confirmation, choose Terminate (delete).
Step 12: Enable passwordless SSH
To enable your applications to run across all of the instances in your cluster, you must enable passwordless SSH access from the leader node to the member nodes. The leader node is the instance from which you run your applications. The remaining instances in the cluster are the member nodes.
To enable passwordless SSH between the instances in the cluster
-
Select one instance in the cluster as the leader node, and connect to it.
-
Disable StrictHostKeyChecking and enable ForwardAgent on the leader node. Open ~/.ssh/config using your preferred text editor and add the following.
Host *
    ForwardAgent yes
Host *
    StrictHostKeyChecking no
-
Generate an RSA key pair.
$ ssh-keygen -t rsa -N "" -f ~/.ssh/id_rsa
The key pair is created in the $HOME/.ssh/ directory.
-
Change the permissions of the private key and the SSH configuration file on the leader node.
$ chmod 600 ~/.ssh/id_rsa
$ chmod 600 ~/.ssh/config
-
Open ~/.ssh/id_rsa.pub using your preferred text editor and copy the key.
-
For each member node in the cluster, do the following:
-
Connect to the instance.
-
Open ~/.ssh/authorized_keys using your preferred text editor and add the public key that you copied earlier.
-
To test that the passwordless SSH is functioning as expected, connect to your leader node and run the following command.
$ ssh member_node_private_ip
You should connect to the member node without being prompted for a key or password.
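With more than a few member nodes, the manual test above can be repeated in a loop. This is a minimal sketch to run on the leader node. The function name check_passwordless_ssh is hypothetical. BatchMode=yes makes ssh fail instead of prompting, so a node that still asks for a password is reported as a failure.

```shell
#!/usr/bin/env bash
# Sketch only: check_passwordless_ssh is a hypothetical helper.
# Tries a non-interactive SSH connection to each host and reports the result.
check_passwordless_ssh() {
  local host failed=0
  for host in "$@"; do
    if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true; then
      echo "ok: $host"
    else
      echo "FAILED: $host" >&2
      failed=1
    fi
  done
  return "$failed"
}

# Example (private IP addresses are placeholders):
#   check_passwordless_ssh 10.0.0.11 10.0.0.12 10.0.0.13
```

The function returns a non-zero status if any node fails, which makes it usable as a gate before starting an MPI job.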