Concurrency
The AWS SDK for Rust doesn't provide concurrency control, but users have many options for implementing their own.
Terms
Terms related to this subject are easy to confuse and some terms have become synonyms even though they originally represented separate concepts. In this guide, we'll define the following:
- Task: Some "unit of work" that your program will run to completion, or attempt to run to completion.
- Sequential Computing: When several tasks are executed one after another.
- Concurrent Computing: When several tasks are executed in overlapping time periods.
- Concurrency: The ability of a computer to complete multiple tasks in an arbitrary order.
- Multitasking: The ability of a computer to run several tasks concurrently.
- Race Condition: When the behavior of your program changes based on when a task is started or how long it takes to process a task.
- Contention: Conflict over access to a shared resource. When two or more tasks want to access a resource at the same time, that resource is "in contention".
- Deadlock: A state in which no more progress can be made. This typically happens because two tasks want to acquire each other's resources, but neither task will release its resource until the other's resource becomes available. Deadlocks lead to a program becoming partly or completely unresponsive (a minimal sketch follows this list).
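As a concrete illustration of that last term, here's a minimal, self-contained sketch (not taken from the SDK) of the classic two-lock deadlock:

```rust
use std::sync::{Arc, Mutex};
use std::thread;
use std::time::Duration;

fn main() {
    let a = Arc::new(Mutex::new(0));
    let b = Arc::new(Mutex::new(0));

    let (a2, b2) = (Arc::clone(&a), Arc::clone(&b));
    let t1 = thread::spawn(move || {
        let _a = a2.lock().unwrap();
        thread::sleep(Duration::from_millis(50)); // widen the race window
        let _b = b2.lock().unwrap(); // waits forever for thread 2 to release `b`
    });
    let t2 = thread::spawn(move || {
        let _b = b.lock().unwrap();
        thread::sleep(Duration::from_millis(50));
        let _a = a.lock().unwrap(); // waits forever for thread 1 to release `a`
    });

    // Neither join ever returns: each thread holds the lock the
    // other one needs, so the program hangs here.
    t1.join().unwrap();
    t2.join().unwrap();
}
```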
A simple example
Our first example is a sequential program. In later examples, we'll change this code using concurrency techniques. Later examples reuse the same `build_client_and_list_objects_to_download()` function and make changes within `main()`.
The following example task is to download all the files in an Amazon Simple Storage Service bucket:
- Start by listing all the files. Save the keys in a list.
- Iterate over the list, downloading each file in turn.
```rust
use aws_sdk_s3::Client;

const EXAMPLE_BUCKET: &str = "<an-example-bucket>";

// This initialization function won't be reproduced in
// examples following this one, in order to save space.
async fn build_client_and_list_objects_to_download() -> (Client, Vec<String>) {
    let cfg = aws_config::load_defaults(aws_config::BehaviorVersion::latest()).await;
    let client = Client::new(&cfg);
    let objects_to_download: Vec<_> = client
        .list_objects_v2()
        .bucket(EXAMPLE_BUCKET)
        .send()
        .await
        .expect("listing objects succeeds")
        .contents()
        .into_iter()
        .flat_map(aws_sdk_s3::types::Object::key)
        .map(ToString::to_string)
        .collect();

    (client, objects_to_download)
}
```
```rust
#[tokio::main]
async fn main() {
    let (client, objects_to_download) = build_client_and_list_objects_to_download().await;

    for object in objects_to_download {
        let res = client
            .get_object()
            .key(&object)
            .bucket(EXAMPLE_BUCKET)
            .send()
            .await
            .expect("get_object succeeds");
        let body = res
            .body
            .collect()
            .await
            .expect("reading body succeeds")
            .into_bytes();
        std::fs::write(object, body).expect("write succeeds");
    }
}
```
Note
In these examples, we won't be handling errors, and we assume that the example bucket has no objects with keys that look like file paths. Thus, we won't cover creating nested directories.
Because of the architecture of modern computers, we can rewrite this program to be much more efficient. We'll do that in a later example, but first, let's learn a few more concepts.
Ownership and mutability
Each value in Rust has a single owner. When an owner goes out of scope, all values it owns will also be dropped. The owner can provide either one or more immutable references to a value or a single mutable reference. The Rust compiler is responsible for ensuring that no reference outlives its owner.
Additional planning and design are needed when multiple tasks need mutable access to the same resource. In sequential computing, each task can mutably access the same resource without contention because tasks run one after another. However, in concurrent computing, tasks can run in any order, and at the same time. Therefore, we must do more to prove to the compiler that multiple mutable references are impossible (or at least that the program will crash if they do occur).
The Rust standard library provides many tools to help us accomplish this. For more information on these topics, see Variables and Mutability in the Rust Book.
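To make these rules concrete, here's a small sketch (unrelated to the SDK) of what the borrow checker does and doesn't allow:

```rust
fn main() {
    let mut value = String::from("hello");

    // Any number of immutable references may coexist...
    let r1 = &value;
    let r2 = &value;
    println!("{r1} {r2}");

    // ...or a single mutable reference, but never both at once.
    let m = &mut value;
    m.push_str(", world");
    println!("{m}");

    // This would not compile: `r1` cannot be used again once
    // `value` has been mutably borrowed.
    // println!("{r1}");
}
```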
More terms!
The following are lists of "synchronization objects". Altogether, they are the tools necessary to convince the compiler that our concurrent program won't break ownership rules.
Standard library synchronization objects
- `Arc`: An Atomically Reference-Counted pointer. When data is wrapped in an `Arc`, it can be shared freely, without worrying about any specific owner dropping the value early. In this sense, the ownership of the value becomes "shared". Values within an `Arc` cannot be mutable, but might have interior mutability.
- `Barrier`: Ensures multiple threads will wait for each other to reach a point in the program, before continuing execution all together.
- `Condvar`: A condition variable providing the ability to block a thread while waiting for an event to occur.
- `Mutex`: A mutual exclusion mechanism that ensures that at most one thread at a time is able to access some data. Generally speaking, a `Mutex` lock should never be held across an `.await` point in the code.
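In practice, `Arc` and `Mutex` are often combined to share mutable state between threads. Here's a minimal sketch of that pattern:

```rust
use std::sync::{Arc, Mutex};
use std::thread;

fn main() {
    // Shared ownership via `Arc`, interior mutability via `Mutex`.
    let counter = Arc::new(Mutex::new(0u32));

    let handles: Vec<_> = (0..4)
        .map(|_| {
            let counter = Arc::clone(&counter);
            thread::spawn(move || {
                // At most one thread can hold the lock at a time.
                *counter.lock().unwrap() += 1;
            })
        })
        .collect();

    for handle in handles {
        handle.join().unwrap();
    }

    assert_eq!(*counter.lock().unwrap(), 4);
}
```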
Tokio synchronization objects
While the AWS SDKs are intended to be `async`-runtime-agnostic, we recommend the use of `tokio` synchronization objects for specific cases.
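One such case is a lock that must be held across an `.await` point: a `std::sync::MutexGuard` shouldn't live across an `.await`, but `tokio::sync::Mutex` is designed for exactly that. A minimal sketch, assuming the `sync`, `time`, `macros`, and `rt-multi-thread` features of `tokio` are enabled:

```rust
use std::sync::Arc;
use std::time::Duration;
use tokio::sync::Mutex;

#[tokio::main]
async fn main() {
    let shared = Arc::new(Mutex::new(Vec::new()));

    let shared2 = Arc::clone(&shared);
    let task = tokio::spawn(async move {
        // This guard may safely live across an `.await` point.
        let mut guard = shared2.lock().await;
        tokio::time::sleep(Duration::from_millis(10)).await;
        guard.push("from the spawned task");
    });

    task.await.expect("task completes");
    shared.lock().await.push("from main");
    assert_eq!(shared.lock().await.len(), 2);
}
```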
Rewriting our example to be more efficient (single-threaded concurrency)
In the following modified example, we use `futures_util::future::join_all` to run all of our `get_object` requests concurrently:
```rust
#[tokio::main]
async fn main() {
    let (client, objects_to_download) = build_client_and_list_objects_to_download().await;

    let get_object_futures = objects_to_download.into_iter().map(|object| {
        let req = client
            .get_object()
            .key(&object)
            .bucket(EXAMPLE_BUCKET);
        async {
            let res = req.send().await.expect("get_object succeeds");
            let body = res.body.collect().await.expect("body succeeds").into_bytes();
            // Note that we MUST use the async runtime's preferred way
            // of writing files. Otherwise, this call would block,
            // potentially causing a deadlock.
            tokio::fs::write(object, body).await.expect("write succeeds");
        }
    });

    futures_util::future::join_all(get_object_futures).await;
}
```
This is the simplest way to benefit from concurrency, but it also has a few issues that might not be obvious at first glance:
- We create all the request inputs at the same time. If we don't have enough memory to hold all the `get_object` request inputs, then we'll run into an "out-of-memory" allocation error.
- We create and await all the futures at the same time. Amazon S3 throttles requests if we try to download too much at once.
To fix both of these issues, we must limit the number of requests that we're sending at any one time. We'll do this with a `tokio` `Semaphore`:
```rust
use std::sync::Arc;
use tokio::sync::Semaphore;

const CONCURRENCY_LIMIT: usize = 50;

#[tokio::main(flavor = "current_thread")]
async fn main() {
    let (client, objects_to_download) = build_client_and_list_objects_to_download().await;
    let concurrency_semaphore = Arc::new(Semaphore::new(CONCURRENCY_LIMIT));

    let get_object_futures = objects_to_download.into_iter().map(|object| {
        // Since each future needs to acquire a permit, we need to clone
        // the Arc'd semaphore before passing it in.
        let semaphore = concurrency_semaphore.clone();
        // We also need to clone the client so each task has its own handle.
        let client = client.clone();

        async move {
            let permit = semaphore
                .acquire()
                .await
                .expect("we'll get a permit if we wait long enough");
            let res = client
                .get_object()
                .key(&object)
                .bucket(EXAMPLE_BUCKET)
                .send()
                .await
                .expect("get_object succeeds");
            let body = res.body.collect().await.expect("body succeeds").into_bytes();
            tokio::fs::write(object, body).await.expect("write succeeds");
            std::mem::drop(permit);
        }
    });

    futures_util::future::join_all(get_object_futures).await;
}
```
We've fixed the potential memory usage issue by moving the request creation into the `async` block. This way, requests won't be created until it's time to send them.
Note
If you have the memory for it, it might be more efficient to create all your request inputs at once and hold them in memory until they're ready to be sent. To try this, move request input creation outside of the `async` block.
We've also fixed the issue of sending too many requests at once by limiting requests in flight to `CONCURRENCY_LIMIT`.
Note
The right value for `CONCURRENCY_LIMIT` is different for every project. When constructing and sending your own requests, try to set it as high as you can without running into throttling errors. While it's possible to dynamically update your concurrency limit based on the ratio of successful to throttled responses that a service sends back, that's outside the scope of this guide due to its complexity.
Rewriting our example to be more efficient (multi-threaded concurrency)
In the previous two examples, we performed our requests concurrently. While this is more efficient than running them sequentially, we can make things even more efficient by using multi-threading. To do this with `tokio`, we'll need to spawn them as separate tasks.
Note
This example requires that you use the multi-threaded `tokio` runtime, which is gated behind the `rt-multi-thread` feature. And, of course, you'll need to run your program on a multi-core machine.
```rust
use std::sync::Arc;
use tokio::sync::Semaphore;

// Set this based on the amount of cores your target machine has.
const THREADS: usize = 8;

#[tokio::main(flavor = "multi_thread")]
async fn main() {
    let (client, objects_to_download) = build_client_and_list_objects_to_download().await;
    let concurrency_semaphore = Arc::new(Semaphore::new(THREADS));

    let get_object_task_handles = objects_to_download.into_iter().map(|object| {
        // Since each future needs to acquire a permit, we need to clone
        // the Arc'd semaphore before passing it in.
        let semaphore = concurrency_semaphore.clone();
        // We also need to clone the client so each task has its own handle.
        let client = client.clone();

        // Note this difference! We're using `tokio::task::spawn` to
        // immediately begin running these requests.
        tokio::task::spawn(async move {
            let permit = semaphore
                .acquire()
                .await
                .expect("we'll get a permit if we wait long enough");
            let res = client
                .get_object()
                .key(&object)
                .bucket(EXAMPLE_BUCKET)
                .send()
                .await
                .expect("get_object succeeds");
            let body = res.body.collect().await.expect("body succeeds").into_bytes();
            tokio::fs::write(object, body).await.expect("write succeeds");
            std::mem::drop(permit);
        })
    });

    futures_util::future::join_all(get_object_task_handles).await;
}
```
Dividing work into tasks can be complex. Doing I/O (input/output) is typically blocking, and runtimes might struggle to balance the needs of long-running tasks with those of short-running tasks. Whichever runtime you choose, be sure to read its recommendations for the most efficient way to divide your work into tasks. For the `tokio` runtime's recommendations, see Module tokio::task.
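For example, `tokio` provides `tokio::task::spawn_blocking` for moving blocking or CPU-heavy work onto a dedicated thread pool, so that it can't starve the async worker threads. A minimal sketch:

```rust
use tokio::task;

#[tokio::main]
async fn main() {
    let sum = task::spawn_blocking(|| {
        // Pretend this is an expensive, fully synchronous computation
        // that would otherwise block the async runtime.
        (0u64..10_000_000).sum::<u64>()
    })
    .await
    .expect("blocking task completes");

    println!("result: {sum}");
}
```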
Debugging multi-threaded apps
Tasks running concurrently can be run in any order. As such, the logs of concurrent programs can be very difficult to read. In the SDK for Rust, we recommend using the `tracing` logging system. It can group logs with their specific tasks, no matter when they're running. For guidance, see Enable logging of AWS SDK for Rust code.
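As a sketch of that pattern (the `download` function here is a hypothetical stand-in for your own task body), `#[tracing::instrument]` attaches a span, including the function's arguments, to every event logged inside it, and `tracing_subscriber::fmt::init()` installs a basic subscriber that prints those spans:

```rust
use tracing::{info, instrument};

// Every event logged inside this function carries a span that
// records the `object` argument, so interleaved logs from
// concurrent downloads can still be told apart.
#[instrument]
async fn download(object: String) {
    info!("starting download");
    // ... send the request here ...
    info!("finished download");
}

#[tokio::main]
async fn main() {
    // A basic subscriber that prints spans and events to stdout.
    tracing_subscriber::fmt::init();

    let handles: Vec<_> = ["a.txt", "b.txt"]
        .into_iter()
        .map(|o| tokio::spawn(download(o.to_string())))
        .collect();
    for handle in handles {
        handle.await.expect("task completes");
    }
}
```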
A very useful tool for identifying tasks that have locked up is `tokio-console`. By running the `tokio-console` app, you can see a live view of the tasks your program is running. This view includes helpful information like the amount of time a task has spent waiting to acquire shared resources or the number of times it has been polled.
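Wiring a program up to `tokio-console` is typically a small change using the `console-subscriber` crate; a minimal sketch, assuming that crate has been added as a dependency:

```rust
#[tokio::main]
async fn main() {
    // Registers the tracing layer that the `tokio-console` app
    // connects to. Note that collecting task data typically also
    // requires building with `RUSTFLAGS="--cfg tokio_unstable"`.
    console_subscriber::init();

    // ... the rest of your program ...
}
```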