Workflow Idempotency

This post is a continuation of the workflow basics post.

Idempotency of a workflow

Idempotency is the property of certain operations such that they can be applied multiple times without changing the result beyond the initial application. 

For example, in HTTP, the methods GET, PUT, and DELETE should be implemented in an idempotent manner. If you GET a resource multiple times, the result should be the same, assuming no other changes occur in between. Similarly, updating a resource with PUT or deleting it with DELETE multiple times should have the same effect as doing it once.
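A minimal sketch can make these semantics concrete. The following toy in-memory store (all names here are hypothetical, chosen just for illustration) shows why repeating PUT or DELETE leaves the state unchanged, while repeating POST does not:

```python
class ResourceStore:
    """Toy in-memory store illustrating HTTP method idempotency."""

    def __init__(self):
        self.resources = {}
        self.next_id = 0

    def put(self, key, value):
        # PUT replaces the resource; repeating it leaves the same state.
        self.resources[key] = value

    def delete(self, key):
        # DELETE removes the resource; deleting again is a harmless no-op.
        self.resources.pop(key, None)

    def post(self, value):
        # POST creates a new resource on every call -- NOT idempotent.
        self.next_id += 1
        self.resources[f"item-{self.next_id}"] = value
        return f"item-{self.next_id}"

store = ResourceStore()
store.put("a", 1)
store.put("a", 1)      # repeated PUT: state unchanged
assert store.resources == {"a": 1}

store.delete("a")
store.delete("a")      # repeated DELETE: still gone, no error
assert store.resources == {}

store.post(9)
store.post(9)          # repeated POST: two distinct resources created
assert len(store.resources) == 2
```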

Idempotency is a useful property because it helps with recovery from failures in distributed systems. If an operation fails with a transient error, it can simply be retried. If it had actually completed (with either success or failure), retrying it will not change the result.

Messaging systems like SQS provide "at-least-once delivery" guarantees, which means that on rare occasions a message can be delivered multiple times. The Retry Pattern is recommended for HTTP clients to handle transient failures and improve the stability of an application. These are common scenarios where a client might trigger a workflow multiple times while expecting idempotent behavior.
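The Retry Pattern mentioned above can be sketched as a small helper with exponential backoff (the names `call_with_retries` and `TransientError` are hypothetical, not from any library):

```python
import time

class TransientError(Exception):
    """Stand-in for a retryable failure (e.g. a timeout or HTTP 503)."""

def call_with_retries(operation, max_attempts=3, base_delay=0.01):
    """Retry an operation on transient errors with exponential backoff.

    Retrying is only safe when the operation is idempotent: a retry
    after an ambiguous failure must not change the outcome.
    """
    for attempt in range(max_attempts):
        try:
            return operation()
        except TransientError:
            if attempt == max_attempts - 1:
                raise
            time.sleep(base_delay * 2 ** attempt)

# A flaky operation that succeeds on the third attempt.
attempts = []
def flaky():
    attempts.append(1)
    if len(attempts) < 3:
        raise TransientError("try again")
    return "ok"

assert call_with_retries(flaky) == "ok"
assert len(attempts) == 3
```

Note that the client cannot tell a lost request apart from a lost response, so retries can re-deliver an operation that already succeeded, which is exactly why the downstream workflow should be idempotent.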


Let’s get back to AWS Step Functions and look at how idempotency can be achieved.

Idempotency based on name

The API to start a new workflow execution is StartExecution. It takes the following parameters:

  • stateMachineArn - the identifier of the workflow
  • name - the identifier of this workflow execution
  • input - the input for the first state of the workflow, as a JSON string
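Assembling these parameters with boto3 might look like the sketch below (the ARN and order identifier are hypothetical; the actual client call is shown in a comment since it needs AWS credentials):

```python
import json

# Hypothetical identifiers for illustration only.
state_machine_arn = "arn:aws:states:us-east-1:123456789012:stateMachine:OrderFlow"
order_id = "order-20240101-0042"

params = {
    "stateMachineArn": state_machine_arn,
    "name": order_id,  # reusing this name on retries gives idempotency
    "input": json.dumps({"orderId": order_id, "amount": 25.00}),
}

# With boto3 (not run here; requires AWS credentials):
# sfn = boto3.client("stepfunctions")
# response = sfn.start_execution(**params)

assert isinstance(params["input"], str)   # input must be a JSON *string*
assert len(params["name"]) <= 80          # execution names are limited in length
```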


StartExecution is built as an idempotent operation.

  • If StartExecution is called with the same name and input as a running execution, the call succeeds and returns the same executionArn and startDate.
  • If StartExecution is called with a name used within the 90-day retention period, and the input is different or that execution has already completed, an error signaling that the execution name exists is returned.

To build an idempotent operation based on workflows it is possible to use the name field. A parameter for the operation will need to be used as the name of the workflow execution. If the workflow execution throws an error with an existing name, it will be possible to lookup the previous execution via the DescribeExecution call and return the response. See an example of this pattern here.

Using the provided idempotency based on name is fairly simple. However, it is limited by the 90-day retention period, and it only provides idempotency at the scope of the entire workflow: it is not possible to retry individual steps of a workflow. The following pattern handles more complex scenarios.


Idempotency based on external state

For complex long running workflows, intermediate steps in a workflow can fail. It might be possible to restart a workflow that is able to resume from failed intermediate steps. Hence each state within the workflow should be idempotent in itself.


[Image: diagram of a workflow whose steps record execution metadata in an external datastore]

To make individual states in a workflow idempotent, extract the metadata about a workflow execution (or "computation", as in the diagram) to an external datastore like DynamoDB. Each state within an execution can then look up the current metadata and resume or skip its work accordingly.
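The look-up-then-resume-or-skip logic can be sketched as follows. A plain dict stands in for DynamoDB here; in a real implementation you would use conditional writes to guard against concurrent executions (the helper names are hypothetical):

```python
# Maps (execution_key, step_name) -> stored result; a stand-in for DynamoDB.
metadata_store = {}

def run_step_once(execution_key, step_name, work):
    """Run `work` only if this step has not already completed."""
    key = (execution_key, step_name)
    if key in metadata_store:
        return metadata_store[key]   # already done: skip, reuse prior result
    result = work()
    metadata_store[key] = result     # checkpoint before moving to next step
    return result

calls = []
def charge_card():
    calls.append(1)
    return "charge-receipt-7"        # hypothetical side-effecting step

# The first run performs the work; a retried run skips it.
assert run_step_once("order-42", "charge", charge_card) == "charge-receipt-7"
assert run_step_once("order-42", "charge", charge_card) == "charge-receipt-7"
assert len(calls) == 1
```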

The diagram above is from an AWS talk - Under the Covers of AWS: Its Core Distributed Systems. The talk covers various primitives for building distributed systems, including workflows.

With metadata stored in an external service, the name of the workflow execution no longer matters. Instead, unique identifiers derived from the input serve as lookup keys into the metadata store, and each step can record additional information there as it completes.
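One common way to derive such a key (a sketch, not from any particular library) is to hash a canonical serialization of the input, so that logically equal payloads always map to the same metadata record:

```python
import hashlib
import json

def idempotency_key(payload):
    """Derive a stable key from a request payload.

    Serializing with sorted keys makes the hash deterministic for
    logically-equal inputs, so retries hit the same metadata record.
    """
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()

a = idempotency_key({"orderId": "42", "amount": 25})
b = idempotency_key({"amount": 25, "orderId": "42"})  # same data, other order
assert a == b
assert a != idempotency_key({"orderId": "43", "amount": 25})
```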

Using an external datastore adds complexity to the solution, but it allows for more control. It also removes the 90-day limit on workflow idempotency of the previous pattern.


Next

  • Service Integration patterns with async flows
