RFC: Function Idempotency Helper

## Key information

* RFC PR: awslabs/aws-lambda-powertools-python#245 
* Related issue(s), if known: _<none>_
* Area: _Utilities_
* Meet [tenets](https://awslabs.github.io/aws-lambda-powertools-python/#tenets): _Yes_


## Summary
[summary]: #summary

Helper to facilitate writing [**Idempotent**](https://en.wikipedia.org/wiki/Idempotence) Lambda functions.
The developer would specify (via JMESPath) which value from the **event** will be used as a **unique** execution identifier, then this helper would search a persistence layer (e.g. DynamoDB) for that ID; if present, get the return value and **skip** the function execution, otherwise run the function normally and **persist** the return value + execution ID.


## Motivation
[motivation]: #motivation

Idempotency is a very useful design characteristic of any system. It enables the seamless separation of successful and failed executions, and is particularly useful in Lambdas used by [AWS Step Functions](https://aws.amazon.com/step-functions/). It is also a design principle on the [AWS Well Architected Framework - Serverless Lens](https://docs.aws.amazon.com/wellarchitected/latest/serverless-applications-lens/wellarchitected-serverless-applications-lens.pdf)

Broader description of this idea can be found [here](https://aws.amazon.com/premiumsupport/knowledge-center/lambda-function-idempotent/)


## Proposal
[proposal]: #proposal

Define a Python Decorator `@idempotent` which would receive as arguments a) the JMESPath of the _event_ key to use as execution ID, b) {optional} storage backend configuration, e.g. DynamoDB table name, or ElasticSearch URL + Index).

This decorator would wrap the function execution in the following way (_pseudo-python_):

```python
from aws_lambda_powertools.itempotent import PersistenceLayer

def idempotent(func, event_key, persistence_config):
  def wrapper(*args, **kwargs):
    persistence = PersistenceLayer(persistence_config)
    key = jmespath.find(event_key, **kwargs['event'])
    persistence.find(key)

    if persistence.executed_successfully():
      return persistence.result()

    try:
      result = func(*args, **kwargs)
      persistence.save_success(key, result)
      return result
    except Exception => e:
      persistence.save_error(key, e)

  return wrapper
```

Usage then would be similar to:

```python
from aws_lambda_powertools.itempotent import itempotent

@idempotent(event_key='Event.UniqueId', persistence_config='dynamodb://lambda-idemp-table')
def handler(event, context):
  # Normal function code here
  return {'result': 'OK', 'message': 'working'}
```

The decorator would first extract the unique execution ID from the Lambda event using the JMESPath provided, then check the persistence layer for a previous **successfull** execution of the function and - if found - get the previous returned value, de-serialize it (using base64 or something else) and return it instead; otherwise, execute the function handler normally, catch the returned object, serialize + persist it and finally return.

The _Persistence_ layer could be implemented initially with DynamoDB, and either require the DDB table to exist before running the function, or create it during the first execution. It should be in such way as to allow different backends in the future (e.g. Redis for VPC-enabled lambdas).


## Drawbacks
[drawbacks]: #drawbacks

This solution could have noticeable performance impacts on the execution of Lambda functions. Every execution would require at at least 1, at most 2 accesses to the persistence layer.

No additional dependencies are required - DynamoDB access is provided by boto3, object serialisation can use Python's native base64encode/decode


## Rationale and alternatives
[rationale-and-alternatives]: #rationale-and-alternatives

* **What other designs have been considered? Why not them?**
No other designs considered at the moment. Open to suggestions.

* **What is the impact of not doing this?**
Implemention of idempotent Lambda functions will have to be done 'manually' in every function.


## Unresolved questions
[unresolved-questions]: #unresolved-questions

- How to make the persistence layer access as fast as possible?
- Which other persistence layers to consider (DynamoDB, ElasticSearch, Redis, MySQL)?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

RFC: Function Idempotency Helper #28

Key information

Summary

Motivation

Proposal

Drawbacks

Rationale and alternatives

Unresolved questions

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

RFC: Function Idempotency Helper #28

Description

Key information

Summary

Motivation

Proposal

Drawbacks

Rationale and alternatives

Unresolved questions

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions