Skip to content

This repository deploys a serverless service that takes text files uploaded to a bucket, converts them to an MP3 and sends the output to an email address

License

Notifications You must be signed in to change notification settings

ran-isenberg/aws-text-to-speech

Repository files navigation

AWS Serverless Text to Speech Service (Python)

license PythonSupport version github-star-badge issues

alt text

Blogs website

Contact details | [email protected]

Features

  • A Serverless service that takes text files uploaded to a bucket, converts them to an MP3 and sends the output to an email address
  • Uses Amazon Polly
  • Python Serverless service with a recommended file structure.
  • CI/CD pipelines based on Github actions with python linters, static code analysis, complexity checks and style formatters.
  • Unit, integration and E2E test folders ready for implementation.

This is not a production ready code but more of an advanced POC.

I use it to convert my blog's text to audio for accessibility reasons.

It uploads any text file in the /text folder to S3, turns them into .mp3 files and sends them back to an email address.

The email address is hardcoded in the service/logic/email.py file and can be changed.

Here's a deep dive into the design: https://www.ranthebuilder.cloud/post/serverless-empowers-accessibility-convert-text-to-speech-with-amazon-polly

Important: make sure you enable SES to send emails to the email address you choose https://docs.aws.amazon.com/ses/latest/dg/creating-identities.html

Who gave me this Idea?

I was inspired to design a solution for myself after seeing this excellent YouTube video by Johannes Koch and Jimmy Dahlqvist.

Design

For the service design and further information checkout my blog post here.

Architecture

alt text

Flow of events:

  • Text file is uploaded to S3
  • A Lambda function is triggered with a 'create object' event.
  • The Lambda function reads the text file, and uses AWS Polly wrapper to start a polly text to speech task that will save the output the origin bucket as an .mp3 file
  • The Lambda function waits for the task to complete by polling the task status
  • Once completed, the function downloads the file and sends it as an email attachment to an email address of your choice

Design Considerations

  1. Why do I use Lambda function? Why not a step function?

That's definitely an improvement, but this was just a quick POC to automate my personal needs and provide a code example for a blog post.

In a production code, you should use a step function state machine that waits until the task is completed.

  1. Why do I send the file via email, as it is already on the bucket?

For my needs, I want to upload the mp3 file to my website and remove it from my personal AWS account. You can alter the behavior as you wish.

Getting started

Prerequisites

  • Docker - install Docker. Required for the Lambda layer packaging process.
  • AWS CDK - Required for synth & deploying the AWS Cloudformation stack.
  • Python 10
  • poetry - Make sure to run poetry config --local virtualenvs.in-project true so all dependencies are installed in the project '.venv' folder.
  • For Windows based machines, use the Makefile_windows version (rename to Makefile). Default Makefile is for Mac/Linux.

Creating a Developer Environment

  1. Run make dev
  2. Run poetry install

Deploy CDK

Create a cloudformation stack by running make deploy.

Deleting the stack

CDK destroy can be run with make destroy.

Preparing Code for PR

Run make pr. This command will run all the required checks, pre commit hooks, linters, code formats, flake8 and tests, so you can be sure GitHub's pipeline will pass.

The command auto fixes errors in the code for you.

If there's an error in the pre-commit stage, it gets auto fixed. However, are required to run make pr again so it continues to the next stages.

Be sure to commit all the changes that make pr does for you.

Building dev/lambda_requirements.txt

lambda_requirements.txt

CDK requires a requirements.txt in order to create a zip file with the Lambda layer dependencies. It's based on the project's poetry.lock file.

``make deploy` command will generate it automatically for you.

dev_requirements.txt

This file is used during GitHub CI to install all the required Python libraries without using poetry.

File contents are created out of the Pipfile.lock.

make deploy make deps commands generate it automatically.

** How to turn a text file to turn into speech?

Put a text file (.txt) into the /text folder. Deploy the CDK stack with 'make deploy' command.

It will be uploaded into an S3 bucket and turned into an mp3 file sent to you via email.

The email address is hardcoded and can be found at service/logic/email.py file.

When you add a new file: you can either upload to directly to the bucket or add them to the /text folder and run 'make deploy'.

Code Contributions

Code contributions are welcomed. Read this guide.

Code of Conduct

Read our code of conduct here.

Connect

Credits

License

This library is licensed under the MIT License. See the LICENSE file.

About

This repository deploys a serverless service that takes text files uploaded to a bucket, converts them to an MP3 and sends the output to an email address

Resources

License

Code of conduct

Stars

Watchers

Forks

Sponsor this project

 

Packages

No packages published