AWS Serverless Text to Speech Service (Python)

Blogs website

Contact details | [email protected]

Features

A Serverless service that takes text files uploaded to a bucket, converts them to an MP3 and sends the output to an email address
Uses Amazon Polly
Python Serverless service with a recommended file structure.
CI/CD pipelines based on Github actions with python linters, static code analysis, complexity checks and style formatters.
Unit, integration and E2E test folders ready for implementation.

This is not a production ready code but more of an advanced POC.

I use it to convert my blog's text to audio for accessibility reasons.

It uploads any text file in the /text folder to S3, turns them into .mp3 files and sends them back to an email address.

The email address is hardcoded in the service/logic/email.py file and can be changed.

Here's a deep dive into the design: https://www.ranthebuilder.cloud/post/serverless-empowers-accessibility-convert-text-to-speech-with-amazon-polly

Important: make sure you enable SES to send emails to the email address you choose https://docs.aws.amazon.com/ses/latest/dg/creating-identities.html

Who gave me this Idea?

I was inspired to design a solution for myself after seeing this excellent YouTube video by Johannes Koch and Jimmy Dahlqvist.

Design

For the service design and further information checkout my blog post here.

Architecture

Flow of events:

Text file is uploaded to S3
A Lambda function is triggered with a 'create object' event.
The Lambda function reads the text file, and uses AWS Polly wrapper to start a polly text to speech task that will save the output the origin bucket as an .mp3 file
The Lambda function waits for the task to complete by polling the task status
Once completed, the function downloads the file and sends it as an email attachment to an email address of your choice

Design Considerations

Why do I use Lambda function? Why not a step function?

That's definitely an improvement, but this was just a quick POC to automate my personal needs and provide a code example for a blog post.

In a production code, you should use a step function state machine that waits until the task is completed.

Why do I send the file via email, as it is already on the bucket?

For my needs, I want to upload the mp3 file to my website and remove it from my personal AWS account. You can alter the behavior as you wish.

Getting started

Prerequisites

Docker - install Docker. Required for the Lambda layer packaging process.
AWS CDK - Required for synth & deploying the AWS Cloudformation stack.
Python 10
poetry - Make sure to run poetry config --local virtualenvs.in-project true so all dependencies are installed in the project '.venv' folder.
For Windows based machines, use the Makefile_windows version (rename to Makefile). Default Makefile is for Mac/Linux.

Creating a Developer Environment

Run make dev
Run poetry install

Deploy CDK

Create a cloudformation stack by running make deploy.

Deleting the stack

CDK destroy can be run with make destroy.

Preparing Code for PR

Run make pr. This command will run all the required checks, pre commit hooks, linters, code formats, flake8 and tests, so you can be sure GitHub's pipeline will pass.

The command auto fixes errors in the code for you.

If there's an error in the pre-commit stage, it gets auto fixed. However, are required to run make pr again so it continues to the next stages.

Be sure to commit all the changes that make pr does for you.

Building dev/lambda_requirements.txt

lambda_requirements.txt

CDK requires a requirements.txt in order to create a zip file with the Lambda layer dependencies. It's based on the project's poetry.lock file.

``make deploy` command will generate it automatically for you.

dev_requirements.txt

This file is used during GitHub CI to install all the required Python libraries without using poetry.

File contents are created out of the Pipfile.lock.

make deploy make deps commands generate it automatically.

** How to turn a text file to turn into speech?

Put a text file (.txt) into the /text folder. Deploy the CDK stack with 'make deploy' command.

It will be uploaded into an S3 bucket and turned into an mp3 file sent to you via email.

The email address is hardcoded and can be found at service/logic/email.py file.

When you add a new file: you can either upload to directly to the bucket or add them to the /text folder and run 'make deploy'.

Code Contributions

Code contributions are welcomed. Read this guide.

Code of Conduct

Read our code of conduct here.

Connect

Email: [email protected]
Blog Website RanTheBuilder
LinkedIn: ranisenberg
Twitter: IsenbergRan

Credits

AWS Polly wrapper examples (Python)

License

This library is licensed under the MIT License. See the LICENSE file.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.github		.github
.vscode		.vscode
cdk		cdk
service		service
tests		tests
text		text
.coveragerc		.coveragerc
.gitignore		.gitignore
.markdownlint.yaml		.markdownlint.yaml
.pre-commit-config.yaml		.pre-commit-config.yaml
.style		.style
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
app.py		app.py
banner.png		banner.png
cdk.context.json		cdk.context.json
cdk.json		cdk.json
hld.png		hld.png
mypy.ini		mypy.ini
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AWS Serverless Text to Speech Service (Python)

Features

Who gave me this Idea?

Design

Architecture

Design Considerations

Getting started

Prerequisites

Creating a Developer Environment

Deploy CDK

Deleting the stack

Preparing Code for PR

Building dev/lambda_requirements.txt

lambda_requirements.txt

dev_requirements.txt

** How to turn a text file to turn into speech?

Code Contributions

Code of Conduct

Connect

Credits

License

About

Releases 7

Sponsor this project

Packages

Languages

License

ran-isenberg/aws-text-to-speech

Folders and files

Latest commit

History

Repository files navigation

AWS Serverless Text to Speech Service (Python)

Features

Who gave me this Idea?

Design

Architecture

Design Considerations

Getting started

Prerequisites

Creating a Developer Environment

Deploy CDK

Deleting the stack

Preparing Code for PR

Building dev/lambda_requirements.txt

lambda_requirements.txt

dev_requirements.txt

** How to turn a text file to turn into speech?

Code Contributions

Code of Conduct

Connect

Credits

License

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 7

Sponsor this project

Packages 0

Languages

Packages