Description
Description:
Amazon Polly is a service that turns text into lifelike speech, allowing you to create applications that talk, and build entirely new categories of speech-enabled products.
Polly’s Text-to-Speech (TTS) service uses advanced deep learning technologies to synthesize natural sounding human speech. With dozens of lifelike voices across a broad set of languages, you can build speech-enabled applications that work in many different countries.
In addition to Standard TTS voices,
Amazon Polly offers
Neural Text-to-Speech (NTTS) voices that deliver advanced improvements in speech quality through a new machine learning approach. Polly’s Neural TTS technology also supports two speaking styles that allow you to better match the delivery style of the speaker to the application: a
Newscaster reading style that is tailored to news narration use cases, and a
Conversational speaking style that is ideal for two-way communication like telephony applications.
Online Demo
Features of Amazon Polly
- Support for over 33+ Languages and Dialects
- Support for over 87+ Different Voices and Accents
- Powered By:
- Natural sounding voices (Neural Voices)
- Advanced Deep Learning Technology
- Various Combination of SSML effects for all Voices
- Mix up to 20 voices in a single synthesize task
- Synthesize up to 60K mixed voice synthesize text with just few clicks
- Powerful Sound Studio that supports 2 audio formats
- Add Background Music to your text
- Merge synthesize results with similar formats
- Multiple Audio Output Formats:
- Store & redistribute speech easily via social media
- Near Real-time text synthesize
- Customize & control speech output
- Optimize Your Streaming Audio
- Adjust Speech Rate, Pitch, and Loudness
- Adjust Speaking Emphasis
- Pronounce digits/dates/words/abbreviations properly
- Add work/phrase replacement effect
- Mute/Beep Out any part of text/sentence
- Store results in:
- Local Server
- Amazon S3
- Wasabi Storage
- Conveniently Share synthesize results or Download
- Fully Responsive Interface
- Closely Monitor Estimated Spending for Cloud TTS Services
- One Click Auto Update Option
- Developed with PHP 7.4.x and Laravel 8.4.x
- Detailed and Comprehensive Documentation
Cloud Vendor Text to Speech Prices
Notes
Please note, for the script to work correctly, you need to have valid AWS account.
Latest Changes
22.04.2022 - 2.0
- New: Full redesign with Laravel Framework
- New: Powerful integrated Sound Studio
- New: Mixing up to 20 voices in a single synthesize task
16.05.2020 - 1.5
- Update: Standard Voices character limit increase
- Update: Neural Voices character limit increase
- Update: Direct Keys include simplified
- Update: Documentation
17.03.2020 - 1.4
- Update: Support for raw PCM audio stream formats added for Large Text
- Fix: JS bug fixes
30.01.2020 - 1.3
- Update: Support for Neural TTS added, provides high quality life like voices
- Update: Support for Large Text added, output results are directly sent to Amazon S3
- Update: Additional voice effects are added for Neural TTS
- Update: Additional voice effects are added for Standard TTS
- Fix: Minor bug fixes
16.11.2019 - 1.2
- Update: AWS PHP SDK v3 is now included with the package
- Update: App can now run directly with only IAM Access and Secret Access Keys
- Update: Additional voice effects are added
11.11.2019 - 1.1
- Fix: Audio Player play/pause fix during direct play
- Fix: Additional Settings dropdown sign fix
29.10.2019 - 1.0
- Initial Release