Thursday, May 8, 2025
News PouroverAI

A new generative engine and three voices are now generally available on Amazon Polly

May 9, 2024
in Cloud & Programming
Reading Time: 3 mins read


Today, we are pleased to announce the general availability of the generative engine of Amazon Polly featuring three voices: Ruth and Matthew in American English and Amy in British English. This new generative engine has been trained with a mix of publicly available and proprietary data, encompassing various voices, languages, and styles. It boasts the highest precision in rendering context-dependent prosody, pausing, spelling, dialectal properties, foreign word pronunciation, and more.

Amazon Polly is an advanced machine learning (ML) service that seamlessly converts text into natural-sounding speech, known as text-to-speech (TTS) technology. With Amazon Polly, users can now access high-quality, humanlike voices in multiple languages, allowing for the selection of the ideal voice for different locales and countries to enhance speech-enabled applications.

Amazon Polly offers a range of voice options, including neural, long-form, and generative voices, all of which significantly improve speech quality and produce highly expressive, emotionally adept speech. You can customize speech rate, pitch, and volume with Speech Synthesis Markup Language (SSML) tags, and fast response times support lifelike voices and engaging user experiences.
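As a rough illustration of the SSML customization described above, the sketch below wraps plain text in a prosody element using only the Python standard library. The helper name build_ssml and its default values are hypothetical, and the SSML tags and attributes that each engine honors vary, so check the Amazon Polly SSML documentation for the engine in use before relying on a given attribute.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text: str, rate: str = "medium", volume: str = "medium") -> str:
    """Wrap plain text in <speak><prosody ...> so rate and volume can be set.

    build_ssml is a hypothetical helper, not part of any AWS SDK.
    """
    # Escape &, < and > so the input text cannot break the SSML document.
    body = escape(text)
    # quoteattr adds the surrounding quotes and escapes the attribute value.
    return (
        f"<speak><prosody rate={quoteattr(rate)} volume={quoteattr(volume)}>"
        f"{body}</prosody></speak>"
    )


if __name__ == "__main__":
    print(build_ssml("Tom & Jerry", rate="slow", volume="loud"))
```

The resulting string can be passed to Amazon Polly as SSML input, for example with the CLI option --text-type ssml.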

With the addition of the generative engine, Amazon Polly now supports four voice engines: standard, neural, long-form, and generative.

Standard TTS voices, introduced in 2016, utilize traditional concatenative synthesis, stringing together phonemes of recorded speech to produce natural-sounding synthesized speech. However, variations in speech and segmentation techniques limit the quality of speech.

Neural TTS (NTTS) voices, introduced in 2019, employ a sequence-to-sequence neural network to convert phonemes into spectrograms and a neural vocoder for generating audio signals, resulting in even higher quality humanlike voices.

Long-form voices, introduced in 2023, utilize cutting-edge deep learning TTS technology to captivate listeners’ attention for longer content like news articles, training materials, and marketing videos.

In February 2024, Amazon introduced the Big Adaptive Streamable TTS with Emergent abilities (BASE) model, enabling the generative engine in Amazon Polly to create humanlike synthetically generated voices for use in various applications.

Here are the new generative voices:

| Name | Locale | Gender | Language | Sample prompt |
|---|---|---|---|---|
| Ruth | en_US | Female | English (US) | Selma was lying on the ground halfway down the steps. ‘Selma! Selma!’ we shouted in panic. |
| Matthew | en_US | Male | English (US) | The guards were standing outside with some of our neighbours, listening to a transistor radio. ‘Any good news?’ I asked. ‘No, we’re listening to the names of people who were killed yesterday,’ Bruno replied. |
| Amy | en_GB | Female | English (British) | ‘What are you looking at?’ he said as he stood over me. They got off the bus and started searching the baggage compartment. The tension on the bus was like a dark, menacing cloud that hovered above us. |

The NTTS voices and Generative voices columns of the original table held audio samples, which are not reproduced here.

You can select from these voice options to suit your application and use case. For more information on the generative engine, refer to the Generative voices section in the AWS documentation.

To get started, access the new generative voices through the AWS Management Console, AWS Command Line Interface (AWS CLI), or AWS SDKs.

To begin, open the Amazon Polly console in the US East (N. Virginia) Region and choose Text-to-Speech in the left pane. Selecting Ruth or Matthew in English (US), or Amy in English (British), makes the Generative engine available. Enter your text, then listen to or download the generated speech.

Using the CLI, you can list the voices that utilize the new generative engine:

$ aws polly describe-voices --output json --region us-east-1 \
| jq -r '.Voices[] | select(.SupportedEngines | index("generative")) | .Name'

Matthew
Amy
Ruth
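
The same filter can be sketched in Python. The function below mirrors the jq expression on a DescribeVoices-shaped dictionary. The function names and the sample data are hypothetical; the optional fetch_voices helper assumes boto3 is installed and AWS credentials are configured, and it ignores pagination for brevity.

```python
from typing import Any


def generative_voice_names(response: dict[str, Any]) -> list[str]:
    """Return names of voices whose SupportedEngines include 'generative'."""
    return [
        voice["Name"]
        for voice in response.get("Voices", [])
        if "generative" in voice.get("SupportedEngines", [])
    ]


def fetch_voices(region: str = "us-east-1") -> dict[str, Any]:
    """Call DescribeVoices through boto3 (needs boto3 and AWS credentials)."""
    import boto3  # imported lazily so the filter above works without boto3

    return boto3.client("polly", region_name=region).describe_voices()


if __name__ == "__main__":
    # Hand-written sample shaped like a DescribeVoices response.
    # The entries are illustrative only, not real service output.
    sample = {
        "Voices": [
            {"Name": "Matthew", "SupportedEngines": ["generative", "neural"]},
            {"Name": "ExampleVoice", "SupportedEngines": ["standard"]},
        ]
    }
    print(generative_voice_names(sample))
```

Keeping the filter separate from the network call makes it easy to check against saved output from the CLI command above.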

Now, run the synthesize-speech CLI command to synthesize sample text into an audio file (hello.mp3) using the generative engine and a supported voice ID.

$ aws polly synthesize-speech --output-format mp3 --region us-east-1 \
--text "Hello. This is my first generative voices!" \
--voice-id Matthew --engine generative hello.mp3
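
For comparison, here is a minimal Python sketch of the same request, assuming boto3 and configured AWS credentials. The helper name synthesize_to_file is hypothetical; it takes the client as an argument so that it can be exercised with a stand-in object, and it calls the SDK's synthesize_speech operation with the same parameters as the CLI command above.

```python
from pathlib import Path


def synthesize_to_file(polly_client, text: str, out_path: str,
                       voice_id: str = "Matthew") -> int:
    """Synthesize text with the generative engine and save the MP3 audio.

    polly_client is expected to behave like boto3.client("polly").
    Returns the number of bytes written to out_path.
    """
    response = polly_client.synthesize_speech(
        Text=text,
        OutputFormat="mp3",
        VoiceId=voice_id,
        Engine="generative",
    )
    # AudioStream is a file-like streaming body; read it fully, then save.
    audio = response["AudioStream"].read()
    return Path(out_path).write_bytes(audio)


if __name__ == "__main__":
    import boto3  # requires boto3 and configured AWS credentials

    client = boto3.client("polly", region_name="us-east-1")
    synthesize_to_file(client, "Hello. This is my first generative voices!", "hello.mp3")
```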

For more code examples using AWS SDKs, visit the Code and application examples section in the AWS documentation. Explore Java and Python code examples, application examples for web applications in Java or Python, as well as iOS and Android applications.

The new generative voices of Amazon Polly are now available in the US East (N. Virginia) Region. You pay only for what you use, based on the number of characters converted to speech. Learn more on the Amazon Polly Pricing page.

Try out the new generative voices in the Amazon Polly console today, and send feedback to AWS re:Post for Amazon Polly or through your usual AWS Support contacts.

— Channy


