Sunday, June 8, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

Streamline diarization using AI as an assistive technology: ZOO Digital’s story

February 20, 2024
in Data Science & ML
Reading Time: 2 mins read
0 0
A A
0
Share on FacebookShare on Twitter



ZOO Digital offers comprehensive localization and media services for adapting original TV and movie content to various languages, regions, and cultures. This facilitates globalization for top content creators worldwide. Renowned in the entertainment industry, ZOO Digital provides high-quality localization and media services on a large scale, including dubbing, subtitling, scripting, and compliance. Traditional localization processes involve manual speaker diarization, where audio streams are segmented based on the speaker’s identity before dubbing into another language can occur. This manual process can be time-consuming, taking 1–3 hours to localize a 30-minute episode. ZOO Digital aims to streamline localization to under 30 minutes through automation.

To achieve this goal, ZOO Digital collaborated with AWS Prototyping to deploy scalable machine learning (ML) models for diarizing media content using Amazon SageMaker, focusing on the WhisperX model. By leveraging automation, ZOO Digital seeks to accelerate content localization workflows to meet the growing demand for localized content.

The collaboration involved storing original media files in an Amazon Simple Storage Service (Amazon S3) bucket, triggering an AWS Lambda function when new files are detected. The Lambda function then invoked the SageMaker endpoint for inference using the WhisperX model, which combines transcription, alignment, and diarization for media assets. WhisperX utilizes models from Hugging Face, including the Whisper model for transcriptions, the Wav2Vec2 model for timestamp alignment, and the pyannote model for diarization.

To host the WhisperX model on SageMaker, model artifacts were pre-downloaded and saved in the serving container during initiation. An inference script was created to load the models and run the transcription, alignment, and diarization processes during inference. The collaboration demonstrated the potential of deploying WhisperX on SageMaker for efficient and cost-effective processing of large media files, such as movies and TV series.

Overall, ZOO Digital’s collaboration with AWS Prototyping showcased the benefits of leveraging machine learning and automation to enhance the localization process for content creators and entertainment industry professionals.



Source link

Tags: assistivediarizationDigitalsStoryStreamlinetechnologyZOO
Previous Post

G2’s Best Software 2024 Winners

Next Post

Exploring Gemini 1.5: How Google’s Latest Multimodal AI Model Elevates the AI Landscape Beyond Its Predecessor

Related Posts

AI Compared: Which Assistant Is the Best?
Data Science & ML

AI Compared: Which Assistant Is the Best?

June 10, 2024
5 Machine Learning Models Explained in 5 Minutes
Data Science & ML

5 Machine Learning Models Explained in 5 Minutes

June 7, 2024
Cohere Picks Enterprise AI Needs Over ‘Abstract Concepts Like AGI’
Data Science & ML

Cohere Picks Enterprise AI Needs Over ‘Abstract Concepts Like AGI’

June 7, 2024
How to Learn Data Analytics – Dataquest
Data Science & ML

How to Learn Data Analytics – Dataquest

June 6, 2024
Adobe Terms Of Service Update Privacy Concerns
Data Science & ML

Adobe Terms Of Service Update Privacy Concerns

June 6, 2024
Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart
Data Science & ML

Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart

June 6, 2024
Next Post
Exploring Gemini 1.5: How Google’s Latest Multimodal AI Model Elevates the AI Landscape Beyond Its Predecessor

Exploring Gemini 1.5: How Google's Latest Multimodal AI Model Elevates the AI Landscape Beyond Its Predecessor

10 cobots guarantee ‘just in time’ manufacture gearboxes

10 cobots guarantee ‘just in time’ manufacture gearboxes

More intense exercise reduces post-concussion anxiety in teens

More intense exercise reduces post-concussion anxiety in teens

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
Accenture creates a regulatory document authoring solution using AWS generative AI services

Accenture creates a regulatory document authoring solution using AWS generative AI services

February 6, 2024
Managing PDFs in Node.js with pdf-lib

Managing PDFs in Node.js with pdf-lib

November 16, 2023
Graph neural networks in TensorFlow – Google Research Blog

Graph neural networks in TensorFlow – Google Research Blog

February 6, 2024
13 Best Books, Courses and Communities for Learning React — SitePoint

13 Best Books, Courses and Communities for Learning React — SitePoint

February 4, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In