Tuesday, June 17, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

On-device real-time few-shot face stylization – Google Research Blog

September 25, 2023
in AI Technology
Reading Time: 3 mins read
0 0
A A
0
Share on FacebookShare on Twitter



Posted by Haolin Jia, Software program Engineer, and Qifei Wang, Senior Software program Engineer, Core ML

Lately, there was a rising curiosity in built-in augmented actuality (AR) experiences utilizing real-time face characteristic era and modifying features in cellular purposes. This consists of purposes briefly movies, digital actuality, and gaming. In consequence, there’s a want for light-weight and high-quality face era and modifying fashions, usually primarily based on generative adversarial community (GAN) strategies. Nevertheless, most GAN fashions are computationally complicated and require a big coaching dataset. Moreover, it is very important use GAN fashions responsibly.

On this submit, we introduce MediaPipe FaceStylizer, an environment friendly design for few-shot face stylization that addresses the challenges of mannequin complexity and information effectivity whereas adhering to Google’s accountable AI Ideas. The mannequin consists of a face generator and a face encoder used as GAN inversion to map pictures into latent code for the generator. Now we have developed a mobile-friendly synthesis community for the face generator, which generates high-quality pictures from coarse to positive granularities. Now we have additionally designed loss features for the auxiliary heads of the generator to distill the coed generator from the trainer StyleGAN mannequin, leading to a light-weight mannequin with excessive era high quality. The MediaPipe FaceStylizer resolution is obtainable in open supply by way of MediaPipe.

Customers can fine-tune the generator utilizing MediaPipe Mannequin Maker to be taught a method from one or a couple of pictures. They’ll then deploy the personalized mannequin to on-device face stylization purposes utilizing MediaPipe FaceStylizer. This enables for few-shot on-device face stylization.

To assist customers in adapting MediaPipe FaceStylizer to completely different kinds, we’ve got constructed an end-to-end pipeline. The pipeline features a GAN inversion encoder and an environment friendly face generator mannequin. Customers can fine-tune the mannequin with a couple of examples of the specified model pictures utilizing MediaPipe Mannequin Maker. The fine-tuning course of freezes the encoder module and solely fine-tunes the generator. The generator is educated to reconstruct a picture of an individual’s face within the model of the enter pictures. This enables MediaPipe FaceStylizer to adapt to personalised kinds and stylize take a look at pictures of actual human faces.

The generator utilized in MediaPipe FaceStylizer, known as BlazeStyleGAN, is predicated on the StyleGAN mannequin household. It accommodates a mapping community and a synthesis community. Nevertheless, we’ve got designed a extra environment friendly synthesis community to cut back computational complexity whereas sustaining era high quality. Now we have educated BlazeStyleGAN by distilling it from a trainer StyleGAN mannequin, utilizing a multi-scale perceptual loss and adversarial loss within the distillation course of.

Now we have additionally launched an environment friendly GAN inversion because the encoder to assist image-to-image stylization. The encoder is outlined by a MobileNet V2 spine and educated with pure face pictures.

Now we have documented the mannequin complexities when it comes to parameter numbers and computing FLOPs. BlazeStyleGAN considerably reduces the mannequin complexity in comparison with StyleGAN, whereas sustaining era high quality. Now we have benchmarked the inference time of MediaPipe FaceStylizer on numerous high-end cellular units and achieved real-time efficiency.

The mannequin has been educated with a various dataset of human faces and has been evaluated for equity. It performs properly and is balanced when it comes to human gender, skin-tone, and ages.

Now we have supplied visible samples of face stylization outcomes utilizing MediaPipe FaceStylizer, demonstrating high-quality face stylization for widespread kinds.

MediaPipe Options will likely be releasing MediaPipe FaceStylizer to the general public. Customers can make the most of MediaPipe Mannequin Maker to coach a personalized face stylization mannequin and deploy it to purposes throughout platforms utilizing the MediaPipe Duties FaceStylizer API.

We wish to acknowledge the contributions of assorted groups throughout Google that made this work attainable.



Source link

Tags: BlogfacefewshotGoogleOndevicerealtimeResearchstylization
Previous Post

Groundbreaking soft valve technology enabling sensing and control integration in soft robots

Next Post

Mobile InsiderQ2 2023 – B-Stock Solutions

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
Mobile InsiderQ2 2023 – B-Stock Solutions

Mobile InsiderQ2 2023 - B-Stock Solutions

The Kore.ai NLU Engines and When to Use Them

The Kore.ai NLU Engines and When to Use Them

The Main Pillars Of Software Development In The USA

The Main Pillars Of Software Development In The USA

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
Managing PDFs in Node.js with pdf-lib

Managing PDFs in Node.js with pdf-lib

November 16, 2023
Accenture creates a regulatory document authoring solution using AWS generative AI services

Accenture creates a regulatory document authoring solution using AWS generative AI services

February 6, 2024
Is C.AI Down? Here Is What To Do Now

Is C.AI Down? Here Is What To Do Now

January 10, 2024
The Importance of Choosing a Reliable Affiliate Network and Why Olavivo is Your Ideal Partner

The Importance of Choosing a Reliable Affiliate Network and Why Olavivo is Your Ideal Partner

October 30, 2023
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In