Friday, May 16, 2025
News PouroverAI
Visit PourOver.AI
No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing
News PouroverAI
No Result
View All Result

World scale inverse reinforcement learning in Google Maps – Google Research Blog

September 12, 2023
in AI Technology
Reading Time: 2 mins read
0 0
A A
0
Share on FacebookShare on Twitter



Posted by Matt Barnes, Software Engineer, Google Research

Routing in Google Maps is a highly useful and frequently used feature. Determining the best route from point A to point B involves considering various factors such as estimated time of arrival, tolls, directness, road conditions, and user preferences. To understand these preferences, we analyze real-world travel patterns using inverse reinforcement learning (IRL). IRL is a technique that involves recovering the underlying reward function based on observed sequential decision making behavior.

Scaling IRL algorithms has been a challenge due to the need to solve a reinforcement learning subroutine at each update step. The sheer size of world-scale Markov decision processes (MDPs) makes it difficult to fit into memory for computation. Additionally, when applying IRL to routing, it is necessary to consider all reasonable routes between origin and destination, making it difficult to break the MDP into smaller components.

In our work, “Massively Scalable Inverse Reinforcement Learning in Google Maps,” we address these scalability limitations by introducing advances in graph compression, parallelization, and a new IRL algorithm called Receding Horizon Inverse Planning (RHIP). RHIP allows fine-grained control over performance trade-offs and has achieved a 16-24% improvement in global route match rate compared to the suggested route in Google Maps. This represents the largest instance of IRL in a real-world setting to date.

One of the benefits of IRL is that it can handle goal-conditioned problems, where the MDP changes slightly based on the destination state. By learning the reward function, we can use a powerful inference-time trick to evaluate the rewards once in an offline batch setting, saving the results to an in-memory database. This eliminates the need for online inference of a parameterized model or policy, resulting in improved serving costs and latency.

To scale IRL to world-sized MDPs, we compress the graph and shard the global MDP using a sparse Mixture of Experts (MoE) based on geographic regions. We then apply classic IRL algorithms to solve the local MDPs and estimate the loss. RHIP, our generalized IRL algorithm, combines robust yet expensive stochastic policies in the local region with cheaper deterministic planners beyond a certain horizon. This allows us to control computational costs and discover the optimal performance sweet spot.

The RHIP policy has shown significant improvements in global route match rate for driving and two-wheelers, providing more accurate and faster routes compared to other IRL policies. By examining road segments with large differences in learned rewards compared to baseline rewards, we can further improve Google Maps routes.

In conclusion, by introducing scalability advancements to classic IRL algorithms, we have been able to train reward models on large-scale problems, making this the largest instance of IRL in a real-world setting to date. For more details on our work, please refer to the paper.

Acknowledgements: We would like to thank all the contributors and teams involved in this project for their valuable discussions and suggestions.



Source link

Tags: BlogGoogleinverseLearningMapsreinforcementResearchscaleWorld
Previous Post

Market Awaits News on Inflation and Crypto ETFs – Blockchain News, Opinion, TV and Jobs

Next Post

Introducing OpenAI Dublin

Related Posts

How insurance companies can use synthetic data to fight bias
AI Technology

How insurance companies can use synthetic data to fight bias

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset
AI Technology

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper
AI Technology

Decoding Decoder-Only Transformers: Insights from Google DeepMind’s Paper

June 9, 2024
How Game Theory Can Make AI More Reliable
AI Technology

How Game Theory Can Make AI More Reliable

June 9, 2024
Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs
AI Technology

Buffer of Thoughts (BoT): A Novel Thought-Augmented Reasoning AI Approach for Enhancing Accuracy, Efficiency, and Robustness of LLMs

June 9, 2024
Deciphering Doubt: Navigating Uncertainty in LLM Responses
AI Technology

Deciphering Doubt: Navigating Uncertainty in LLM Responses

June 9, 2024
Next Post
Introducing OpenAI Dublin

Introducing OpenAI Dublin

Key Data Science Concepts Taught in Online Learning Platforms

Key Data Science Concepts Taught in Online Learning Platforms

The Impact of AI-Powered Predictive Analytics on Marketing Strategies in the Era of Big Data

The Impact of AI-Powered Predictive Analytics on Marketing Strategies in the Era of Big Data

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Trending
  • Comments
  • Latest
Is C.AI Down? Here Is What To Do Now

Is C.AI Down? Here Is What To Do Now

January 10, 2024
23 Plagiarism Facts and Statistics to Analyze Latest Trends

23 Plagiarism Facts and Statistics to Analyze Latest Trends

June 4, 2024
Porfo: Revolutionizing the Crypto Wallet Landscape

Porfo: Revolutionizing the Crypto Wallet Landscape

October 9, 2023
A Complete Guide to BERT with Code | by Bradney Smith | May, 2024

A Complete Guide to BERT with Code | by Bradney Smith | May, 2024

May 19, 2024
How To Build A Quiz App With JavaScript for Beginners

How To Build A Quiz App With JavaScript for Beginners

February 22, 2024
Saginaw HMI Enclosures and Suspension Arm Systems from AutomationDirect – Library.Automationdirect.com

Saginaw HMI Enclosures and Suspension Arm Systems from AutomationDirect – Library.Automationdirect.com

December 6, 2023
Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

Can You Guess What Percentage Of Their Wealth The Rich Keep In Cash?

June 10, 2024
AI Compared: Which Assistant Is the Best?

AI Compared: Which Assistant Is the Best?

June 10, 2024
How insurance companies can use synthetic data to fight bias

How insurance companies can use synthetic data to fight bias

June 10, 2024
5 SLA metrics you should be monitoring

5 SLA metrics you should be monitoring

June 10, 2024
From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

From Low-Level to High-Level Tasks: Scaling Fine-Tuning with the ANDROIDCONTROL Dataset

June 10, 2024
UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

UGRO Capital: Targeting to hit milestone of Rs 20,000 cr loan book in 8-10 quarters: Shachindra Nath

June 10, 2024
Facebook Twitter LinkedIn Pinterest RSS
News PouroverAI

The latest news and updates about the AI Technology and Latest Tech Updates around the world... PouroverAI keeps you in the loop.

CATEGORIES

  • AI Technology
  • Automation
  • Blockchain
  • Business
  • Cloud & Programming
  • Data Science & ML
  • Digital Marketing
  • Front-Tech
  • Uncategorized

SITEMAP

  • Disclaimer
  • Privacy Policy
  • DMCA
  • Cookie Privacy Policy
  • Terms and Conditions
  • Contact us

Copyright © 2023 PouroverAI News.
PouroverAI News

No Result
View All Result
  • Home
  • AI Tech
  • Business
  • Blockchain
  • Data Science & ML
  • Cloud & Programming
  • Automation
  • Front-Tech
  • Marketing

Copyright © 2023 PouroverAI News.
PouroverAI News

Welcome Back!

Login to your account below

Forgotten Password? Sign Up

Create New Account!

Fill the forms bellow to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In