When LLMs produce outputs that expose flaws in human society, should we be willing to listen to their insights? By now, many of you may have heard about Google’s new model, Gemini, generating images of racially diverse individuals in Nazi attire. The incident has prompted discussion about models’ blind spots, and about how we layer expert rules on top of their predictions to keep them from producing inappropriate results.
This issue is not uncommon in machine learning, especially when the training data is flawed or limited. I ran into it myself when predicting delivery times for packages sent to business offices. The model could accurately estimate when a package would arrive in the vicinity of the office, but it did not account for the fact that a delivery cannot be completed while the office is closed. To address this, we applied a simple rule that pushed any such prediction forward to the next hour the office was open, which improved the accuracy of the results, as sketched below.
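As a minimal sketch of that kind of post-processing rule, assuming a nine-to-five, Monday-to-Friday schedule (the actual hours and the function name here are illustrative, not details of the original system):

```python
from datetime import datetime, time, timedelta

# Hypothetical office schedule; the real system's hours were not stated.
OPEN = time(9, 0)
CLOSE = time(17, 0)
BUSINESS_DAYS = {0, 1, 2, 3, 4}  # Monday through Friday


def adjust_to_business_hours(predicted: datetime) -> datetime:
    """Roll a predicted delivery time forward to the next moment the office is open."""
    adjusted = predicted
    # If the prediction falls on a closed day or after closing time,
    # move it to the next day's opening time and re-check.
    while adjusted.weekday() not in BUSINESS_DAYS or adjusted.time() >= CLOSE:
        adjusted = datetime.combine(adjusted.date() + timedelta(days=1), OPEN)
    # If it falls before opening on a business day, snap it to that day's opening.
    if adjusted.time() < OPEN:
        adjusted = datetime.combine(adjusted.date(), OPEN)
    return adjusted
```

For example, a prediction of Saturday 14:00 or Friday 18:30 would both be rolled forward to Monday 09:00, while a Wednesday 13:00 prediction would pass through unchanged.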
However, this approach introduces its own complexity: there are now two sets of predictions to manage. The raw model output is what we use for performance monitoring and metrics, while the adjusted prediction is what the customer actually sees in the application. That split means the model’s impact has to be evaluated from a different perspective than its raw outputs alone.
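One way to keep the two streams from getting tangled is to carry both values explicitly through the system, so monitoring reads the raw output while the application reads the adjusted one. The structure below is again a hypothetical sketch, reusing the adjust_to_business_hours helper above rather than describing the real pipeline:

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class DeliveryPrediction:
    """Carries both outputs so monitoring and the customer-facing app never mix them up."""
    raw: datetime       # the model's unmodified prediction, used for metrics and monitoring
    adjusted: datetime  # the business-hours-adjusted prediction shown to the customer


def predict_delivery(model_output: datetime) -> DeliveryPrediction:
    # Log the raw value for offline evaluation; ship the adjusted value to the UI.
    return DeliveryPrediction(
        raw=model_output,
        adjusted=adjust_to_business_hours(model_output),
    )
```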
Similar challenges arise with LLMs like Gemini, where the provider may apply undisclosed prompt modifications to steer the model’s outputs. Without such adjustments, the models can reflect the biases and inequalities present in the content they were trained on, including racism, sexism, and other forms of discrimination.
While there is a desire to improve the representation of underrepresented populations, these tweaks may not be a sustainable solution: constantly modifying prompts to patch specific issues invites unintended consequences and makes the models’ outputs harder to manage.
Instead of solely criticizing the technology when faced with problematic outputs, we should take the opportunity to understand why these results occur. By engaging in thoughtful debates about the appropriateness of the model’s responses, we can make decisions that align with our values and principles.
In conclusion, finding a balance between reflecting the realities of human society and upholding ethical standards in LLM outputs is a continual challenge. Rather than seeking a perfect solution, we must be willing to adapt and reassess our approaches to ensure that these models serve a positive purpose while avoiding harmful impacts.