Commercial large language models (LLMs) have recently gained memory features that let them personalize their responses. Because this memory retains details such as user demographics and individual characteristics, an LLM can adjust its behavior based on personal information. The effects of placing personalized information in the context, however, have not been fully evaluated, and personalization can be problematic, particularly on sensitive topics. The paper examines how several state-of-the-art LLMs behave under different personalization scenarios, with a particular focus on hate speech. The researchers prompt the models to adopt national identities and to work in different languages while detecting hate speech. The results show that personalizing the context significantly changes the models' answers in this sensitive area. To reduce these unwanted biases, the researchers penalize hate speech classifications that are inconsistent between runs with and without country- or language-specific context. The resulting models perform better both in personalized settings and when no context is given.
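The consistency idea can be illustrated with a small sketch. The summary above does not say how the paper computes its penalty, so everything below is an assumption made for illustration: the function names `inconsistency_rate` and `penalized_loss`, the `weight` parameter, and the choice of a simple label flip rate as the penalty are all hypothetical. The sketch compares the label a model assigns to each text without context against the label it assigns once a country or language context is added, then adds the share of disagreements to a base task loss.

```python
from typing import Sequence


def inconsistency_rate(base_labels: Sequence[int],
                       personalized_labels: Sequence[int]) -> float:
    """Fraction of examples whose predicted label changes once
    country- or language-specific context is added to the prompt."""
    if len(base_labels) != len(personalized_labels):
        raise ValueError("both label sequences must cover the same examples")
    if not base_labels:
        return 0.0
    flips = sum(b != p for b, p in zip(base_labels, personalized_labels))
    return flips / len(base_labels)


def penalized_loss(task_loss: float,
                   base_labels: Sequence[int],
                   personalized_labels: Sequence[int],
                   weight: float = 1.0) -> float:
    """Hypothetical objective: task loss plus a weighted flip-rate penalty.
    This is an illustrative stand-in, not the paper's formulation."""
    return task_loss + weight * inconsistency_rate(base_labels, personalized_labels)


# Toy labels: 1 = hate speech, 0 = not hate speech.
no_context = [1, 0, 1, 1]
with_persona = [1, 1, 1, 0]  # two of the four predictions flip

rate = inconsistency_rate(no_context, with_persona)
loss = penalized_loss(0.30, no_context, with_persona, weight=0.5)
print(rate, loss)
```

A flip rate over hard labels is not differentiable, so an actual training setup would more plausibly penalize the distance between the two predicted probability distributions; the flip rate is used here only because it makes the notion of "inconsistent classification" easy to see.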

https://arxiv.org/abs/2505.02252v1

