搜索 — ResearchTracker

This study critically evaluates gender assignment methods within academic contexts, employing a comparative analysis of diverse techniques, including a SVM classifier, gender-guesser, genderize.io, and a Cultural Consensus Theory based classifier. Emphasizing the significance of transparency, data sources, and methodological considerations, the research introduces nomquamgender, a cultural consensus-based method, and applies it to Teseo, a Spanish dissertation database. The results reveal a substantial reduction in the number of individuals with unknown gender compared to traditional methods relying on INE data. The nuanced differences in gender distribution underscore the importance of methodological choices in gender studies, urging for transparent, comprehensive, and freely accessible methods to enhance the accuracy and reliability of gender assignment in academic research. After reevaluating the problem of gender imbalances in the doctoral system we can conclude that it's still evident although the trend is clearly set for its reduction. Finaly, specific problems related to some disciplines, including STEM fields and seniority roles are found to be worth of attention in the nea

From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs

arXiv2024-07-05作者：Marion Bartl, Susan Leavy

Gender bias is not only prevalent in Large Language Models (LLMs) and their training data, but also firmly ingrained into the structural aspects of language itself. Therefore, adapting linguistic structures within LLM training data to promote gender-inclusivity can make gender representations within the model more inclusive. The focus of our work are gender-exclusive affixes in English, such as in 'show-girl' or 'man-cave', which can perpetuate gender stereotypes and binary conceptions of gender. We use an LLM training dataset to compile a catalogue of 692 gender-exclusive terms along with gender-neutral variants and from this, develop a gender-inclusive fine-tuning dataset, the 'Tiny Heap'. Fine-tuning three different LLMs with this dataset, we observe an overall reduction in gender-stereotyping tendencies across the models. Our approach provides a practical method for enhancing gender inclusivity in LLM training data and contributes to incorporating queer-feminist linguistic activism in bias mitigation research in NLP.

搜索结果：Gender

Gender assignment in doctoral theses: revisiting Teseo with a method based on cultural consensus theory

From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs

Leveraging Large Language Models to Measure Gender Representation Bias in Gendered Language Corpora

Relating Word Embedding Gender Biases to Gender Gaps: A Cross-Cultural Analysis

Building Bridges: A Dataset for Evaluating Gender-Fair Machine Translation into German

GG-BBQ: German Gender Bias Benchmark for Question Answering

Exploring Gender Bias in Large Language Models: An In-depth Dive into the German Language

Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs

Gender Lost In Translation: How Bridging The Gap Between Languages Affects Gender Bias in Zero-Shot Multilingual Translation

Stereotypical gender actions can be extracted from Web text

Gender and Digital Platform Work During Turbulent Times

Voice Passing : a Non-Binary Voice Gender Prediction System for evaluating Transgender voice transition

Overcoming Obstacles: Challenges of Gender Inequality in Undergraduate ICT Programs

Whose wife is it anyway? Assessing bias against same-gender relationships in machine translation

Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps

Gender Bias of LLM in Economics: An Existentialism Perspective

Gender-specific Call of Duty: A Note on the Neglect of Conscription in Gender Equality Indices

Neural Machine Translation Doesn't Translate Gender Coreference Right Unless You Make It

An Empirical Study of Gendered Stereotypes in Emotional Attributes for Bangla in Multilingual Large Language Models

Articulatory Configurations across Genders and Periods in French Radio and TV archives