搜索结果：comparative

共找到 20 条结果

高级筛选 ▾

Comparative Separation: Evaluating Separation on Comparative Judgment Test Data

arXiv2026-01-11作者：Xiaoyin Xi, Neeku Capak, Kate Stockwell

This research seeks to benefit the software engineering society by proposing comparative separation, a novel group fairness notion to evaluate the fairness of machine learning software on comparative judgment test data. Fairness issues have attracted increasing attention since machine learning software is increasingly used for high-stakes and high-risk decisions. It is the responsibility of all software developers to make their software accountable by ensuring that the machine learning software do not perform differently on different sensitive groups -- satisfying the separation criterion. However, evaluation of separation requires ground truth labels for each test data point. This motivates our work on analyzing whether separation can be evaluated on comparative judgment test data. Instead of asking humans to provide the ratings or categorical labels on each test data point, comparative judgments are made between pairs of data points such as A is better than B. According to the law of comparative judgment, providing such comparative judgments yields a lower cognitive burden for humans than providing ratings or categorical labels. This work first defines the novel fairness notion c

搜索结果：comparative

Comparative Separation: Evaluating Separation on Comparative Judgment Test Data

Comparing Without Saying: A Dataset and Benchmark for Implicit Comparative Opinion Mining from Same-User Reviews

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge

Pre-training Language Models for Comparative Reasoning

Applied Explainability for Large Language Models: A Comparative Study

Efficient Story Point Estimation With Comparative Learning

Multidimensional Sorting: Comparative Statics

Six Llamas: Comparative Religious Ethics Through LoRA-Adapted Language Models

A Comparative Study of PyCaret AutoML and CNN-BiLSTM for Binary Hate Speech Detection in Indonesian Twitter

A Comparative Analysis of Machine Learning and Deep Learning Models for Tweet Sentiment Classification: A Case Study on the Sentiment140 Dataset

Comparative evaluation of future collider options

Finetuning LLMs for Comparative Assessment Tasks

Yield Curve Forecasting using Machine Learning and Econometrics: A Comparative Analysis

Comparative Analysis of AutoML and BiLSTM Models for Cyberbullying Detection on Indonesian Instagram Comments

GCRE-GPT: A Generative Model for Comparative Relation Extraction

Bringing Comparative Cognition To Computers

Robust Probabilistic Load Forecasting for a Single Household: A Comparative Study from SARIMA to Transformers on the REFIT Dataset

Comparing Apples to Apples: Generating Aspect-Aware Comparative Sentences from User Reviews

Estimation of the surface mechanical properties of soft tissues mimicking phantoms using impact analyses: a comparative study

Challenges of Heterogeneity in Big Data: A Comparative Study of Classification in Large-Scale Structured and Unstructured Domains

搜索结果：comparative

Comparative Separation: Evaluating Separation on Comparative Judgment Test Data

Comparing Without Saying: A Dataset and Benchmark for Implicit Comparative Opinion Mining from Same-User Reviews

NeuroComparatives: Neuro-Symbolic Distillation of Comparative Knowledge

Pre-training Language Models for Comparative Reasoning

Applied Explainability for Large Language Models: A Comparative Study

Efficient Story Point Estimation With Comparative Learning

Multidimensional Sorting: Comparative Statics

Six Llamas: Comparative Religious Ethics Through LoRA-Adapted Language Models

A Comparative Study of PyCaret AutoML and CNN-BiLSTM for Binary Hate Speech Detection in Indonesian Twitter

A Comparative Analysis of Machine Learning and Deep Learning Models for Tweet Sentiment Classification: A Case Study on the Sentiment140 Dataset

Comparative evaluation of future collider options

Finetuning LLMs for Comparative Assessment Tasks

Yield Curve Forecasting using Machine Learning and Econometrics: A Comparative Analysis

Comparative Analysis of AutoML and BiLSTM Models for Cyberbullying Detection on Indonesian Instagram Comments

GCRE-GPT: A Generative Model for Comparative Relation Extraction

Bringing Comparative Cognition To Computers

Robust Probabilistic Load Forecasting for a Single Household: A Comparative Study from SARIMA to Transformers on the REFIT Dataset

Comparing Apples to Apples: Generating Aspect-Aware Comparative Sentences from User Reviews

Estimation of the surface mechanical properties of soft tissues mimicking phantoms using impact analyses: a comparative study

Challenges of Heterogeneity in Big Data: A Comparative Study of Classification in Large-Scale Structured and Unstructured Domains