搜索 — ResearchTracker

Flaky tests can make automated software testing unreliable due to their unpredictable behavior. These tests can pass or fail on the same code base on multiple runs. However, flaky tests often do not refer to any fault, even though they can cause the continuous integration (CI) pipeline to fail. A common type of flaky test is the order-dependent (OD) test. The outcome of an OD test depends on the order in which it is run with respect to other test cases. Several studies have explored the detection and repair of OD tests. However, their methods require re-runs of tests multiple times, that are not related to the order dependence. Hence, prioritizing potential OD tests is necessary to reduce the re-runs. In this paper, we propose a method to prioritize potential order-dependent tests. By analyzing shared static fields in test classes, we identify tests that are more likely to be order-dependent. In our experiment on 27 project modules, our method successfully prioritized all OD tests in 23 cases, reducing test executions by an average of 65.92% and unnecessary re-runs by 72.19%. These results demonstrate that our approach significantly improves the efficiency of OD test detection by l

Flaky Tests in a Large Industrial Database Management System: An Empirical Study of Fixed Issue Reports for SAP HANA

arXiv2026-02-03作者：Alexander Berndt, Thomas Bach, Sebastian Baltes

Flaky tests yield different results when executed multiple times for the same version of the source code. Thus, they provide an ambiguous signal about the quality of the code and interfere with the automated assessment of code changes. While a variety of factors can cause test flakiness, approaches to fix flaky tests are typically tailored to address specific causes. However, the prevalent root causes of flaky tests can vary depending on the programming language, application domain, or size of the software project. Since manually labeling flaky tests is time-consuming and tedious, this work proposes an LLMs-as-annotators approach that leverages intra- and inter-model consistency to label issue reports related to fixed flakiness issues with the relevant root cause category. This allows us to gain an overview of prevalent flakiness categories in the issue reports. We evaluated our labeling approach in the context of SAP HANA, a large industrial database management system. Our results suggest that SAP HANA's tests most commonly suffer from issues related to concurrency (23%, 130 of 559 analyzed issue reports). Moreover, our results suggest that different test types face different flak

搜索结果：Tests

Reduction of Test Re-runs by Prioritizing Potential Order Dependent Flaky Tests

Flaky Tests in a Large Industrial Database Management System: An Empirical Study of Fixed Issue Reports for SAP HANA

Improved Tests for Mediation

Completeness of Finitely Weighted Kleene Algebra With Tests

Tests of Classical Gravity with Radio Pulsars

A Beginner's Guide to Black Hole Imaging and Associated Tests of General Relativity

Tests of Three-Flavor Chiral Perturbation Theory

Alignment tests for low CMB multipoles

Circularly Symmetric Tests of Goodness-of-Fit

Embedding Kozen-Tiuryn Logic into Residuated One-Sorted Kleene Algebra with Tests

Probing Gravity -- Fundamental Aspects of Metric Theories and their Implications for Tests of General Relativity

KitBit: A New AI Model for Solving Intelligence Tests and Numerical Series

RMST-based multiple contrast tests in general factorial designs

Significance tests for comparing digital gene expression profiles

Solar-system tests of the relativistic gravity

Role of Statistical tests in Estimation of the Security of a New Encryption Algorithm

Testing Gravity in the Laboratory

TAM-Eval: Evaluating LLMs for Automated Unit Test Maintenance

Testing Gravity with Binary Black Hole Gravitational Waves

Testing General Relativity with Gravitational Waves: An Overview