Search - ResearchTracker

Large language models (LLMs) can generate programs that pass unit tests, but passing tests does not guarantee reliable runtime behavior. We find that different correct solutions to the same task can show very different memory and performance patterns, which can lead to hidden operational risks. We present a framework to measure execution-time memory stability across multiple correct generations. At the solution level, we introduce Dynamic Mean Pairwise Distance (DMPD), which uses Dynamic Time Warping to compare the shapes of memory-usage traces after converting them into Monotonic Peak Profiles (MPPs) to reduce transient noise. Aggregating DMPD across tasks yields a model-level Model Instability Score (MIS). Experiments on BigOBench and CodeContests show substantial runtime divergence among correct solutions. Instability often increases with higher sampling temperature even when pass@1 improves. We also observe correlations between our stability measures and software engineering indicators such as cognitive and cyclomatic complexity, suggesting links between operational behavior and maintainability. Our results support stability-aware selection among passing candidates in CI/CD to
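The measurement pipeline described in this abstract can be sketched in a few lines. This is only an illustrative reading, not the authors' implementation: it assumes a Monotonic Peak Profile is the running maximum of a memory trace, uses a textbook Dynamic Time Warping distance with absolute-difference cost, and takes DMPD to be the unnormalized mean of pairwise distances. The function names are hypothetical.

```python
from itertools import combinations


def monotonic_peak_profile(trace):
    """Running maximum of a memory trace (assumed reading of 'MPP')."""
    profile, peak = [], float("-inf")
    for sample in trace:
        peak = max(peak, sample)
        profile.append(peak)
    return profile


def dtw_distance(a, b):
    """Textbook DTW with absolute-difference cost, O(len(a) * len(b))."""
    inf = float("inf")
    prev = [0.0] + [inf] * len(b)  # row for the empty prefix of a
    for x in a:
        curr = [inf] * (len(b) + 1)
        for j, y in enumerate(b, start=1):
            # Extend the cheapest of: insertion, deletion, match.
            curr[j] = abs(x - y) + min(prev[j], curr[j - 1], prev[j - 1])
        prev = curr
    return prev[len(b)]


def mean_pairwise_dtw(traces):
    """Mean DTW distance over all pairs of peak profiles (assumed DMPD form)."""
    profiles = [monotonic_peak_profile(t) for t in traces]
    pairs = list(combinations(profiles, 2))
    if not pairs:
        raise ValueError("need at least two traces")
    return sum(dtw_distance(p, q) for p, q in pairs) / len(pairs)
```

Under these assumptions, `mean_pairwise_dtw` would be called once per task with the memory traces of that task's passing solutions; how the per-task values are aggregated into a model-level score is not specified in the abstract.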

CURA: Size Isnt All You Need -- A Compact Universal Architecture for On-Device Intelligence

arXiv, 2025-09-29. Authors: Jae-Bum Seo, Muhammad Salman, Lismer Andres Caceres-Najarro

Existing on-device AI architectures for resource-constrained environments face two critical limitations: they lack compactness, with parameter requirements scaling proportionally to task complexity, and they exhibit poor generalizability, performing effectively only on specific application domains (e.g., models designed for regression tasks cannot adapt to natural language processing (NLP) applications). In this paper, we propose CURA, an architecture inspired by analog audio signal processing circuits that provides a compact and lightweight solution for diverse machine learning tasks across multiple domains. Our architecture offers three key advantages over existing approaches: (1) Compactness: it requires significantly fewer parameters regardless of task complexity; (2) Generalizability: it adapts seamlessly across regression, classification, complex NLP, and computer vision tasks; and (3) Complex pattern recognition: it can capture intricate data patterns while maintaining extremely low model complexity. We evaluated CURA across diverse datasets and domains. For compactness, it achieved equivalent accuracy using up to 2,500 times fewer parameters compared to baseline models. For


Neural Encoding Detection is Not All You Need for Synthetic Speech Detection

arXiv, 2026-04-17. Authors: Luca Cuccovillo, Xin Wang, Milica Gerhardt

This paper reviews the current state and emerging trends in synthetic speech detection. It outlines the main data-driven approaches, discusses the advantages and drawbacks of focusing future research solely on neural encoding detection, and offers recommendations for promising research directions. Unlike works that introduce new detection methods or datasets, this paper aims to guide future state-of-the-art research in the field and to highlight the risk of overcommitting to approaches that may not stand the test of time.


Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction

arXiv, 2025-01-13. Authors: Lukas Glaszner, Martin Zach

Diffusion models have been successfully applied to many inverse problems, including MRI and CT reconstruction. Researchers typically re-purpose models originally designed for unconditional sampling without modifications. Using two different posterior sampling algorithms, we show empirically that such large networks are not necessary. Our smallest model, effectively a ResNet, performs almost as well as an attention U-Net on in-distribution reconstruction, while being significantly more robust to distribution shifts. Furthermore, we introduce models trained on natural images and demonstrate that they can be used in both MRI and CT reconstruction, outperforming models trained on medical images in out-of-distribution cases. As a result of our findings, we strongly caution against simply re-using very large networks and encourage researchers to adapt the model complexity to the respective task. Moreover, we argue that a key step towards a general diffusion-based prior is training on natural images.


Goppa Codes: Key to High Efficiency and Reliability in Communications

arXiv, 2024-04-11. Authors: Behrooz Mosallaei, Farzaneh Ghanbari, Sepideh Farivar

In this paper, we study some algebraic geometry codes related to certain maximal curves. Quantum stabilizer codes obtained through the self-orthogonality of Hermitian codes do not always have good parameters. However, with appropriately chosen parameters, the quantum stabilizer code derived from the Hermitian self-orthogonal code has good parameters. We therefore investigate the quantum stabilizer code on a certain maximal curve and modify its parameters. Algebraic geometry codes show promise for enabling high data rate transmission over noisy power line communication channels.


Lensing Machines: Representing Perspective in Latent Variable Models

arXiv, 2022-01-20. Authors: Karthik Dinakar, Henry Lieberman

Many datasets represent a combination of different ways of looking at the same data that lead to different generalizations. For example, a corpus with examples generated by different people may be a mixture of many perspectives and can be viewed from different perspectives by others. It isn't always possible to represent the viewpoints by cleanly separating, in advance, the examples representing each viewpoint and training a separate model for each viewpoint. We introduce lensing, a mixed-initiative technique to extract lenses or mappings between machine-learned representations and perspectives of human experts, and to generate lensed models that afford multiple perspectives of the same dataset. We apply lensing to two classes of latent variable models, a mixed membership model and a matrix factorization model, in the context of two mental health applications, and we capture and imbue the perspectives of clinical psychologists into these models. Our work shows the benefits of the machine learning practitioner formally incorporating the perspective of a knowledgeable domain expert into their models rather than estimating unlensed models themselves in isolation.


Correlation of velocity and density contributions to spectroscopic channel maps: Reality check on Kalberla et.al (2022)

arXiv, 2022-02-16. Authors: Ka Ho Yuen, Ka Wai Ho, Alex Lazarian

The existence of magnetized turbulence in the interstellar HI is well accepted. A number of techniques to obtain the turbulence spectrum and the magnetic field direction and strength have been developed and successfully applied to HI spectroscopic data. To better separate the imprints of density and velocity fluctuations on the channel maps, a new theory-based technique, the Velocity Decomposition Algorithm (VDA, Yuen et al. 2021), has been created. The technique demonstrates that the intensity fluctuations are separated into a component pv that mostly arises from velocity fluctuations and a component pd that mostly arises from density fluctuations. The VDA helps to clarify the nature of the filamentary structure observed in channel maps. A recent publication (Kalberla et al. 2022, K22) claims that the application of VDA to HI4PI data yields a negative correlation of pv and pd, which according to the authors invalidates the technique since it requires that pv and pd have zero correlation. However, the quantities pv and pd given by VDA are naturally orthogonal, which can be trivially checked analytically or numerically. That means the correct application of the VDA to any data must provide zero correlation. T
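The numerical check mentioned in this abstract can be illustrated with a small sketch. The abstract does not state the decomposition formula, so the form used here is an assumption: pd is taken as the linear regression of the channel map p on the integrated intensity I, and pv as the residual. With that form, the covariance of pv and pd vanishes by construction, up to floating-point error. The function names and the toy data are hypothetical.

```python
from statistics import fmean


def covariance(x, y):
    """Population covariance of two equal-length sequences."""
    x_mean, y_mean = fmean(x), fmean(y)
    return fmean([(a - x_mean) * (b - y_mean) for a, b in zip(x, y)])


def vda_decompose(p, intensity):
    """Split p into (p_v, p_d), assuming p_d is the regression of p on I."""
    i_mean = fmean(intensity)
    # Regression slope: cov(p, I) / var(I).
    slope = covariance(p, intensity) / covariance(intensity, intensity)
    p_d = [slope * (b - i_mean) for b in intensity]  # density-like part
    p_v = [a - d for a, d in zip(p, p_d)]            # velocity-like residual
    return p_v, p_d


# Toy pixel values, purely illustrative.
p = [1.0, 4.0, 2.0, 7.0, 3.0]
intensity = [2.0, 3.0, 5.0, 8.0, 1.0]
p_v, p_d = vda_decompose(p, intensity)
print(covariance(p_v, p_d))  # expected to be zero up to rounding
```

Because pd is proportional to the mean-subtracted intensity and pv is the regression residual, any nonzero correlation in such a sketch would point to an implementation error and not to a property of the data.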


On the Fly Self-Organized Base Station Placement

arXiv, 2013-02-18. Authors: Hirley Alves, Mehdi Bennis, Walid Saad

In this paper, we address the deployment of base stations (BSs) in a one-dimensional network in which the users are randomly distributed. To account for the users' distribution when placing the BSs, we optimize the uplink MMSE sum rate. Moreover, given a massive number of antennas at the BSs, we propose a novel random matrix theory-based technique to obtain tight approximations for the MMSE sum rate in the uplink. We investigate a cooperative (CP) scenario where the BSs jointly decode the messages and a non-cooperative (NCP) scheme in which each BS can only decode its own users. Our results show that the CP strategy considerably outperforms the NCP case. Moreover, we show that there exists a trade-off in the BS deployment regarding the position of each BS. Thus, through location games we can optimize the position of each BS in order to maximize the system performance.


Reduction of Surgical Risk Through the Evaluation of Medical Imaging Diagnostics

arXiv, 2020-03-08. Authors: Marco A. V. M. Grinet, Nuno M. Garcia, Ana I. R. Gouveia

Computer-aided diagnosis (CAD) of Breast Cancer (BRCA) images has been an active area of research in recent years. The main goal of this research is to develop reliable automatic methods for detecting and diagnosing different types of BRCA from diagnostic images. In this paper, we present a review of the state-of-the-art CAD methods applied to magnetic resonance (MRI) and mammography images of BRCA patients. The review aims to provide an extensive introduction to different features extracted from BRCA images through texture and statistical analysis and to categorize deep learning frameworks and data structures capable of using metadata to aggregate relevant information to assist oncologists and radiologists. We divide the existing literature according to the imaging modality and into radiomics, machine learning, or a combination of both. We also emphasize the differences between each modality's and method's strengths and weaknesses and analyze their performance in detecting BRCA through a quantitative comparison. We compare the results of various approaches for implementing CAD systems for the detection of BRCA. Each approach's standard workflow components are reviewed and summary tables


What goes on inside rumour and non-rumour tweets and their reactions: A Psycholinguistic Analyses

arXiv, 2021-11-09. Authors: Sabur Butt, Shakshi Sharma, Rajesh Sharma

In recent years, the problem of rumours on online social media (OSM) has attracted a lot of attention. Researchers have started investigating it from two main directions: the first is the descriptive analysis of rumours, and the second is proposing techniques to detect (or classify) rumours. In the descriptive line of work, where researchers have tried to analyse rumours using NLP approaches, there isn't much emphasis on psycholinguistic analyses of social media text. These kinds of analyses on rumour case studies are vital for drawing meaningful conclusions to mitigate misinformation. For our analysis, we explored the PHEME9 rumour dataset (consisting of 9 events), including source tweets (both rumour and non-rumour categories) and response tweets. We compared the rumour and non-rumour source tweets and then their corresponding reply (response) tweets to understand how they differ linguistically for every incident. Furthermore, we also evaluated whether these features can be used for classifying rumour vs. non-rumour tweets through machine learning models. To this end, we employed various classical and ensemble-based approaches. To filter out the highly discriminative psycholinguistic features, we


Search results: isnt

Correctness isnt Efficiency: Runtime Memory Divergence in LLM-Generated Code

CURA: Size Isnt All You Need -- A Compact Universal Architecture for On-Device Intelligence

Neural Encoding Detection is Not All You Need for Synthetic Speech Detection

Bigger Isn't Always Better: Towards a General Prior for Medical Image Reconstruction

Goppa Codes: Key to High Efficiency and Reliability in Communications

Lensing Machines: Representing Perspective in Latent Variable Models

Correlation of velocity and density contributions to spectroscopic channel maps: Reality check on Kalberla et.al (2022)

On the Fly Self-Organized Base Station Placement

Reduction of Surgical Risk Through the Evaluation of Medical Imaging Diagnostics

What goes on inside rumour and non-rumour tweets and their reactions: A Psycholinguistic Analyses