Search - ResearchTracker

Room impulse responses (RIRs) are essential for many acoustic signal processing tasks, yet measuring them densely across space is often impractical. In this work, we propose RIR-Former, a grid-free, one-step feed-forward model for RIR reconstruction. By introducing a sinusoidal encoding module into a transformer backbone, our method effectively incorporates microphone position information, enabling interpolation at arbitrary array locations. Furthermore, a segmented multi-branch decoder is designed to separately handle early reflections and late reverberation, improving reconstruction across the entire RIR. Experiments on diverse simulated acoustic environments demonstrate that RIR-Former consistently outperforms state-of-the-art baselines in terms of normalized mean square error (NMSE) and cosine distance (CD), under varying missing rates and array configurations. These results highlight the potential of our approach for practical deployment and motivate future work on scaling from randomly spaced linear arrays to complex array geometries, dynamic acoustic scenes, and real-world environments.

RTA-Former: Reverse Transformer Attention for Polyp Segmentation

arXiv, 2024-01-22. Authors: Zhikai Li, Murong Yi, Ali Uneri

Polyp segmentation is a key aspect of colorectal cancer prevention, enabling early detection and guiding subsequent treatments. Intelligent diagnostic tools, including deep learning solutions, are widely explored to streamline and potentially automate this process. However, even with many powerful network architectures, producing accurate edge segmentation remains a problem. In this paper, we introduce a novel network, namely RTA-Former, that employs a transformer model as the encoder backbone and innovatively adapts Reverse Attention (RA) with a transformer stage in the decoder for enhanced edge segmentation. Experimental results show that RTA-Former achieves state-of-the-art (SOTA) performance on five polyp segmentation datasets. The strong capability of RTA-Former holds promise in improving the accuracy of Transformer-based polyp segmentation, potentially leading to better clinical decisions and patient outcomes. Our code is publicly available on GitHub.

Search results: former

RIR-Former: Coordinate-Guided Transformer for Continuous Reconstruction of Room Impulse Responses

RTA-Former: Reverse Transformer Attention for Polyp Segmentation

SFi-Former: Sparse Flow Induced Attention for Graph Transformer

Proto-Former: Unified Facial Landmark Detection by Prototype Transformer

SLAM-Former: Putting SLAM into One Transformer

Role of Fragility of the Glass Formers in the Yielding Transition under Oscillatory Shear

Mobile-Former: Bridging MobileNet and Transformer

HOIST-Former: Hand-held Objects Identification, Segmentation, and Tracking in the Wild

Accuracy Improvement of Cell Image Segmentation Using Feedback Former

AccidentBlip: Agent of Accident Warning based on MA-former

In-Context Former: Lightning-fast Compressing Context for Large Language Model

Ethnic Conflicts, Civil War and Economic Growth: Region-Level Evidence from former Yugoslavia

X-Former: In-Memory Acceleration of Transformers

MoCHA-former: Moiré-Conditioned Hybrid Adaptive Transformer for Video Demoiréing

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

SpA-Former: Transformer image shadow detection and removal via spatial attention

NAR-Former V2: Rethinking Transformer for Universal Neural Network Representation Learning

$\infty$-former: Infinite Memory Transformer

A connection between the structural alpha-relaxation and the beta-relaxation found in bulk metallic glass-formers

Jamming, relaxation, and memory in a structureless glass former