Found 20 results
We study the possibility of designing $N^{o(1)}$-round protocols for problems of substantially super-linear polynomial-time (sequential) complexity in the model of Massively Parallel Computation, where $N$ is the input size. We show that if the machines are not equipped with relatively large local memory and their number does not exceed $N$, then the exponent of the average time complexity of the local computation performed by a machine in a round (in terms of local memory size) in such protocols must be larger than the exponent of the time complexity of the given problem.
Magnetorheological fluids consist of micrometer-sized magnetic particles in a carrier liquid. Sufficiently strong external magnetic fields lead to the formation of string-like particle aggregates. We demonstrate that hydrodynamic interactions, that is, mutual couplings via induced flows, play a substantial role during the structuring process. They favor the formation of slender chains, whereas more compact clusters form in the absence of mutual hydrodynamic interactions between the particles. This fundamental insight is significant from an application perspective, due to the enormous technical importance and potential of structured magnetorheological materials.
Trotterization is one of the central approaches for simulating quantum many-body dynamics on quantum computers or tensor networks. In addition to its simple implementation, recent studies have revealed that its error and cost can be reduced if the initial state is closed in the low-energy subspace. However, in previous studies the improvement from the low-energy property rapidly vanishes as the Trotter order grows, and thus it has been unclear whether low-energy initial states offer a genuine advantage. In this Letter, we resolve this problem by proving the optimal error bound and cost of Trotterization for low-energy initial states. For generic local Hamiltonians composed of positive-semidefinite terms, we show that the Trotter error is at most linear in the initial state energy $Δ$ and polylogarithmic in the system size $N$. As a result, the computational cost becomes substantially smaller for low-energy states with $Δ\in o(Ng)$ than for arbitrary initial states, where $g$ denotes the energy per site and $Ng$ is the whole-system energy. Our error bound and cost of Trotterization achieve the theoretically best scaling in the initial state energy $Δ$. In add
We study the possibility of designing $N^{o(1)}$-round protocols for problems of substantially super-linear polynomial-time (sequential) complexity on the congested clique with about $N^{1/2}$ nodes, where $N$ is the input size. We show that the average time complexity of the local computation performed at a clique node (in terms of the size of the data received by the node) in such protocols has to be substantially larger than the time complexity of the given problem.
Medical image synthesis generates additional imaging modalities that are costly, invasive or harmful to acquire, which helps to facilitate the clinical workflow. When training pairs are substantially misaligned (e.g., lung MRI-CT pairs with respiratory motion), accurate image synthesis remains a critical challenge. Recent works explored the directional registration module to adjust misalignment in generative adversarial networks (GANs); however, substantial misalignment will lead to 1) suboptimal data mapping caused by correspondence ambiguity, and 2) degraded image fidelity caused by morphology influence on discriminators. To address the challenges, we propose a novel Deformation-aware GAN (DA-GAN) to dynamically correct the misalignment during the image synthesis based on multi-objective inverse consistency. Specifically, in the generative process, three levels of inverse consistency cohesively optimise symmetric registration and image generation for improved correspondence. In the adversarial process, to further improve image fidelity under misalignment, we design deformation-aware discriminators to disentangle the mismatched spatial morphology from the judgement of image fideli
Although language model (LM) agents have demonstrated increased performance in multiple domains, including coding and web-browsing, their success in cybersecurity has been limited. We present EnIGMA, an LM agent for autonomously solving Capture The Flag (CTF) challenges. We introduce new tools and interfaces to improve the agent's ability to find and exploit security vulnerabilities, focusing on interactive terminal programs. These novel Interactive Agent Tools enable LM agents, for the first time, to run interactive utilities, such as a debugger and a server connection tool, which are essential for solving these challenges. Empirical analysis on 390 CTF challenges across four benchmarks demonstrates that these new tools and interfaces substantially improve our agent's performance, achieving state-of-the-art results on NYU CTF, Intercode-CTF, and CyBench. Finally, we analyze data leakage, developing new methods to quantify it and identifying a new phenomenon we term soliloquizing, where the model self-generates hallucinated observations without interacting with the environment. Our code and development dataset are available at https://github.com/SWE-agent/SWE-agent/tree/v0.7 and htt
Dynamic processes on networks, be it information transfer in the Internet, contagious spreading in a social network, or neural signaling, take place along shortest or nearly shortest paths. Unfortunately, our maps of most large networks are substantially incomplete due to either the highly dynamic nature of networks, or high cost of network measurements, or both, rendering traditional path finding methods inefficient. We find that shortest paths in large real networks, such as the network of protein-protein interactions (PPI) and the Internet at the autonomous system (AS) level, are not random but are organized according to latent-geometric rules. If nodes of these networks are mapped to points in latent hyperbolic spaces, shortest paths in them align along geodesic curves connecting endpoint nodes. We find that this alignment is sufficiently strong to allow for the identification of shortest path nodes even in the case of substantially incomplete networks. We demonstrate the utility of latent-geometric path-finding in problems of cellular pathway reconstruction and communication security.
Most countries are suffering severely from the ongoing COVID-19 pandemic despite various levels of preventive measures. A common question is if and when a country or region will reach herd immunity $h$. The classical herd immunity level $h_C$ is defined as $h_C=1-1/R_0$, where $R_0$ is the basic reproduction number, which for COVID-19 is estimated to lie somewhere in the range 2.2-3.5 depending on country and region. It is shown here that the disease-induced herd immunity level $h_D$, after an outbreak has taken place in a country/region with a set of preventive measures put in place, is actually substantially smaller than $h_C$. As an illustration we show that if $R_0=2.5$ in an age-structured community with mixing rates fitted to social activity studies, with individuals further divided into three categories (low active, average active and high active), and where preventive measures affect all mixing rates proportionally, then the disease-induced herd immunity level is $h_D=43\%$ rather than $h_C=1-1/2.5=60\%$. Consequently, a lower fraction infected is required for herd immunity to appear. The underlying reason is that when immunity is induced by disease spreading, the proportion infecte
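The classical formula $h_C=1-1/R_0$ quoted in this abstract can be checked with a few lines of Python. This is only a minimal sketch of that one formula; the function name `classical_herd_immunity` is a hypothetical helper introduced here, and the disease-induced level $h_D$ is not computed because it depends on the age- and activity-structured model, which the abstract does not specify in enough detail.

```python
def classical_herd_immunity(r0: float) -> float:
    """Return the classical herd immunity level h_C = 1 - 1/R_0.

    Hypothetical helper for illustration; the formula is only meaningful
    for R_0 > 1, so smaller values are rejected.
    """
    if r0 <= 1.0:
        raise ValueError("R_0 must exceed 1 for an outbreak to be possible")
    return 1.0 - 1.0 / r0


if __name__ == "__main__":
    # Endpoints of the R_0 range quoted in the abstract, plus its worked value 2.5.
    for r0 in (2.2, 2.5, 3.5):
        print(f"R_0 = {r0}: h_C = {classical_herd_immunity(r0):.1%}")
```

For $R_0=2.5$ this reproduces the 60% stated in the abstract; the 43% figure for $h_D$ comes from the structured model and cannot be derived from this formula alone.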
Understanding the loss surface of a neural network is fundamentally important to the understanding of deep learning. This paper presents how piecewise linear activation functions substantially shape the loss surfaces of neural networks. We first prove that the loss surfaces of many neural networks have infinite spurious local minima, which are defined as the local minima with higher empirical risks than the global minima. Our result demonstrates that the networks with piecewise linear activations possess substantial differences to the well-studied linear neural networks. This result holds for any neural network with arbitrary depth and arbitrary piecewise linear activation functions (excluding linear functions) under most loss functions in practice. Essentially, the underlying assumptions are consistent with most practical circumstances where the output layer is narrower than any hidden layer. In addition, the loss surface of a neural network with piecewise linear activations is partitioned into multiple smooth and multilinear cells by nondifferentiable boundaries. The constructed spurious local minima are concentrated in one cell as a valley: they are connected with each othe
This work explores a distributed computing setting where $K$ nodes are assigned fractions (subtasks) of a computational task in order to perform the computation in parallel. In this setting, a well-known main bottleneck has been the inter-node communication cost required to parallelize the task, because unlike the computational cost, which could keep decreasing as $K$ increases, the communication cost remains approximately constant, thus bounding the total speedup gains associated with having more computing nodes. This bottleneck was substantially ameliorated by the recent introduction of coded MapReduce techniques, which allowed each node, at the computational cost of having to preprocess approximately $t$ times more subtasks, to reduce its communication cost by approximately $t$ times. In reality though, the associated speedup gains were severely limited by the requirement that larger $t$ and $K$ necessitated that the original task be divided into an extremely large number of subtasks. In this work we show how node cooperation, along with a novel assignment of tasks, can help to dramatically ameliorate this limitation. The result applies to wired as well as wireless distribute
Deep neural networks produce state-of-the-art results when trained on a large number of labeled examples but tend to overfit when small amounts of labeled examples are used for training. Creating a large number of labeled examples requires considerable resources, time, and effort. If labeling new data is not feasible, so-called semi-supervised learning can achieve better generalisation than purely supervised learning by employing unlabeled instances as well as labeled ones. The work presented in this paper is motivated by the observation that transfer learning provides the opportunity to potentially further improve performance by exploiting models pretrained on a similar domain. More specifically, we explore the use of transfer learning when performing semi-supervised learning using self-learning. The main contribution is an empirical evaluation of transfer learning using different combinations of similarity metric learning methods and label propagation algorithms in semi-supervised learning. We find that transfer learning always substantially improves the model's accuracy when few labeled examples are available, regardless of the type of loss used for training the neural network.
R-parity stands as an ad hoc assumption in the most popular version of the supersymmetric standard model. More than fifteen years' studies of R-parity violations have been restricted to various limiting scenarios. We illustrate how the single-VEV parametrization provides a workable framework to analyze the phenomenology of the complete theory of supersymmetry without R-parity. In our comprehensive study of various aspects of the resulting leptonic phenomenology at tree-level, we find that the physical $τ$ lepton could actually bear substantial gaugino and higgsino components, making it very different from the $e$ and the $μ$.
In a recent breakthrough, Kalmynin resolved conjectures of Lev--Sonn and Sárközy on additive decompositions of multiplicative subgroups of prime fields. In this paper, inspired by a related conjecture of Sárközy, we prove multiplicative analogues of Kalmynin's results. We show that for every proper multiplicative subgroup $G$, the shifted set $(G-1)\setminus\{0\}$ cannot be written as a product set nontrivially, addressing a conjecture of Sárközy. In addition, we prove that no nonzero shift of any coset of a proper multiplicative subgroup is a ratio set of the form $A/A$. Our results substantially sharpen previous theorems of Shkredov and the authors.
The scientific peer-review process is facing a shortage of human resources due to the rapid growth in the number of submitted papers. The use of language models to reduce the human cost of peer review has been actively explored as a potential solution to this challenge. A method has been proposed to evaluate the level of substantiation in scientific reviews in a manner that is interpretable by humans. This method extracts the core components of an argument, claims and evidence, and assesses the level of substantiation based on the proportion of claims supported by evidence. The level of substantiation refers to the extent to which claims are based on objective facts. However, when assessing the level of substantiation, simply detecting the presence or absence of supporting evidence for a claim is insufficient; it is also necessary to accurately assess the logical inference between a claim and its evidence. We propose a new evaluation metric for scientific review comments that assesses the logical inference between claims and evidence. Experimental results show that the proposed method achieves a higher correlation with human scores than conventional methods, indicating its potentia
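The earlier metric described in this abstract, the proportion of claims supported by evidence, can be sketched as below. This is an illustrative assumption rather than the authors' implementation: the `Claim` class and `substantiation_level` function are hypothetical names, claim and evidence extraction is assumed to have been done already, and the logical-inference assessment proposed in the abstract is not modeled.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Claim:
    """A claim extracted from a review, with any evidence spans linked to it.

    Hypothetical structure for illustration only.
    """
    text: str
    evidence: List[str] = field(default_factory=list)


def substantiation_level(claims: List[Claim]) -> float:
    """Fraction of claims with at least one piece of supporting evidence.

    Returns 0.0 for a review with no claims (an assumption made here).
    """
    if not claims:
        return 0.0
    supported = sum(1 for claim in claims if claim.evidence)
    return supported / len(claims)


if __name__ == "__main__":
    review = [
        Claim("The evaluation is too narrow.", ["Only one dataset is used."]),
        Claim("The writing is unclear."),
    ]
    print(substantiation_level(review))
```

A presence-only count like this is exactly what the abstract argues is insufficient, since it ignores whether the evidence logically supports the claim.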
Long-term environmental monitoring requires the ability to reconstruct and align 3D models across repeated site visits separated by months or years. However, existing Structure-from-Motion (SfM) pipelines implicitly assume near-simultaneous image capture and limited appearance change, and therefore fail when applied to long-term monitoring scenarios such as coral reef surveys, where substantial visual and structural change is common. In this paper, we show that the primary limitation of current approaches lies in their reliance on post-hoc alignment of independently reconstructed sessions, which is insufficient under large temporal appearance change. We address this limitation by enforcing cross-session correspondences directly within a joint SfM reconstruction. Our approach combines complementary handcrafted and learned visual features to robustly establish correspondences across large temporal gaps, enabling the reconstruction of a single coherent 3D model from imagery captured years apart, where standard independent and joint SfM pipelines break down. We evaluate our method on long-term coral reef datasets exhibiting significant real-world change, and demonstrate consistent join
Minkowski tensors, also known as tensor valuations, provide robust $n$-point information for a wide range of random spatial structures. Local estimators for point clouds, e.g., representing voxelized data, however, are unavoidably biased even in the limit of infinitely high resolution. Here, we substantially improve a recently proposed, asymptotically unbiased algorithm to estimate Minkowski tensors from point clouds. Our improved algorithm is more robust and efficient. Moreover, we generalize the theoretical foundations for an asymptotically bias-free estimation of the interfacial tensors, among others, to the case of finite unions of compact sets with positive reach, which is relevant for many applications like rough surfaces or composite materials. As a realistic test case of random spatial structures, we consider random (beta) polytopes. We first derive explicit expressions of the expected Minkowski tensors, which we then compare to our simulation results. We obtain precise estimates with relative errors of a few percent for practically relevant resolutions. Finally, we apply our methods to real data of metallic grains and nanorough surfaces, and we provide an open-source Python
Instructional videos are the dominant medium for learning physical tasks, yet they rarely match the user's real-world visual context. Motor simulation and cognitive load theories predict this mismatch should matter, but we do not know (1) how much it could affect task completion, (2) which visual attributes are responsible, and (3) how users experience it. We conduct two complementary studies (56 participants, 86+ hours, four first-aid and culinary tasks) in which we use Wizard-of-Oz recordings to control the degree of visual alignment in instructional videos. In Study 1 (N=16), we prepare In-Context instructional videos (ICON) -- fully aligned with the user's visual perception -- to compare against business-as-usual Internet videos. ICON yields statistically significant improvements: 11.1% higher completion quality and 15.5% faster completion. Qualitative analysis reveals four visual context attributes responsible for the effect: Task Object Intrinsics, Task Object State, Environmental Context, and Observational Context. Study 2 (N=40) ablates each attribute by systematically misaligning one at a time from an otherwise fully aligned video, confirming all four produce consistent de
We argue that rotation of a thin superconducting cylinder can increase the critical superconducting temperature substantially. A purely rotational effect originates from the tendency of a steadily rotating mechanical system to maximize its moment of inertia. A condensation of Cooper pairs in a rotating cylinder decouples a part of the normal electron fraction from rotation, thus producing a circulating electric current of an uncompensated electric charge of lattice ions. The current produces the magnetic field that stores energy of rotation, thus increasing the moment of inertia. In the presence of an external magnetic field, another enhancement effect originates from the interaction energy of the dipole magnetic moment of the normal component with the background magnetic field. In both cases, rotation of the cylindrical shell promotes the formation of condensate that decouples from mechanical rotation. We give quantitative estimates for a thin cylinder of aluminum.
Human induced changes in atmospheric aerosols have introduced a climate forcing by modifying cloud droplet number concentration, liquid water, and cloud fraction. This forcing is subject to large uncertainties as cloud adjustments have not only complex dependence on background conditions, but also temporal fluctuations, especially those driven by diel variations in solar heating. However, direct observations of such diel changes are still limited. Here, we present observational evidence of substantial diel changes in the cloud adjustments to aerosols within ship tracks, linear lines of polluted clouds captured in satellite images. We developed a novel method to automatically determine the age of each ship-track segment and analyze cloud adjustments to aerosols. We show that more aged polluted clouds with extended nighttime exposure exhibit higher increases in cloud fraction. By contrast, liquid water path adjustments follow a non-monotonic pattern: they generally decrease with time before reversing trend in clouds formed at nighttime. Most of these diel contrasts are statistically significant and likely stem from differences in solar heating and cloud precipitation. The increase in
We investigate the transition rates of a centripetally accelerated atom inside a high-quality cavity and show that they can be extensively tuned by adjusting the cavity resonance and the rotation frequency. Crucially, while inertial atoms cannot be excited in vacuum, rotation induces spontaneous excitation via the circular Unruh effect, with the cavity serving only as an amplifier. Using experimentally feasible parameters, we demonstrate that, in one scenario, the excitation rate can reach $\sim 10^7~\mathrm{s}^{-1}$ while emission remains negligible, enabling substantial population inversion. In another scenario, both excitation and emission can simultaneously attain $\sim 10^7~\mathrm{s}^{-1}$, corresponding to millions of transitions per second for a single atom. These findings highlight a powerful method for manipulating atomic transition rates for quantum applications and open a promising route toward experimental verification of the circular Unruh effect with state-of-the-art quantum technologies.