共找到 20 条结果
Existing Natural Language Processing (NLP) resources often lack the task-specific information required for real-world problems and provide limited coverage of lesser-known or newly introduced entities. For example, business organizations and health care providers may need to be classified into a variety of different taxonomic schemes for specific application tasks. Our goal is to enable domain experts to easily create a task-specific classifier for entities by providing only entity names and gold labels as training data. Our framework then dynamically acquires descriptive text about each entity, which is subsequently used as the basis for producing a text-based classifier. We propose a novel text acquisition method that leverages both web and large language models (LLMs). We evaluate our proposed framework on two classification problems in distinct domains: (i) classifying organizations into Standard Industrial Classification (SIC) Codes, which categorize organizations based on their business activities; and (ii) classifying healthcare providers into healthcare provider taxonomy codes, which represent a provider's medical specialty and area of practice. Our best-performing model ac
Rhetorical figures play a major role in our everyday communication as they make text more interesting, more memorable, or more persuasive. Therefore, it is important to computationally detect rhetorical figures to fully understand the meaning of a text. We provide a comprehensive overview of computational approaches to lesser-known rhetorical figures. We explore the linguistic and computational perspectives on rhetorical figures, emphasizing their significance for the domain of Natural Language Processing. We present different figures in detail, delving into datasets, definitions, rhetorical functions, and detection approaches. We identified challenges such as dataset scarcity, language limitations, and reliance on rule-based methods.
In many common situations, a Bayesian credible interval will be, given the same data, very similar to a frequentist confidence interval, and researchers will interpret these intervals in a similar fashion. However, no predictable similarity exists when credible intervals are based on model-averaged posteriors whenever one of the two nested models under consideration is a so called ''point-null''. Not only can this model-averaged credible interval be quite different than the frequentist confidence interval, in some cases it may be undefined. This is a lesser-known correlate of the Jeffreys-Lindley paradox and is of particular interest given the popularity of the Bayes factor for testing point-null hypotheses.
Baseilhac-Benedetti, following ideas of Kashaev, introduced invariants of pseudo-Anosov homeomorphisms of punctured hyperbolic surfaces that depend on a complex root of unity of odd order. Around the same time, Bonahon-Liu introduced another set of invariants of pseudo-Anosov homeomorphisms at roots of unity. A little later, Dimofte and the first author introduced invariants of cusped hyperbolic 3-manifolds at roots of unity using their geometric representation. In another effort, Bonahon-Wong-Yang introduced another set of invariants of pseudo-Anosov homeomorphisms at roots of unity. All these invariants are conjecturally closely related, and our aim is to prove a precise relation between the Baseilhac-Benedetti invariants, the Bonahon-Liu-Wong-Yang and the lesser-known abelian $\mathfrak{gl}_1$-invariants.
We propose a data-driven and context-aware approach to bootstrap trustworthiness of homogeneous Internet of Things (IoT) services in Mobile Edge Computing (MEC) based industrial IoT (IIoT) systems. The proposed approach addresses key limitations in adapting existing trust bootstrapping approaches into MEC-based IIoT systems. These key limitations include, the lack of opportunity for a service consumer to interact with a lesser-known service over a prolonged period of time to get a robust measure of its trustworthiness, inability of service consumers to consistently interact with their peers to receive reliable recommendations of the trustworthiness of a lesser-known service as well as the impact of uneven context parameters in different MEC environments causing uneven trust environments for trust evaluation. In addition, the proposed approach also tackles the problem of data sparsity via enabling knowledge sharing among different MEC environments within a given MEC topology. To verify the effectiveness of the proposed approach, we carried out a comprehensive evaluation on two real-world datasets suitably adjusted to exhibit the context-dependent trust information accumulated in MEC
In this work, we study the impact of QA fine-tuning data on downstream factuality. We show that fine-tuning on lesser-known facts that are poorly stored during pretraining yields significantly worse factuality than fine-tuning on well-known facts, even when all facts are seen during pretraining. We prove this phenomenon theoretically, showing that training on lesser-known facts can lead the model to ignore subject entity names and instead output a generic plausible response even when the relevant factual knowledge is encoded in the model. On three question answering benchmarks (PopQA, Entity Questions, and MMLU) and two language models (Llama-2-7B and Mistral-7B), we find that (i) finetuning on a completely factual but lesser-known subset of the data deteriorates downstream factuality (5-10%) and (ii) finetuning on a subset of better-known examples matches or outperforms finetuning on the entire dataset. Ultimately, our results shed light on the interaction between pretrained knowledge and finetuning data and demonstrate the importance of taking into account how facts are stored in the pretrained model when fine-tuning for knowledge-intensive tasks.
Wikipedia is an invaluable resource for factual information about a wide range of entities. However, the quality of articles on less-known entities often lags behind that of the well-known ones. This study proposes a novel approach to enhancing Wikipedia's B and C category biography articles by leveraging personal narratives such as autobiographies and biographies. By utilizing a multi-staged retrieval-augmented generation technique -- REVerSum -- we aim to enrich the informational content of these lesser-known articles. Our study reveals that personal narratives can significantly improve the quality of Wikipedia articles, providing a rich source of reliable information that has been underutilized in previous studies. Based on crowd-based evaluation, REVerSum generated content outperforms the best performing baseline by 17% in terms of integrability to the original Wikipedia article and 28.5\% in terms of informativeness. Code and Data are available at: https://github.com/sayantan11995/wikipedia_enrichment
This article offers a gentle introduction to the axiom of choice. We introduce the axiom, discuss some common objections to it, and present three kinds of reasons to accept it. Although the exposition is aimed at non-experts in set theory, we also include some lesser-known results.
We focus on a sequence of functions $\{f_n\}$, defined on a compact manifold with boundary $S$, converging in the $C^k$ metric to a limit $f$. A common assumption implicitly made in the empirical sciences is that when such functions represent random processes derived from data, the topological features of $f_n$ will eventually resemble those of $f$. In this work, we investigate the validity of this claim under various regularity assumptions, with the goal of finding conditions sufficient for the number of local maxima, minima and saddle of such functions to converge. In the $C^1$ setting, we do so by employing lesser-known variants of the Poincaré-Hopf and mountain pass theorems, and in the $C^2$ setting we pursue an approach inspired by the homotopy-based proof of the Morse Lemma. To aid practical use, we end by reformulating our central theorems in the language of the empirical processes.
We provide an analytical framework for describing the propagation of light in waveguide arrays, considering both infinite and semi-infinite cases. The interaction up to second neighbors is taken into account, which makes for a more realistic setup. We show that these solutions follow a distinctive structural pattern. This pattern reflects a transition from conventional Bessel functions to the lesser-known one-parameter generalized Bessel functions, offering new insights into the propagation dynamics in these systems.
Robust flavor-polarized phases are a striking hallmark of many flat-band moiré materials. In this work, we trace the origin of this spontaneous polarization to a lesser-known quantum-geometric quantity: the quantum-geometric dipole. Analogous to how the quantum metric governs the spatial spread of wavepackets, we show that the quantum-geometric dipole sets the characteristic size of particle-hole excitations, e.g. magnons in a ferromagnet, which in turn boosts their gap and stiffness. Indeed, the larger the particle-hole separation, the weaker the mutual attraction, and the stronger the excitation energy. In topological bands, this energy enhancement admits a lower bound within the local-mode approximation, highlighting the crucial role of topology in flat-band ferromagnetism. We illustrate these effects in microscopic models, emphasizing their generality and relevance to moiré materials. Our results establish the quantum-geometric dipole as a predictive geometric indicator for ferromagnetism in flat bands, a crucial prerequisite for topological order.
We compare several different notions of filtered derived commutative ring, discussing HKR-filtered Hochschild homology, Hodge-filtered de Rham cohomology, and the lesser-known Hodge-filtered infinitesimal cohomology. Our main result is that de Rham cohomology is the crystallization of infinitesimal cohomology.
Ronald Wilfrid Gurney is one of the lesser-known research students of the Cavendish Laboratory in the mid 1920s. Gurney made significant contributions to the application of quantum mechanics to problems related to tunneling of alpha-particles from nuclei, to formation of images in photographic plates, the understanding of the origin of color-centres in salt crystals, and in the theory of semiconductors. He was the first physicist to apply quantum mechanics to the theory of electrochemistry and ionic solutions. He also made fundamental contributions to ballistics research. Gurney wrote a number of textbooks on fundamental and applied quantum mechanics in a distinctive style which are still useful as educational resources. In addition to his scientific contributions, he travelled extensively, and during and after World War II worked in the United States. During the cold war, he got entangled in the Klaus Fuchs affair and lost his employment. He died at the age of 54 in 1953 from a stroke. With the approach of the 100th year anniversary of quantum mechanics, it is timely to commemorate the life and contributions of this somewhat forgotten physicist.
The Mulhouse mathematician Jean-Henri (or Johann Heinrich) Lambert (August 26 or 28, 1728; September 25, 1777) is well known for having devised the conformal conic projection in 1772, which is still used in some graphical outputs of our weather forecasting models, under the name "Lambert projection" Less well known is that he also devised the idea of a minimum temperature value corresponding to $-270$°C in 1777, the year of his death and therefore 70 years before Lord Kelvin. But Johann Heinrich Lambert also published in 1771 an even lesser-known article, which is the subject of this publication. This article, written in Old French and published in a German journal, described in a rather striking manner the idea of a global observation network where the Earth would be divided into different zones where observers would record the same wind, temperature, pressure, and other current weather data at the same fixed times. The goal would then be to pool all this data to seek to understand how meteorological phenomena evolve in space and time, and to make meteorology a science on a par with astronomy.
The single harmonic oscillator and double-well potentials are important systems in quantum mechanics. The single harmonic oscillator is {\it the} paradigm in physics, and is taught in nearly all beginner undergraduate classes, while the double-well potential illustrates the two important principles of quantum tunnelling and linear superposition. While exact analytical solutions of the Schrodinger equation exist for both of these potentials, they are also employed to benchmark the use of approximate techniques which may be the only recourse for more complicated potentials. In this paper, we review the Wentzel-Kramers-Brillouin (WKB) approximation for both these potentials. While this approximation is known for its accurate energies, we will instead emphasize how poor the WKB wave functions are. The inaccuracy of the WKB wave functions will then motivate us to adopt the lesser-known Modified Airy Function (MAF) approximation, which alleviates the deficiencies of the WKB wave functions. We will review the MAF solution to the simple harmonic oscillator potential, and then apply the MAF to the double-well potential. We find accurate eigenvalues and, more importantly, very accurate wave
In network theory, a triad census is a method designed to categorize and enumerate the various types of subgraphs with three nodes and their connecting edges within a network. Triads serve as fundamental building blocks for comprehending the structure and dynamics of networks, and the triad census offers a systematic approach to their classification. Typically, triad counts are obtained numerically, but lesser-known methods have been developed to precisely evaluate them without the need for sampling. In our study, we build upon Moody's matrix approach, presenting general diagrammatic rules that systematically and intuitively generate closed formulas for the occurrence numbers of triads in a network.
The lattice problem for models of Peano Arithmetic ($\mathsf{PA}$) is to determine which lattices can be represented as lattices of elementary submodels of a model of $\mathsf{PA}$, or, in greater generality, for a given model $\mathcal{M}$, which lattices can be represented as interstructure lattices of elementary submodels $\mathcal{K}$ of an elementary extension $\mathcal{N}$ such that $\mathcal{M} \preccurlyeq \mathcal{K} \preccurlyeq \mathcal{N}$. The problem has been studied for the last 60 years and the results and their proofs show an interesting interplay between the model theory of PA, Ramsey style combinatorics, lattice representation theory, and elementary number theory. We present a survey of the most important results together with a detailed analysis of some special cases to explain and motivate a technique developed by James Schmerl for constructing elementary extensions with prescribed interstructure lattices. The last section is devoted to a discussion of lesser-known results about lattices of elementary submodels of countable recursively saturated models of PA.
Heretofore, the Serret-Frenet frame has been the ubiquitous choice for analyzing the elastic deformations of beam elements. It is well known that this frame is undefined at the inflection points and straight segments of the beam where its curvature is zero, leading to singularities and errors in their numerical analysis. On the other hand, there exists a lesser-known frame called Bishop which does not have the caveats of the Serret-Frenet frame and is well-defined everywhere along the beam centerline. Leveraging the Bishop frame, in this paper, we propose a new spatial, singularity-free low-order beam element based on the absolute nodal coordinate formulation for both small and large deformation applications. This element, named ANCF14, has a constant mass matrix and can capture longitudinal, transverse (bending) and torsional deformations. It is a two-noded element with 7 degrees of freedom per node, which are global nodal coordinates, nodal slopes and their cross-sectional rotation about the centerline. The newly developed element is tested through four complex benchmarks. Comparing the ANCF14 results with theoretical and numerical results provided in other studies confirms the e
This paper explores the effectiveness of using large language models (LLMs) for personalized movie recommendations from users' perspectives in an online field experiment. Our study involves a combination of between-subject prompt and historic consumption assessments, along with within-subject recommendation scenario evaluations. By examining conversation and survey response data from 160 active users, we find that LLMs offer strong recommendation explainability but lack overall personalization, diversity, and user trust. Our results also indicate that different personalized prompting techniques do not significantly affect user-perceived recommendation quality, but the number of movies a user has watched plays a more significant role. Furthermore, LLMs show a greater ability to recommend lesser-known or niche movies. Through qualitative analysis, we identify key conversational patterns linked to positive and negative user interaction experiences and conclude that providing personal context and examples is crucial for obtaining high-quality recommendations from LLMs.
As is well known, Paul Drude put forward the very first quantitative theory of electrical conduction in metals in 1900. He could successfully account for the Wiedemann-Franz law which states that the ratio of thermal to electrical conductivity divided by temperature is a constant called the Lorenz number. As it turns out, in Drude's derivation, there is a lucky cancellation of two errors. Drude's under-estimate (by an order of 100) of the value of square of the average electron velocity compensated his over-estimate of the electronic heat capacity (by the same order of 100). This compensation or cancellation of two errors lead to a value of the Lorenz number very close to its experimental value. This is well known. There is another error of a factor of two which Drude made when he calculated two different relaxation times for heat conductivity and electrical conductivity. In this article we highlight how and why this error occurred in Drude's derivation and how it was removed 5 years later (that is in 1905) by Hendrik Lorentz when he used the Boltzmann equation and a single relaxation time. This article is of pedagogical value and may be useful to undergraduate/graduate students le