Recommender systems are usually designed by engineers, researchers, designers, and other members of development teams. These systems are then evaluated based on goals set by the aforementioned teams and other business units of the platforms operating the recommender systems. This design approach emphasizes the designers' vision for how the system can best serve the interests of users, providers, businesses, and other stakeholders. Although designers may be well-informed about user needs through user experience and market research, they are still the arbiters of the system's design and evaluation, with other stakeholders' interests less emphasized in user-centered design and evaluation. When extended to recommender systems for social good, this approach results in systems that reflect the social objectives as envisioned by the designers and evaluated as the designers understand them. Instead, social goals and operationalizations should be developed through participatory and democratic processes that are accountable to their stakeholders. We argue that recommender systems aimed at improving social good should be designed *by* and *with*, not just *for*, the people who will experience
Visual geometry transformers have become powerful architectures for multi-view 3D reconstruction, enabling joint prediction of multiple 3D attributes in a feed-forward manner. However, their computational cost grows quadratically with the input sequence length due to the global attention layers inside these models. This limits both their scalability and efficiency. In this work, we address this challenge with a simple yet general strategy: restricting the number of key/value tokens that each query interacts with during global attention. To achieve effective token selection, we introduce a two-stage framework. First, an inter-frame selection step operates at the frame level to identify frames that should be preserved. Second, an intra-frame selection step further discards more redundant tokens within the selected frames. Our analysis highlights the advantage of a diversity-based strategy for inter-frame selection, which ensures broad coverage of the scene. For intra-frame selection, we show that layer-aware sparsification is necessary, with the selection process guided by the entropy of the global attention pattern. Our approach offers a superior speed-accuracy trade-off compared to
This work examines the role of recommender systems in promoting sustainability, social responsibility, and accountability, with a focus on alignment with the United Nations Sustainable Development Goals (SDGs). As recommender systems become increasingly integrated into daily interactions, they must go beyond personalization to support responsible consumption, reduce environmental impact, and foster social good. We explore strategies to mitigate the carbon footprint of recommendation models, ensure fairness, and implement accountability mechanisms. By adopting these approaches, recommender systems can contribute to sustainable and socially beneficial outcomes, aligning technological advancements with the SDGs focused on environmental sustainability and social well-being.
Suppose $G$ is a simple algebraic group defined over an algebraically closed field of good characteristic $p$. In 2018 Korhonen showed that if $H$ is a connected reductive subgroup of $G$ which contains a distinguished unipotent element $u$ of $G$ of order $p$, then $H$ is $G$-irreducible in the sense of Serre. We present a short and uniform proof of this result under an extra hypothesis using so-called good $A_1$ subgroups of $G$, introduced by Seitz. In the process we prove some new results about good $A_1$ subgroups of $G$ and their properties. We also formulate a counterpart of Korhonen's theorem for overgroups of $u$ which are finite groups of Lie type. Moreover, we generalize both results above by removing the restriction on the order of $u$ under a mild condition on $p$ depending on the rank of $G$, and we present an analogue of Korhonen's theorem for Lie algebras.
Numerous pre-training techniques for visual document understanding (VDU) have recently shown substantial improvements in performance across a wide range of document tasks. However, these pre-trained VDU models cannot guarantee continued success when the distribution of test data differs from the distribution of training data. In this paper, to investigate how robust existing pre-trained VDU models are to various distribution shifts, we first develop an out-of-distribution (OOD) benchmark termed Do-GOOD for the fine-Grained analysis on Document image-related tasks specifically. The Do-GOOD benchmark defines the underlying mechanisms that result in different distribution shifts and contains 9 OOD datasets covering 3 VDU related tasks, e.g., document information extraction, classification and question answering. We then evaluate the robustness and perform a fine-grained analysis of 5 latest VDU pre-trained models and 2 typical OOD generalization algorithms on these OOD datasets. Results from the experiments demonstrate that there is a significant performance gap between the in-distribution (ID) and OOD settings for document images, and that fine-grained analysis of distribution shifts
We study the mechanism design problem of selling a public good to a group of agents by a principal in the correlated private value environment. We assume the principal only knows the expectations of the agents' values, but does not know the joint distribution of the values. The principal evaluates a mechanism by the worst-case expected revenue over joint distributions that are consistent with the known expectations. We characterize maxmin public good mechanisms among dominant-strategy incentive compatible and ex-post individually rational mechanisms for the two-agent case and for a special $N$-agent ($N>2$) case.
The works of Poincare, Birkhoff, Witt and Cartier, Milnor, Moore on the connected cocommutative Hopf algebras translated in the language of operads means that the triple of operads (Com, As, Lie) endowed with the Hopf compatiblity relation is good. In this paper, we focus on left dipterous (resp. right dipterous) algebras which are associative algebras with an extra left (resp. right) module on themselves and look for good triples were $As$ is replaced by the dipterous operad Dipt. Since the work of Loday and Ronco, the triple of operads (As, Dipt, B_\infty) endowed with the semi-Hopf compatibility relations is known to be good. In this paper, we prove that the triple of operads (As, Dipt, Grove) endowed with the so-called (nonunital) semi-infinitesimal compatibility relations is good. For that, explicit constructions of the free dipterous algebra and the free grove-algebra over a K-vector space V are given. These constructions turn out to be related to rooted planar trees and the little an large Schroeder numbers. Many examples of dipterous algebras are given, notably the free L-dipterous algebras. As a corollary of our results, we also recover that the triple of operads (2As, Dip
In this paper we study continuous-time stochastic control problems with both monotone and classical controls motivated by the so-called public good contribution problem. That is the problem of n economic agents aiming to maximize their expected utility allocating initial wealth over a given time period between private consumption and irreversible contributions to increase the level of some public good. We investigate the corresponding social planner problem and the case of strategic interaction between the agents, i.e. the public good contribution game. We show existence and uniqueness of the social planner's optimal policy, we characterize it by necessary and sufficient stochastic Kuhn-Tucker conditions and we provide its expression in terms of the unique optional solution of a stochastic backward equation. Similar stochastic first order conditions prove to be very useful for studying any Nash equilibria of the public good contribution game. In the symmetric case they allow us to prove (qualitative) uniqueness of the Nash equilibrium, which we again construct as the unique optional solution of a stochastic backward equation. We finally also provide a detailed analysis of the so-ca
We study classical and quantum LDPC codes of constant rate obtained by the lifted product construction over non-abelian groups. We show that the obtained families of quantum LDPC codes are asymptotically good, which proves the qLDPC conjecture. Moreover, we show that the produced classical LDPC codes are also asymptotically good and locally testable with constant query and soundness parameters, which proves a well-known conjecture in the field of locally testable codes.
The AI for social good movement has now reached a state in which a large number of one-off demonstrations have illustrated that partnerships of AI practitioners and social change organizations are possible and can address problems faced in sustainable development. In this paper, we discuss how moving from demonstrations to true impact on humanity will require a different course of action, namely open platforms containing foundational AI capabilities to support common needs of multiple organizations working in similar topical areas. We lend credence to this proposal by describing three example patterns of social good problems and their AI-based solutions: natural language processing for making sense of international development reports, causal inference for providing guidance to vulnerable individuals, and discrimination-aware classification for supporting unbiased allocation decisions. We argue that the development of such platforms will be possible through convenings of social change organizations, AI companies, and grantmaking foundations.
Pretty good state transfer in networks of qubits occurs when a continuous-time quantum walk allows the transmission of a qubit state from one node of the network to another, with fidelity arbitrarily close to 1. We prove that in a Heisenberg chain with n qubits there is pretty good state transfer between the nodes at the j-th and (n-j+1)-th position if n is a power of 2. Moreover, this condition is also necessary for j=1. We obtain this result by applying a theorem due to Kronecker about Diophantine approximations, together with techniques from algebraic graph theory.
Knowledge distillation (KD) is a general neural network training approach that uses a teacher model to guide the student model. Existing works mainly study KD from the network output side (e.g., trying to design a better KD loss function), while few have attempted to understand it from the input side. Especially, its interplay with data augmentation (DA) has not been well understood. In this paper, we ask: Why do some DA schemes (e.g., CutMix) inherently perform much better than others in KD? What makes a "good" DA in KD? Our investigation from a statistical perspective suggests that a good DA scheme should reduce the covariance of the teacher-student cross-entropy. A practical metric, the stddev of teacher's mean probability (T. stddev), is further presented and well justified empirically. Besides the theoretical understanding, we also introduce a new entropy-based data-mixing DA scheme, CutMixPick, to further enhance CutMix. Extensive empirical studies support our claims and demonstrate how we can harvest considerable performance gains simply by using a better DA scheme in knowledge distillation.
We modify the transchromatic character maps to land in a faithfully flat extension of Morava E-theory. Our construction makes use of the interaction between topological and algebraic localization and completion. As an application we prove that centralizers of tuples of commuting prime-power order elements in good groups are good and we compute a new example.
This paper targets to search so-called \emph{good} generators by doing a brief survey over the generators developed in the history of pseudo-random number generators (PRNGs), verify their claims and rank them based on strong empirical tests in same platforms. To do this, the genre of PRNGs developed so far are explored and classified into three groups -- linear congruential generator based, linear feedback shift register based and cellular automata based. From each group, the well-known widely used generators which claimed themselves to be `\emph{good}' are chosen. Overall $30$ PRNGs are selected in this way on which two types of empirical testing are done -- blind statistical tests with Diehard battery of tests, battery \emph{rabbit} of TestU01 library and NIST statistical test-suite as well as graphical tests (lattice test and space-time diagram test). Finally, the selected PRNGs are divided into $24$ groups and are ranked according to their overall performance in all empirical tests.
Data for good implies unfettered access to data. But data owners must be conservative about how, when, and why they share data or risk violating the trust of the people they aim to help, losing their funding, or breaking the law. Data sharing agreements can help prevent privacy violations, but require a level of specificity that is premature during preliminary discussions, and can take over a year to establish. We consider the generation and use of synthetic data to facilitate ad hoc collaborations involving sensitive data. A good synthetic dataset has two properties: it is representative of the original data, and it provides strong guarantees about privacy. In this paper, we discuss important use cases for synthetic data that challenge the state of the art in privacy-preserving data generation, and describe DataSynthesizer, a dataset generation tool that takes a sensitive dataset as input and generates a structurally and statistically similar synthetic dataset, with strong privacy guarantees, as output. The data owners need not release their data, while potential collaborators can begin developing models and methods with some confidence that their results will work similarly on th
It is shown that replacing the sinusoidal chip in Golay complementary code pairs by special classes of waveforms that satisfy two conditions, symmetry/anti-symmetry and quazi-orthogonality in the convolution sense, renders the complementary codes immune to frequency selective fading and also allows for concatenating them in time using one frequency band/channel. This results in a zero-sidelobe region around the mainlobe and an adjacent region of small cross-correlation sidelobes. The symmetry/anti-symmetry property results in the zero-sidelobe region on either side of the mainlobe, while quasi-orthogonality of the two chips keeps the adjacent region of cross-correlations small. Such codes are constructed using discrete frequency-coding waveforms (DFCW) based on linear frequency modulation (LFM) and piecewise LFM (PLFM) waveforms as chips for the complementary code pair, as they satisfy both the symmetry/anti-symmetry and quasi-orthogonality conditions. It is also shown that changing the slopes/chirp rates of the DFCW waveforms (based on LFM and PLFM waveforms) used as chips with the same complementary code pair results in good code sets with a zero-sidelobe region. It is also shown
We consider the continued fraction expansion of real numbers under the action of a non-uniform lattice in PSL(2,R) and prove metric relations between the convergents and a natural geometric notion of good approximations.
We prove that the "good" Boussinesq model with the periodic boundary condition is locally well-posed in the space $H^{s}\times H^{s-2}$ for $s > -3/8$. In the proof, we employ the normal form approach, which allows us to explicitly extract the rougher part of the solution. This also leads to the conclusion that the remainder is in a smoother space $C([0,T], H^{s+a}), where $0 <= a < \min (2s+1, 1/2)$. If we have a mean-zero initial data, this implies a smoothing effect of this order for the non-linearity. This is new even in the previously considered cases $s > -1/4$.
Ethics in the emerging world of data science are often discussed through cautionary tales about the dire consequences of missteps taken by high profile companies or organizations. We take a different approach by foregrounding the ways that ethics are implicated in the day-to-day work of data science, focusing on instances in which data scientists recognize, grapple with, and conscientiously respond to ethical challenges. This paper presents a case study of ethical dilemmas that arose in a "data science for social good" (DSSG) project focused on improving navigation for people with limited mobility. We describe how this particular DSSG team responded to those dilemmas, and how those responses gave rise to still more dilemmas. While the details of the case discussed here are unique, the ethical dilemmas they illuminate can commonly be found across many DSSG projects. These include: the risk of exacerbating disparities; the thorniness of algorithmic accountability; the evolving opportunities for mischief presented by new technologies; the subjective and value- laden interpretations at the heart of any data-intensive project; the potential for data to amplify or mute particular voices;
We present a high-statistics lattice-QCD determination of the kaon gluon parton distribution function and gluon momentum fraction. We use clover valence fermion action to take 1,296,640 kaon-correlator measurements on a HISQ ensemble with $a \approx 0.12$~fm and 310-MeV pion mass, generated by the MILC collaboration. A detailed investigation into the impact of gauge-link smearing on the gluonic matrix elements indicates that five steps of hypercubic smearing offer an effective balance between signal quality and preservation of long-distance physics. We report a nonperturbatively renormalized kaon gluon momentum fraction of $\langle x \rangle_g^{\overline{\text{MS}}, K} = 0.557(18)_\text{stat}(24)_\text{NPR}(56)_\text{mixing}$ at $μ= 2$ GeV in in the $\overline{\text{MS}}$ scheme. Using reduced pseudo-ITD matrix elements and pseudo-PDF matching, we extract the kaon gluon PDF and compare with the prediction from the Dyson-Schwinger equation and with the pion PDF obtained from the same ensemble.