搜索 — ResearchTracker

Stochastic bilevel optimization (SBO) has become a standard framework for hyperparameter learning, data reweighting, representation learning, and data-mixture optimization in deep learning. Existing exact single-loop SBO methods and memory-efficient surrogate SBO methods either create severe memory pressure for large lower-level neural networks or lack competitive convergence guarantees under standard assumptions. In this paper, we propose BROS, a memory-efficient single-loop SBO method with the same convergence rate order as exact single-loop SBO methods. BROS performs lower and auxiliary updates in randomized subspaces with a Rademacher bi-probe correction that recovers an unbiased Hessian-action estimator. We prove that BROS preserves the $\mathcal O(\varepsilon^{-2})$ sample complexity of MA-SOBA for finding an $\varepsilon$-stationary point under only standard assumptions. Experiments on hyper-data cleaning, data-mixture learning, hyper-representation learning, and ViT sample reweighting show that BROS reduces peak memory by up to 44.9% while closely matching full-space baseline performance.

You Can't Solve These Super Mario Bros. Levels: Undecidable Mario Games

arXiv2024-05-17作者：MIT Hardness Group, Hayashi Ani, Erik D. Demaine

We prove RE-completeness (and thus undecidability) of several 2D games in the Super Mario Bros. platform video game series: the New Super Mario Bros. series (original, Wii, U, and 2), and both Super Mario Maker games in all five game styles (Super Mario Bros. 1 and 3, Super Mario World, New Super Mario Bros. U, and Super Mario 3D World). These results hold even when we restrict to constant-size levels and screens, but they do require generalizing to allow arbitrarily many enemies at each location and onscreen, as well as allowing for exponentially large (or no) timer. Our New Super Mario Bros. constructions fit within one standard screen size. In our Super Mario Maker reductions, we work within the standard screen size and use the property that the game engine remembers offscreen objects that are global because they are supported by "global ground". To prove these Mario results, we build a new theory of counter gadgets in the motion-planning-through-gadgets framework, and provide a suite of simple gadgets for which reachability is RE-complete.

搜索结果：Bros

BROS: Bias-Corrected Randomized Subspaces for Memory-Efficient Single-Loop Bilevel Optimization

You Can't Solve These Super Mario Bros. Levels: Undecidable Mario Games

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

Experience-Driven PCG via Reinforcement Learning: A Super Mario Bros Study

Blazar Radio and Optical Survey (BROS): A catalog of blazar candidates showing flat radio spectrum and their optical identification in Pan-STARRS1 Surveys

Learning Constructive Primitives for Online Level Generation and Real-time Content Adaptation in Super Mario Bros

Nintendo Super Smash Bros. Melee: An "Untouchable" Agent

A Novel CNet-assisted Evolutionary Level Repairer and Its Applications to Super Mario Bros

Beating the World's Best at Super Smash Bros. with Deep Reinforcement Learning

Politics of Questions in News: A Mixed-Methods Study of Interrogative Stances as Markers of Voice and Power

Triangulating Temporal Dynamics in Multilingual Swiss Online News

Migrant Voices, Local News: Insights on Bridging Community Needs with Media Content

Interactive Simulations of Backdoors in Neural Networks

The Tarantula -- Revealed by X-rays (T-ReX)

RQC revisited and more cryptanalysis for Rank-based Cryptography

Revisiting Algebraic Attacks on MinRank and on the Rank Decoding Problem

Non-Commutative Renormalization

MYStIX First Results: Spatial Structures of Massive Young Stellar Clusters

Asymptotic dynamics of thermal quantum fields

Towards a Relativistic KMS Condition