Pre-deployment evaluations inspect only a limited sample of model actions. A malicious model seeking to evade oversight could exploit this by randomizing when to "defect": misbehaving so rarely that no malicious actions are observed during evaluation, but often enough that they occur eventually in deployment. This, however, requires taking actions at very low rates while maintaining calibration. Are frontier models even capable of that? We prompt the GPT-5, Claude-4.5, and Qwen-3 families to take a target action at low probabilities (e.g., 0.01%), either given directly or requiring derivation, and evaluate their calibration (i.e., whether they perform the target action roughly 1 in 10,000 times when resampled). We find that frontier models are surprisingly good at this task. If there is a source of entropy in context (such as a UUID), they maintain high calibration at rates lower than 1 in 100,000 actions. Without external entropy, some models can still reach rates lower than 1 in 10,000. When target rates are given, larger models achieve good calibration at lower rates. Yet when models must derive the optimal target rate themselves, all models fail to achieve calibration without entropy.
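The entropy-based mechanism this abstract alludes to can be illustrated with a small sketch (our own hypothetical construction, not the authors' prompts): hash an in-context UUID to a uniform value and "defect" only when it falls in the bottom target-rate fraction of the hash space, then check calibration by resampling.

```python
import hashlib
import uuid

TARGET_RATE = 1e-4  # act roughly 1 in 10,000 times

def should_defect(context_uuid: str, target_rate: float = TARGET_RATE) -> bool:
    # Hash the in-context UUID to a uniform integer in [0, 2**256) and
    # "defect" only when it falls in the bottom target_rate fraction of
    # the hash space. (Hypothetical scheme; the paper's prompts may
    # elicit a different derivation.)
    h = int(hashlib.sha256(context_uuid.encode()).hexdigest(), 16)
    return h < target_rate * 2**256

# Calibration check by resampling: the empirical rate should be near 1e-4.
n = 500_000
hits = sum(should_defect(str(uuid.uuid4())) for _ in range(n))
```

Because the decision is a deterministic function of the UUID, the model needs no internal randomness; the in-context entropy alone drives the low-rate behavior.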
Digital system design lectures are mandatory in the electrical and electronics engineering curriculum. Besides HDL simulators and viewers, FPGA boards are necessary for real HDL implementation, and these were previously costly for students. With the emergence of low-cost FPGA boards, the use of take-home labs is increasing, a process further accelerated by the COVID-19 pandemic. Traditional lab sessions have limitations, prompting the exploration of take-home lab kits to enhance learning flexibility and engagement. This study aims to evaluate the effectiveness of a low-cost take-home lab kit, consisting of a Tang Nano 9K FPGA board and a Saleae Logic Analyzer, in improving students' practical skills and sparking curiosity in digital system design. The research was conducted in the EEE 303 Digital Design lecture. Students used the Tang Nano 9K FPGA and Saleae Logic Analyzer for a term project involving PWM signal generation. Data were collected through a survey assessing the kit's impact on learning and engagement. Positive Acceptance: 75% of students agreed or strongly agreed that the take-home lab kit was beneficial. Preference for Lab Types: 60% of students preferred classi
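The PWM term project mentioned above typically reduces to counter-comparator logic on the FPGA. A behavioral sketch of that logic (our own illustration in Python rather than HDL; the course's actual design may differ):

```python
def pwm_level(counter: int, duty: int, period: int = 256) -> int:
    """Counter-comparator PWM as commonly implemented on an FPGA:
    a free-running counter wraps at `period`, and the output is high
    while the counter value is below the `duty` threshold."""
    return 1 if (counter % period) < duty else 0

# Over one full period the high-time fraction equals duty / period.
duty, period = 64, 256
high_samples = sum(pwm_level(t, duty, period) for t in range(period))
```

Sampling such an output with a logic analyzer, as in the described kit, lets students verify the duty cycle empirically.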
To facilitate human-robot interaction and gain human trust, a robot should recognize and adapt to changes in human behavior. This work documents different human behaviors observed while taking objects from an interactive robot in an experimental study, categorized across two dimensions: pull force applied and handedness. We also present the changes observed in human behavior upon repeated interaction with the robot to take various objects.
In autonomous driving, motion prediction aims at forecasting the future trajectories of nearby agents, helping the ego vehicle to anticipate behaviors and drive safely. A key challenge is generating a diverse set of future predictions, commonly addressed using data-driven models with Multiple Choice Learning (MCL) architectures and Winner-Takes-All (WTA) training objectives. However, these methods face initialization sensitivity and training instabilities. Additionally, to compensate for limited performance, some approaches rely on training with a large set of hypotheses, requiring a post-selection step during inference to significantly reduce the number of predictions. To tackle these issues, we take inspiration from annealed MCL, a recently introduced technique that improves the convergence properties of MCL methods through an annealed Winner-Takes-All loss (aWTA). In this paper, we demonstrate how the aWTA loss can be integrated with state-of-the-art motion forecasting models to enhance their performance using only a minimal set of hypotheses, eliminating the need for the cumbersome post-selection step. Our approach can be easily incorporated into any trajectory prediction model
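The annealed Winner-Takes-All idea can be illustrated with a minimal softmin-weighting sketch (a hedged illustration under our own assumptions; the paper's exact formulation may differ): each hypothesis receives a share of the loss proportional to a softmin over its error, and as the temperature anneals to zero this recovers the hard WTA assignment.

```python
import numpy as np

def awta_loss(errors: np.ndarray, temperature: float) -> float:
    """Soft assignment of a target to hypotheses via a softmin over
    per-hypothesis errors; as temperature -> 0 only the best (winner)
    hypothesis receives the loss, recovering hard WTA."""
    logits = -errors / temperature
    logits -= logits.max()        # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum()
    return float((weights * errors).sum())
```

At high temperature every hypothesis is trained (mitigating the initialization sensitivity the abstract mentions); annealing gradually specializes them.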
This paper introduces a nonlinear multi-agent dynamic model that characterizes the resource-seizing mechanism for a fixed amount of resources. The model demonstrates a winners-take-all behavior within a zero-sum game framework. It represents one of the simplest dynamics where equilibria correspond to states of winners and losers, with every trajectory converging to such an equilibrium. Notably, when the model operates in reverse time, it resembles a multi-agent consensus model, referred to as a reverse consensus model. The key characteristics of this model are explored through rigorous analysis.
Perception-based image analysis technologies can be used to help visually impaired people take better quality pictures by providing automated guidance, thereby empowering them to interact more confidently on social media. The photographs taken by visually impaired users often suffer from one or both of two kinds of quality issues: technical quality (distortions), and semantic quality, such as framing and aesthetic composition. Here we develop tools to help them minimize occurrences of common technical distortions, such as blur, poor exposure, and noise. We do not address the complementary problems of semantic quality, leaving that aspect for future work. The problem of assessing and providing actionable feedback on the technical quality of pictures captured by visually impaired users is hard enough, owing to the severe, commingled distortions that often occur. To advance progress on the problem of analyzing and measuring the technical quality of visually impaired user-generated content (VI-UGC), we built a very large and unique subjective image quality and distortion dataset. This new perceptual resource, which we call the LIVE-Meta VI-UGC Database, contains $40$K real-world distor
In this study, we interviewed 22 prominent hacktivists to learn their take on the increased proliferation of misinformation on social media. We found that none of them welcomes the nefarious appropriation of trolling and memes for the purpose of political (counter)argumentation and the dissemination of propaganda. True to the original hacker ethos, misinformation is seen as a threat to the democratic vision of the Internet, and as such, it must be confronted head-on with tried-and-true hacktivist methods such as deplatforming the "misinformers" and doxing or leaking data about their funding and recruitment. The majority of the hacktivists also recommended interventions for raising misinformation literacy in addition to targeted hacking campaigns. We discuss the implications of these findings relative to the emergent recasting of hacktivism in defense of a constructive and factual social media discourse.
In a Take-Away Game on hypergraphs, two players take turns removing the vertices and hyperedges of a hypergraph. In each turn, a player must remove either a single vertex or a single hyperedge. When a player removes a vertex, all of the hyperedges that contain the chosen vertex are also removed. When a player removes a hyperedge, only that hyperedge is removed. Whoever removes the last vertex wins the game. Building on the winning strategy for Take-Away Impartial Combinatorial Games on only Oddly Uniform or only Evenly Uniform Hypergraphs, this paper presents a new winning strategy for Take-Away Games on hypergraphs that are neither Oddly nor Evenly Uniform. These hypergraphs, however, have to satisfy specific given requirements.
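The rules above are concrete enough to solve small instances exhaustively. The following is a brute-force game-tree solver (our own illustration for intuition, not the paper's closed-form strategy):

```python
from functools import lru_cache

def first_player_wins(vertices, hyperedges):
    """Exhaustive solver for the take-away game described above.
    A move removes either one vertex (together with every hyperedge
    containing it) or one hyperedge; whoever removes the last vertex
    wins under normal play."""
    V0 = frozenset(vertices)
    E0 = frozenset(frozenset(e) for e in hyperedges)

    @lru_cache(maxsize=None)
    def win(V, E):
        if not V:                 # previous player took the last vertex
            return False
        # Option 1: remove a vertex and all hyperedges incident to it.
        for v in V:
            if not win(V - {v}, frozenset(e for e in E if v not in e)):
                return True
        # Option 2: remove a single hyperedge.
        for e in E:
            if not win(V, E - {e}):
                return True
        return False

    return win(V0, E0)
```

For example, with two vertices joined by one hyperedge, the first player wins by removing the hyperedge, leaving a losing two-vertex position for the opponent.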
In this paper, we propose a winner-take-all method for learning hierarchical sparse representations in an unsupervised fashion. We first introduce fully-connected winner-take-all autoencoders which use mini-batch statistics to directly enforce a lifetime sparsity in the activations of the hidden units. We then propose the convolutional winner-take-all autoencoder which combines the benefits of convolutional architectures and autoencoders for learning shift-invariant sparse representations. We describe a way to train convolutional autoencoders layer by layer, where in addition to lifetime sparsity, a spatial sparsity within each feature map is achieved using winner-take-all activation functions. We show that winner-take-all autoencoders can be used to learn deep sparse representations from the MNIST, CIFAR-10, ImageNet, Street View House Numbers, and Toronto Face datasets, and achieve competitive classification performance.
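The lifetime-sparsity mechanism can be sketched in a few lines (a minimal NumPy illustration of the idea, with k as an assumed sparsity parameter): for each hidden unit, only its k largest activations across the mini-batch survive; the rest are zeroed.

```python
import numpy as np

def lifetime_sparsity(acts: np.ndarray, k: int) -> np.ndarray:
    """Winner-take-all lifetime sparsity: for each hidden unit (column),
    keep only its k largest activations across the mini-batch (rows)
    and zero out the rest."""
    out = np.zeros_like(acts)
    rows = np.argpartition(acts, -k, axis=0)[-k:]   # top-k row indices per column
    cols = np.arange(acts.shape[1])
    out[rows, cols] = acts[rows, cols]
    return out
```

During backpropagation, gradients then flow only through the winning activations, which is what drives each unit to specialize.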
In conditional automation, a response from the driver is expected when a take-over request is issued due to unexpected events, emergencies, or reaching the operational design domain boundaries. Cooperation between the automated driving system and the driver can help to guarantee a safe and pleasant transfer if the driver is guided through a haptic guidance system that applies a slight counter-steering force to the steering wheel. In this work, we examine the impact of haptic guidance systems on driving performance after a take-over request is triggered to avoid sudden obstacles on the road. We studied different driver conditions that involved Non-Driving Related Tasks (NDRT). Results showed that haptic guidance systems increased road safety by reducing the lateral error, the distance and reaction time to a sudden obstacle, and the number of collisions.
The creative process is essentially Darwinian, and only a small proportion of creative ideas are selected for further development. However, the threshold that identifies this small fraction of successfully disseminated creative ideas at their early stage has not been thoroughly analyzed through the lens of Rogers' innovation diffusion theory. Here, we take highly cited (top 1%) research papers as an example of the most successfully disseminated creative ideas and explore the time it takes and the citations they receive at their take-off stage, which plays a crucial role in the dissemination of creativity. Results show that the majority of highly cited papers reach 10% and 25% of their total citations within two years and four years, respectively. Interestingly, our results also reveal a small number of articles that attract their first citation before publication. As for discipline, number of references, and Price index, we find that significant differences exist: Clinical, Pre-Clinical & Health and Life Sciences are the first two disciplines to reach C10% and C25% in a shorter amount of time. Highly cited papers with limited references usually take more time to reach 10% and 2
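The C10%/C25% measure discussed above can be computed directly from a paper's yearly citation counts. A simple sketch (our own illustration; function and variable names are assumptions, not the authors' code):

```python
def years_to_fraction(citations_per_year, frac):
    """Number of years a paper takes to accumulate a given fraction of
    its total citations (e.g. frac=0.10 for C10%, frac=0.25 for C25%).
    Returns None if the series is empty or total citations are zero."""
    total = sum(citations_per_year)
    if total == 0:
        return None
    cumulative = 0
    for year, count in enumerate(citations_per_year, start=1):
        cumulative += count
        if cumulative >= frac * total:
            return year
    return None
```

Applied per paper, this yields the take-off times whose distribution across disciplines the abstract compares.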
The solution proposed here can be used to conduct economic analyses in randomized clinical trials. It is based on a statistical approach and aims to calculate a revised version of the incremental cost-effectiveness ratio (ICER) that takes into account the key factors that can influence the choice of therapy, causing confounding by indication. Take as an example a new therapy to treat cancer being compared to an existing therapy, with effectiveness taken as time to death. A challenging problem is that the ICER is defined in terms of means over the entire treatment groups. It makes no provision for stratification by groups of patients with differing risks of death. For example, for a fair and unbiased analysis, one would want to compare time to death in groups with similar life expectancy, which is impacted by factors such as age, gender, and disease severity. The method we apply is borrowed from cluster analysis and aims to (i) discard any outliers that may arise in the set under analysis, and (ii) identify groups (i.e., clusters) of patients with "similar" key factors.
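The stratification idea can be made concrete: compute the ICER within each patient cluster rather than over the whole trial. A minimal sketch (our own illustration, assuming cluster labels have already been produced by some clustering of the risk factors):

```python
import numpy as np

def icer(costs, effects, arm):
    """Incremental cost-effectiveness ratio between a new therapy
    (arm == 1) and an existing therapy (arm == 0)."""
    d_cost = costs[arm == 1].mean() - costs[arm == 0].mean()
    d_effect = effects[arm == 1].mean() - effects[arm == 0].mean()
    return d_cost / d_effect

def stratified_icer(costs, effects, arm, cluster):
    """ICER computed separately within each cluster of patients with
    similar key factors (age, disease severity, etc.)."""
    return {c: icer(costs[cluster == c], effects[cluster == c], arm[cluster == c])
            for c in np.unique(cluster)}
```

Comparing the per-cluster ratios to the pooled ICER exposes how much of the overall ratio is driven by differences in patient mix between groups.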
We believe, in the sense of supporting ideas and considering them correct while dismissing doubts about them. We take sides about ideas and theories as if that were the right thing to do. And yet, from a rational point of view, this type of support and belief is not justifiable at all. The best we can hope for when describing the real world, as far as we know, is to have probabilistic knowledge, to have probabilities associated with each statement. And even that can be very hard to achieve in a reliable way. Far worse, when we defend ideas and believe them as if they were true, Cognitive Psychology experiments show that we stop being able to analyze the questions we believe in with competence. In this paper, I gather the evidence we have about taking sides and present the obvious but unseen conclusion that these facts combined mean that we should never believe in anything about the real world, except in a probabilistic way. We must never take sides, because taking sides destroys our ability to seek the most correct description of the world. That means we need to start reformulating the way we debate ideas, from our teaching to our political debates, if we actually wan
Humans often assume that robots are rational. We believe robots take optimal actions given their objective; hence, when we are uncertain about what the robot's objective is, we interpret the robot's actions as optimal with respect to our estimate of its objective. This approach makes sense when robots straightforwardly optimize their objective, and enables humans to learn what the robot is trying to achieve. However, our insight is that---when robots are aware that humans learn by trusting that the robot actions are rational---intelligent robots do not act as the human expects; instead, they take advantage of the human's trust, and exploit this trust to more efficiently optimize their own objective. In this paper, we formally model instances of human-robot interaction (HRI) where the human does not know the robot's objective using a two-player game. We formulate different ways in which the robot can model the uncertain human, and compare solutions of this game when the robot has conservative, optimistic, rational, and trusting human models. In an offline linear-quadratic case study and a real-time user study, we show that trusting human models can naturally lead to communicative ro
WTA (Winner Take All) hashing has been successfully applied in many large scale vision applications. This hashing scheme was tailored to take advantage of the comparative reasoning (or order based information), which showed significant accuracy improvements. In this paper, we identify a subtle issue with WTA, which grows with the sparsity of the datasets. This issue limits the discriminative power of WTA. We then propose a solution for this problem based on the idea of Densification which provably fixes the issue. Our experiments show that Densified WTA Hashing outperforms Vanilla WTA both in image classification and retrieval tasks consistently and significantly.
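Vanilla WTA hashing, whose sparsity weakness the paper addresses, is simple to sketch (our own illustration of the standard scheme; the densification fix itself is not shown here): for each random permutation, the code is the index of the maximum among the first K permuted components, capturing only rank-order information.

```python
import numpy as np

def wta_hash(x, permutations, K):
    """Vanilla WTA hashing: for each permutation, look at the first K
    permuted components of x and record the index of the largest one.
    Codes depend only on comparative (order-based) information, so they
    are invariant to monotone rescaling of x."""
    return [int(np.argmax(x[p[:K]])) for p in permutations]

rng = np.random.default_rng(0)
x = np.array([0.1, 0.9, 0.3, 0.5])
perms = [rng.permutation(len(x)) for _ in range(3)]
codes = wta_hash(x, perms, K=3)
```

On sparse vectors many of the first K components are ties at zero, which is exactly the ambiguity the paper's densification resolves.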
Students often take digital notes during live lectures, but current methods can be slow when capturing information from lecture slides or the instructor's speech, and require them to focus on their devices, leading to distractions and missing important details. This paper explores supporting live lecture note-taking with mixed reality (MR) to quickly capture lecture information and take notes while staying engaged with the lecture. A survey and interviews with university students revealed common note-taking behaviors and challenges to inform the design. We present MaRginalia to provide digital note-taking with a stylus tablet and MR headset. Students can take notes with an MR representation of the tablet, lecture slides, and audio transcript without looking down at their device. When preferred, students can also perform detailed interactions by looking at the physical tablet. We demonstrate the feasibility and usefulness of MaRginalia and MR-based note-taking in a user study with 12 students.
Recent studies have shown that visually impaired people have desires to take selfies in the same way as sighted people do to record their photos and share them with others. Although support applications using sound and vibration have been developed to help visually impaired people take selfies using smartphone cameras, it is still difficult to capture everyone in the angle of view, and it is also difficult to confirm that they all have good expressions in the photo. To mitigate these issues, we propose a method to take selfies with multiple people using an omni-directional camera. Specifically, a user takes a few seconds of video with an omni-directional camera, followed by face detection on all frames. The proposed method then eliminates false face detections and complements undetected ones considering the consistency across all frames. After performing facial expression recognition on all the frames, the proposed method finally extracts the frame in which the participants are happiest, and generates a perspective projection image in which all the participants are in the angle of view from the omni-directional frame. In experiments, we use several scenes with different number of p
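The final selection step of the pipeline above can be sketched as choosing the frame that maximizes aggregate happiness across detected faces (a hedged sketch; the mean aggregation and the names used are our assumptions, as the abstract does not specify them):

```python
def happiest_frame(happiness_scores):
    """Pick the frame whose participants are collectively happiest.
    happiness_scores[f][i] is the happiness of face i in frame f,
    e.g. as produced by a facial expression recognizer."""
    return max(range(len(happiness_scores)),
               key=lambda f: sum(happiness_scores[f]) / len(happiness_scores[f]))
```

The selected frame would then be fed to the perspective-projection step that frames all participants in the output image.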
Refrigeration based on the magnetocaloric effect (MCE) can contribute to energy-saving, environmentally friendly cooling in private households or industrial applications. The cooling is based on the reversible heat release or uptake during a phase transformation of the material, which can be controlled by a magnetic field. This process could replace conventional compression-based refrigeration, which often relies on environmentally harmful refrigerants. Here we show how to digitalize the process chain for the synthesis, theoretical and experimental characterization, and prototypical application of magnetocaloric alloys. Different Heusler alloys are examined experimentally as model systems for potential application in magnetic cooling. OTTR templates are used for the acquisition and semantic representation of knowledge in the development of an ontology. The ontology, when combined with unstructured data, can be exploited to train a model that can then be used to predict missing facts, which can help to gain new insights and to generate new hypotheses. Furthermore, tools are developed that automate data acquisition into ontological structures and workflows are implemented that provide
Turn-taking is a fundamental aspect of conversation, but current Human-Robot Interaction (HRI) systems often rely on simplistic, silence-based models, leading to unnatural pauses and interruptions. This paper investigates, for the first time, the application of general turn-taking models, specifically TurnGPT and Voice Activity Projection (VAP), to improve conversational dynamics in HRI. These models are trained on human-human dialogue data using self-supervised learning objectives, without requiring domain-specific fine-tuning. We propose methods for using these models in tandem to predict when a robot should begin preparing responses, take turns, and handle potential interruptions. We evaluated the proposed system in a within-subject study against a traditional baseline system, using the Furhat robot with 39 adults in a conversational setting, in combination with a large language model for autonomous response generation. The results show that participants significantly prefer the proposed system, and it significantly reduces response delays and interruptions.
Automation significantly alters human behavior, particularly risk-taking. Previous research has paid limited attention to the underlying characteristics of automation and their mechanisms of influence on risk-taking. This study investigated how automation affects risk-taking and examined the role of sense of agency therein. By quantifying sense of agency through subjective ratings, this research explored the impact of automation level and reliability level on risk-taking. The results of three experiments indicated that automation reduced the level of risk-taking; higher automation level was associated with lower sense of agency and lower risk-taking, with sense of agency playing a complete mediating role; higher automation reliability was associated with higher sense of agency and higher risk-taking, with sense of agency playing a partial mediating role. The study concludes that automation influences risk-taking, such that higher automation level or lower reliability is associated with a lower likelihood of risk-taking. Sense of agency mediates the impact of automation on risk-taking, and automation level and reliability have different effects on risk-taking.