搜索结果：attempts

共找到 20 条结果

高级筛选 ▾

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

arXiv2026-01-06作者：Dhruv Trehan, Paras Chopra

We report a case study of four end-to-end attempts to autonomously generate ML research papers using a pipeline of six LLM agents mapped to stages of the scientific workflow. Of these four, three attempts failed during implementation or evaluation. One completed the pipeline and was accepted to Agents4Science 2025, an experimental inaugural venue that required AI systems as first authors, passing both human and multi-AI review. From these attempts, we document six recurring failure modes: bias toward training data defaults, implementation drift under execution pressure, memory and context degradation across long-horizon tasks, overexcitement that declares success despite obvious failures, insufficient domain intelligence, and weak scientific taste in experimental design. We conclude by discussing four design principles for more robust AI-scientist systems, implications for autonomous scientific discovery, and we release all prompts, artifacts, and outputs at https://github.com/Lossfunk/ai-scientist-artefacts-v1

Synthesis of europium-based crystals containing As or P by a flux method: attempts to grow EuAgP single crystals

arXiv2025-03-31作者：Karolina Podgórska, Damian Rybicki, Lan Maria Tran

Europium-based materials are highly attractive due to their diverse range of physical properties. In these studies, we aimed to synthesize single crystals of the potentially topological semimetallic compound EuAgP, which up to this day has only been obtained in polycrystalline form. The flux method was employed for the syntheses, using fluxes such as: Bi, Sn, Pb, and In, in their various ratios. The purpose of using Bi flux was to try synthesizing an analog of EuAgAs single crystals, by fully substituting arsenic with phosphorus. The obtained crystals were characterized by x-ray diffraction and scanning electron microscopy. Despite many unsuccessful attempts to synthesize EuAgP single crystals, the study provides valuable insights into how different fluxes and their ratios influence the final synthesis product. It also underscores the complexity of designing analogs between arsenides and phosphides.

搜索结果：attempts

Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts

Synthesis of europium-based crystals containing As or P by a flux method: attempts to grow EuAgP single crystals

How do the professional players select their shot locations? An analysis of Field Goal Attempts via Bayesian Additive Regression Trees

Some attempts toward 3-dimensional Phyllotaxy

Increased-Efficiency Multiple-Decoding-Attempts Error Correction for Continuous-Variable Quantum Key Distribution

It's the Thought that Counts: Evaluating the Attempts of Frontier LLMs to Persuade on Harmful Topics

Improving Forecasts of Suicide Attempts for Patients with Little Data

RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts

Evaluating the Efficacy of Large Language Models in Identifying Phishing Attempts

The significance of fuzzy boundaries of the barrier regions in single-molecule measurements of failed barrier crossing attempts

Are Made and Missed Different? An analysis of Field Goal Attempts of Professional Basketball Players via Depth Based Testing Procedure

Discovering Spoofing Attempts on Language Model Watermarks

JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models

DAGKT: Difficulty and Attempts Boosted Graph-based Knowledge Tracing

Interactive Visualization of Saturation Attempts in Vampire

Attempts at a determination of the fine-structure constant from first principles: A brief historical overview

Persistent Cross-Attempt State Optimization for Repository-Level Code Generation

Learning from Failures in Multi-Attempt Reinforcement Learning

Learning to Correct: Calibrated Reinforcement Learning for Multi-Attempt Chain-of-Thought

The human intention. A taxonomy attempt and its applications to robotics