搜索 — ResearchTracker

Despite the impressive capabilities of Large Language Models (LLMs) on various tasks, they still struggle with scenarios that involves complex reasoning and planning. Recent work proposed advanced prompting techniques and the necessity of fine-tuning with high-quality data to augment LLMs' reasoning abilities. However, these approaches are inherently constrained by data availability and quality. In light of this, self-correction and self-learning emerge as viable solutions, employing strategies that allow LLMs to refine their outputs and learn from self-assessed rewards. Yet, the efficacy of LLMs in self-refining its response, particularly in complex reasoning and planning task, remains dubious. In this paper, we introduce AlphaLLM for the self-improvements of LLMs, which integrates Monte Carlo Tree Search (MCTS) with LLMs to establish a self-improving loop, thereby enhancing the capabilities of LLMs without additional annotations. Drawing inspiration from the success of AlphaGo, AlphaLLM addresses the unique challenges of combining MCTS with LLM for self-improvement, including data scarcity, the vastness search spaces of language tasks, and the subjective nature of feedback in lan

Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model

arXiv2018-11-02作者：Alexander H. Liu, Hung-yi Lee, Lin-shan Lee

In this paper we proposed a novel Adversarial Training (AT) approach for end-to-end speech recognition using a Criticizing Language Model (CLM). In this way the CLM and the automatic speech recognition (ASR) model can challenge and learn from each other iteratively to improve the performance. Since the CLM only takes the text as input, huge quantities of unpaired text data can be utilized in this approach within end-to-end training. Moreover, AT can be applied to any end-to-end ASR model using any deep-learning-based language modeling frameworks, and compatible with any existing end-to-end decoding method. Initial results with an example experimental setup demonstrated the proposed approach is able to gain consistent improvements efficiently from auxiliary text data under different scenarios.

搜索结果：Criticizing

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model

Paramount Refused to Air an Ad Criticizing Its Merger With Warner Bros.

Emergent Macro-Criticality from Micro-Critical Agents

A Critical Assessment of the Brain Criticality Hypothesis

Stationary solutions to the critical and super-critical quasi-geostrophic equation in the scaling critical Sobolev space

CriticAL: Critic Automation with Language Models

Critical edge sets in vertex-critical graphs

A critical state under weak measurement is not critical

Critical embeddings

Critical Matter

Quantum critical fans from critical lines at zero temperature

PACE: Improving Prompt with Actor-Critic Editing for Large Language Model

A critical analysis of `Relative facts do not exist. Relational quantum mechanics is incompatible with quantum mechanics' by Jay Lawrence, Marcin Markiewicz and Marek Źukowski

Critical Fluctuations in Polymer Solutions: Crossover from Criticality to Tricriticality

Critical and near-critical relaxation of holographic superfluids

Model Criticism of Bayesian Networks with Latent Variables

Model Criticism for Bayesian Causal Inference

Reply to "Comment on Renormalization group picture of the Lifshitz critical behaviors"

AI-designed universal coronavirus vaccine passes first human trial