Search - ResearchTracker

Long-context modeling is one of the critical capabilities of language AI for digesting and reasoning over complex information pieces. In practice, long-context capabilities are typically built into a pre-trained language model (LM) through a carefully designed context extension stage, with the goal of producing generalist long-context capabilities. In our preliminary experiments, however, we discovered that the current open-weight generalist long-context models are still lacking in practical long-context processing tasks. While this means perfectly effective long-context modeling demands task-specific data, the cost can be prohibitive. In this paper, we draw inspiration from how humans process a large body of information: a lossy retrieval stage ranks a large set of documents while the reader ends up reading deeply only the top candidates. We build an automatic data synthesis pipeline that mimics this process using short-context LMs. The short-context LMs are further tuned using these self-generated data to obtain task-specific long-context capabilities. Similar to how pre-training learns from imperfect data, we hypothesize and further demonstrate that the short-c

ACER: An AST-based Call Graph Generator Framework

arXiv, 2023-08-29. Authors: Andrew Chen, Yanfu Yan, Denys Poshyvanyk

We introduce ACER, an AST-based call graph generator framework. ACER leverages tree-sitter to interface with any language. We opted to focus on generators that operate on abstract syntax trees (ASTs) due to their speed and simplicity in certain scenarios; however, a fully quantified intermediate representation usually provides far better information at the cost of requiring compilation. To evaluate our framework, we created two context-insensitive Java generators and compared them to existing open-source Java generators.
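To make the idea of an AST-based, context-insensitive call graph concrete, here is a minimal sketch. It is not ACER's implementation: it swaps tree-sitter and Java for Python's standard-library `ast` module so that it runs without extra dependencies, and it resolves calls by bare name only. The names `SOURCE`, `build_call_graph`, and `call_graph` are hypothetical and chosen for illustration.

```python
import ast
from collections import defaultdict

# Hypothetical input program used only for this illustration.
SOURCE = '''
def helper(x):
    return x + 1

def compute(x):
    return helper(x) * 2

def main():
    print(compute(3))
'''


def build_call_graph(source: str) -> dict[str, set[str]]:
    """Map each top-level function to the names it calls directly.

    Assumption: calls are resolved purely by name, with no type or
    context information, which is what makes this context-insensitive.
    Method calls such as obj.method() are ignored in this sketch.
    """
    tree = ast.parse(source)
    graph: dict[str, set[str]] = defaultdict(set)
    for node in tree.body:
        if isinstance(node, ast.FunctionDef):
            graph[node.name]  # register the function even if it calls nothing
            for child in ast.walk(node):
                # Keep only plain calls of the form name(...)
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name):
                    graph[node.name].add(child.func.id)
    return dict(graph)


call_graph = build_call_graph(SOURCE)
for caller, callees in call_graph.items():
    print(caller, "->", sorted(callees))
```

Because the sketch works from the syntax tree alone, it needs no compilation step, which mirrors the speed and simplicity trade-off the abstract describes. It also cannot tell which definition a name refers to when several share it, the kind of information a compiled intermediate representation would supply.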

Search results: Acer

ACER: Automatic Language Model Context Extension via Retrieval

ACER: An AST-based Call Graph Generator Framework

ACERAC: Efficient reinforcement learning in fine time discretization

ComBench: A Repo-level Real-world Benchmark for Compilation Error Repair

New Insights into Automatic Treatment Planning for Cancer Radiotherapy Using Explainable Artificial Intelligence

From Amateur to Master: Infusing Knowledge into LLMs via Automated Curriculum Learning

The Energy Blind Spot: NVIDIA's Flagship Edge AI Hardware Cannot Support Process-Level Energy Attribution

Actor Critic with Experience Replay-based automatic treatment planning for prostate cancer intensity modulated radiotherapy

LongCoT: Benchmarking Long-Horizon Chain-of-Thought Reasoning

Generalized Disguise Makeup Presentation Attack Detection Using an Attention-Guided Patch-Based Framework

Cascaded Robust Rectification for Arbitrary Document Images

Flow Augmentation and Knowledge Distillation for Lightweight Face Presentation Attack Detection

A Quantum-Secure and Blockchain-Integrated E-Voting Framework with Identity Validation

Paired-Sampling Contrastive Framework for Joint Physical-Digital Face Attack Detection

Dynamics of Resource Allocation in O-RANs: An In-depth Exploration of On-Policy and Off-Policy Deep Reinforcement Learning for Real-Time Applications

Supervised Contrastive Learning for Snapshot Spectral Imaging Face Anti-Spoofing

Adaptive Opponent Policy Detection in Multi-Agent MDPs: Real-Time Strategy Switch Identification Using Running Error Estimation

Optimal Intervention Strategies and Cost-effectiveness Analysis study of Tuberculosis with reference to TPT, Malnutrition and Diabetes Management

An AI-Native Runtime for Multi-Wearable Environments

Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues