搜索结果：Data in brief

共找到 20 条结果

高级筛选 ▾

A Brief Survey on Deep Learning Based Data Hiding

arXiv2021-03-02作者：Chaoning Zhang, Chenguo Lin, Philipp Benz

Data hiding is the art of concealing messages with limited perceptual changes. Recently, deep learning has enriched it from various perspectives with significant progress. In this work, we conduct a brief yet comprehensive review of existing literature for deep learning based data hiding (deep hiding) by first classifying it according to three essential properties (i.e., capacity, security and robustness), and outline three commonly used architectures. Based on this, we summarize specific strategies for different applications of data hiding, including basic hiding, steganography, watermarking and light field messaging. Finally, further insight into deep hiding is provided by incorporating the perspective of adversarial attack.

Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions

arXiv2025-04-08作者：Oded Ovadia, Meni Brief, Rachel Lemberg

While Large Language Models (LLMs) acquire vast knowledge during pre-training, they often lack domain-specific, new, or niche information. Continual pre-training (CPT) attempts to address this gap but suffers from catastrophic forgetting and inefficiencies in low-data regimes. We introduce Knowledge-Instruct, a novel approach to efficiently inject knowledge from limited corpora through pure instruction-tuning. By generating information-dense synthetic instruction data, it effectively integrates new knowledge while preserving general reasoning and instruction-following abilities. Knowledge-Instruct demonstrates superior factual memorization, minimizes catastrophic forgetting, and remains scalable by leveraging synthetic data from relatively small language models. Additionally, it enhances contextual understanding, including complex multi-hop reasoning, facilitating integration with retrieval systems. We validate its effectiveness across diverse benchmarks, including Companies, a new dataset that we release to measure knowledge injection capabilities.

搜索结果：Data in brief

A Brief Survey on Deep Learning Based Data Hiding

Knowledge-Instruct: Effective Continual Pre-training from Limited Data using Instructions

Training Data Reduction for Performance Models of Data Analytics Jobs in the Cloud

A Semantic Approach for Big Data Exploration in Industry 4.0

Data Sharing in the PRIMED Consortium: Design, implementation, and recommendations for future policymaking

Interoperability-oriented Quality Assessment for Czech Open Data

BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression

RADx Data Hub: A Cloud Platform for FAIR, Harmonized COVID-19 Data

Augmenting Anonymized Data with AI: Exploring the Feasibility and Limitations of Large Language Models in Data Enrichment

Amplify Initiative: Building A Localized Data Platform for Globalized AI

BRIEF-Pro: Universal Context Compression with Short-to-Long Synthesis for Fast and Accurate Multi-Hop Reasoning

Building a Disciplinary, World-Wide Data Infrastructure

Small Data Explainer -- The impact of small data methods in everyday life

On the Convergence of Federated Learning Algorithms without Data Similarity

Two-dimensional solitons in nonlocal media: a brief review

TerraGen: A Unified Multi-Task Layout Generation Framework for Remote Sensing Data Augmentation

Russian-German Astroparticle Data Life Cycle Initiative

Robust principal graphs for data approximation

Honest Computing: Achieving demonstrable data lineage and provenance for driving data and process-sensitive policies

Framework for Inferring Following Strategies from Time Series of Movement Data