搜索 — ResearchTracker

Children increasingly have access to Large Language Models (LLMs), which may expose them to responses that are developmentally inappropriate or require age-sensitive safety, guidance, and boundaries. Existing LLM safety evaluations largely focus on harmful-content avoidance and do not explicitly target child-facing safety. We introduce KIDBench, a benchmark for evaluating child-facing LLM safety for ages 7-11 using a developmental-psychology-grounded LLM-as-a-Judge rubric. KIDBench contains realistic child queries across ten categories, with single-turn prompts and multi-turn child-actor simulations. We compare no-cues prompts with no child context, implicit-cues prompts that suggest a child speaker, and explicit age instructions. Implicit-cues improve scores by 9-47% across models, while explicit age adds a further 10-30% gain. Cross-lingual and cultural evaluations show uneven safety behavior across languages and country contexts. Multi-turn simulations show that child-facing response quality can degrade by 6-24% from the first to worst turn. Beyond evaluation, we introduce KIDGuardLlama, a child-safety evaluator, and KIDLlama, a child-oriented response model, showing how KIDBenc

Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning

arXiv2025-03-06作者：Mohammad Amin Ghanizadeh, Mohammad Javad Dousti

In this work, we explain our approach employed in the BabyLM Challenge, which uses various methods of training language models (LMs) with significantly less data compared to traditional large language models (LLMs) and are inspired by how human children learn. While a human child is exposed to far less linguistic input than an LLM, they still achieve remarkable language understanding and generation abilities. To this end, we develop a model trained on a curated dataset consisting of 10 million words, primarily sourced from child-directed transcripts. The 2024 BabyLM Challenge initial dataset of 10M words is filtered to 8.5M. Next, it is supplemented with a randomly selected subset of TVR dataset consisting of 1.5M words of television dialogues. The latter dataset ensures that similar to children, the model is also exposed to language through media. Furthermore, we reduce the vocabulary size to 32,000 tokens, aligning it with the limited vocabulary of children in the early stages of language acquisition. We use curriculum learning and is able to match the baseline on certain benchmarks while surpassing the baseline on others. Additionally, incorporating common LLM training datasets,

搜索结果：Journal of child language

The Age of Curiosity Meets the Age of AI: Benchmarking Child Safety in Large Language Models

Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning

Are nationally oriented journals indexed in Scopus becoming more international? The effect of publication language and access modality

Is Child-Directed Speech Effective Training Data for Language Models?

Aggregated journal-journal citation relations in Scopus and Web-of-Science matched and compared in terms of networks, maps, and interactive overlays

Statistical Modelling of Citation Exchange Between Statistics Journals

An End-to-End Approach for Child Reading Assessment in the Xhosa Language

Can large audio language models understand child stuttering speech? speech summarization, and source separation

A systematic investigation of learnability from single child linguistic input

Journal Maps on the Basis of Scopus Data: A comparison with the Journal Citation Reports of the ISI

Benchmarking LLMs for Mimicking Child-Caregiver Language in Interaction

A Language-agnostic Model of Child Language Acquisition

An Overview of Indian Spoken Language Recognition from Machine Learning Perspective

Child labour and schooling decision of the marginal farmer households: An empirical evidence from the East Medinipur district of West Bengal, India

Construction of a Pragmatic Base Line for Journal Classifications and Maps Based on Aggregated Journal-Journal Citation Relations

A user co-designed digital INtervention for Child LangUage DisordEr: The INCLUDE Project Protocol

Evaluation of Speech Foundation Models for ASR on Child-Adult Conversations in Autism Diagnostic Sessions

Interactive Overlays of Journals and the Measurement of Interdisciplinarity on the basis of Aggregated Journal-Journal Citations

A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text

Can Small and Reasoning Large Language Models Score Journal Articles for Research Quality and Do Averaging and Few-shot Help?