搜索 — ResearchTracker

Semantic representations can be framed as a structured, dynamic knowledge space through which humans navigate to retrieve and manipulate meaning. To investigate how humans traverse this geometry, we introduce a framework that represents concept production as navigation through embedding space. Using different transformer text embedding models, we construct participant-specific semantic trajectories based on cumulative embeddings and extract geometric and dynamical metrics, including distance to next, distance to centroid, entropy, velocity, and acceleration. These measures capture both scalar and directional aspects of semantic navigation, providing a computationally grounded view of semantic representation search as movement in a geometric space. We evaluate the framework on four datasets across different languages, spanning different property generation tasks: Neurodegenerative, Swear verbal fluency, Property listing task in Italian, and in German. Across these contexts, our approach distinguishes between clinical groups and concept types, offering a mathematical framework that requires minimal human intervention compared to typical labor-intensive linguistic pre-processing metho

Abusive text transformation using LLMs

arXiv2025-07-14作者：Rohitash Chandra, Jiyong Choi

Although Large Language Models (LLMs) have demonstrated significant advancements in natural language processing tasks, their effectiveness in the classification and transformation of abusive text into non-abusive versions remains an area for exploration. In this study, we aim to use LLMs to transform abusive text (tweets and reviews) featuring hate speech and swear words into non-abusive text, while retaining the intent of the text. We evaluate the performance of two state-of-the-art LLMs, such as Gemini, GPT-4o, DeekSeek and Groq, on their ability to identify abusive text. We them to transform and obtain a text that is clean from abusive and inappropriate content but maintains a similar level of sentiment and semantics, i.e. the transformed text needs to maintain its message. Afterwards, we evaluate the raw and transformed datasets with sentiment analysis and semantic analysis. Our results show Groq provides vastly different results when compared with other LLMs. We have identified similarities between GPT-4o and DeepSeek-V3.

搜索结果：Swears

Characterizing Human Semantic Navigation in Concept Production as Trajectories in Embedding Space

Abusive text transformation using LLMs

SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use

WSCoach: Wearable Real-time Auditory Feedback for Reducing Unwanted Words in Daily Communication

I Stolenly Swear That I Am Up to (No) Good: Design and Evaluation of Model Stealing Attacks

Examining Online Social Support for Countering QAnon Conspiracies

ID-XCB: Data-independent Debiasing for Fair and Accurate Transformer-based Cyberbullying Detection

Diagnosing Hate Speech Classification: Where Do Humans and Machines Disagree, and Why?

Can ChatGPT capture swearing nuances? Evidence from translating Arabic oaths

Revisiting the DARPA Communicator Data using Conversation Analysis

LIP: Lightweight Intelligent Preprocessor for meaningful text-to-speech

Contextual-Lexicon Approach for Abusive Language Detection

Challenges in Automated Debiasing for Toxic Language Detection

The fully-visible Boltzmann machine and the Senate of the 45th Australian Parliament in 2016

Learning from Fact-checkers: Analysis and Generation of Fact-checking Language

Quantifying Intimacy in Language

Analyzing the hate and counter speech accounts on Twitter

Predicting gender and age categories in English conversations using lexical, non-lexical, and turn-taking features

Identifying Offensive Expressions of Opinion in Context

Development of Security Detection Model for the Security of Social Blogs and Chatting from Hostile Users