搜索结果：Psychoanalytic review

共找到 20 条结果

高级筛选 ▾

ReviewEval: An Evaluation Framework for AI-Generated Reviews

arXiv

The escalating volume of academic research, coupled with a shortage of qualified reviewers, necessitates innovative approaches to peer review. In this work, we propose: 1. ReviewEval, a comprehensive evaluation framework for AI-generated reviews that measures alignment with human assessments, verifies factual accuracy, assesses analytical depth, identifies degree of constructiveness and adherence to reviewer guidelines; and 2. ReviewAgent, an LLM-based review generation agent featuring a novel alignment mechanism to tailor feedback to target conferences and journals, along with a self-refinement loop that iteratively optimizes its intermediate outputs and an external improvement loop using ReviewEval to improve upon the final reviews. ReviewAgent improves actionable insights by 6.78% and 47.62% over existing AI baselines and expert reviews respectively. Further, it boosts analytical depth by 3.97% and 12.73%, enhances adherence to guidelines by 10.11% and 47.26% respectively. This paper establishes essential metrics for AIbased peer review and substantially enhances the reliability and impact of AI-generated reviews in academic research.

Review Arcade: On the Human Alignment and Gameability of LLM Reviews

arXiv2026-05-27作者：Hans Ole Hatzel, Sebastian Steindl, Jan Strich

LLM-generated reviews for scientific papers are gaining considerable traction and are even being officially piloted by major conferences. We have to assume that not only reviewers are using LLM-assistance, but also that authors use LLMs to revise their papers before submitting. In this work, we perform empirical experiments on papers from the 2025 ACL Rolling Review (ARR) to evaluate LLM reviews from both the author and the reviewer perspective. First, we identify a limited alignment of LLM reviews with human ones. In the best-case scenario, the alignment is reasonable. However, we also find that LLM-human alignment varies substantially across prompts and models. Finally, we investigate the scenario in which the author uses an iterative draft-revise workflow to improve the submission according to the LLM review. We find that this "gaming" of LLM reviews can be effective in specific scenarios, leading to a statistically significant increase of overall scores for up to 35\% of papers. We publish our code: https://github.com/uhh-hcds/reviewarcade.

搜索结果：Psychoanalytic review

ReviewEval: An Evaluation Framework for AI-Generated Reviews

Review Arcade: On the Human Alignment and Gameability of LLM Reviews

Aspect-Guided Multi-Level Perturbation Analysis of Large Language Models in Automated Peer Review

When AI Reviews Its Own Code: Recursive Self-Training Collapse in Code LLMs

Optimizing Peer Grading: A Systematic Literature Review of Reviewer Assignment Strategies and Quantity of Reviewers

AI Assistance for Human Review of Default Judgments

AgenticSCR: An Autonomous Agentic Secure Code Review for Immature Vulnerabilities Detection

Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions

Automatically Annotating Articles Towards Opening and Reusing Transparent Peer Reviews

A Review on the Security Vulnerabilities of the IoMT against Malware Attacks and DDoS

Computational Studies in Influencer Marketing: A Systematic Literature Review

Ultracold atomic lattice systems for simulating topological phases: A review

Does the use of open, non-anonymous peer review in scholarly publishing introduce bias? Evidence from the F1000 post-publication open peer review publishing model

On-Demand Mobility Services for Infrastructure and Community Resilience: A Review toward Synergistic Disaster Response Systems

Large Language Models and Video Games: A Preliminary Scoping Review

Context in object detection: a systematic literature review

AI Literature Review Suite

User Bias Removal in Review Score Prediction

Previously on... Automating Code Review

Electron and hole $g$ factors in semiconductors and nanostructures (Review)