搜索结果：study

共找到 20 条结果

高级筛选 ▾

Malware Detection based on API Calls: A Reproducibility Study

arXiv2026-01-13作者：Juhani Merilehto

This study independently reproduces the malware detection methodology presented by Felli cious et al. [7], which employs order-invariant API call frequency analysis using Random Forest classification. We utilized the original public dataset (250,533 training samples, 83,511 test samples) and replicated four model variants: Unigram, Bigram, Trigram, and Combined n gram approaches. Our reproduction successfully validated all key findings, achieving F1-scores that exceeded the original results by 0.99% to 2.57% across all models at the optimal API call length of 2,500. The Unigram model achieved F1=0.8717 (original: 0.8631), confirming its ef fectiveness as a lightweight malware detector. Across three independent experimental runs with different random seeds, we observed remarkably consistent results with standard deviations be low 0.5%, demonstrating high reproducibility. This study validates the robustness and scientific rigor of the original methodology while confirming the practical viability of frequency-based API call analysis for malware detection.

Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code

arXiv2026-04-07作者：Dominik Blain, Maxime Noiseux

AI coding assistants are now used to generate production code in security-sensitive domains, yet the exploitability of their outputs remains unquantified. We address this gap with Broken by Default: a formal verification study of 3,500 code artifacts generated by seven widely-deployed LLMs across 500 security-critical prompts (five CWE categories, 100 prompts each). Each artifact is subjected to the Z3 SMT solver via the COBALT analysis pipeline, producing mathematical satisfiability witnesses rather than pattern-based heuristics. Across all models, 55.8% of artifacts contain at least one COBALT-identified vulnerability; of these, 1,055 are formally proven via Z3 satisfiability witnesses. GPT-4o leads at 62.4% (grade F); Gemini 2.5 Flash performs best at 48.4% (grade D). No model achieves a grade better than D. Six of seven representative findings are confirmed with runtime crashes under GCC AddressSanitizer. Three auxiliary experiments show: (1) explicit security instructions reduce the mean rate by only 4 points; (2) six industry tools combined miss 97.8% of Z3-proven findings; and (3) models identify their own vulnerable outputs 78.7% of the time in review mode yet generate them

搜索结果：study

Malware Detection based on API Calls: A Reproducibility Study

Broken by Default: A Formal Verification Study of Security Vulnerabilities in AI-Generated Code

Correction and standardisation of lung oscillometry techniques using parameter inference: A study group report

Decomposing Docker Container Startup Performance: A Three-Tier Measurement Study on Heterogeneous Infrastructure

Practical Limits of Autonomous Test Repair: A Multi-Agent Case Study with LLM-Driven Discovery and Self-Correction

Revisiting SVD and Wavelet Difference Reduction for Lossy Image Compression: A Reproducibility Study

Adolescent sports participation and health in early adulthood: An observational study

Agora Elevator Bodily Sensation Study -- a report

A study guide for the $\ell^2$ decoupling theorem for the paraboloid

Dermatologist-like explainable AI enhances melanoma diagnosis accuracy: eye-tracking study

A study guide for "On the Hausdorff dimension of Furstenberg sets and orthogonal projections in the plane" after T. Orponen and P. Shmerkin

Reproducibility Study of CDUL: CLIP-Driven Unsupervised Learning for Multi-Label Image Classification

Associations Between Sleep Efficiency Variability and Cognition Among Older Adults: Cross-Sectional Accelerometer Study

Uranus Study Report: KISS

A Large-Scale Study on the Prevalence and Usage of TEE-based Features on Android

Recommended Implementation of Quantitative Susceptibility Mapping for Clinical Research in The Brain: A Consensus of the ISMRM Electro-Magnetic Tissue Properties Study Group

Study on the Resolution of Large-Eddy Simulations for Supersonic Jet Flows

Understanding Digits in Identifier Names: An Exploratory Study

User Study for Improving Tools for Bible Translation

Gathering Strength, Gathering Storms: The One Hundred Year Study on Artificial Intelligence (AI100) 2021 Study Panel Report