Question 1

What is the methodology paper about?

Accepted Answer

This is the companion methodology paper to the Correctness Illusion (arXiv:2606.20128). It studies which kinds of test inputs actually find bugs in LLM-generated GPU kernels. Existing benchmarks use uniformly sampled inputs; we show that op-schema-aware seeded fuzzing finds 4-8x more bugs in less time. We also characterize the input types that catch the most bugs: boundary conditions, large inputs, and memory-layout edge cases.
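As a toy illustration of why boundary inputs matter (not taken from the paper; `blocked_sum`, `BLOCK`, and the bug itself are hypothetical), consider a tiled reduction that silently drops any tail shorter than one tile. Uniformly sampled values at a single fixed, tile-aligned length never expose the bug, while a handful of boundary lengths do:

```python
import random

BLOCK = 4  # hypothetical tile size for the toy kernel


def ref_sum(xs):
    """Reference implementation the kernel is checked against."""
    return sum(xs)


def blocked_sum(xs):
    """Toy 'kernel' with a tiling bug: a tail shorter than BLOCK is dropped."""
    total = 0
    full = len(xs) // BLOCK * BLOCK  # length rounded down to a whole tile
    for i in range(0, full, BLOCK):
        total += sum(xs[i:i + BLOCK])
    return total


def count_failures(inputs):
    """Number of inputs on which the kernel disagrees with the reference."""
    return sum(1 for xs in inputs if blocked_sum(xs) != ref_sum(xs))


rng = random.Random(0)  # fixed seed so the run is reproducible

# Uniform sampling: random values, but always one tile-aligned length.
uniform_inputs = [[rng.randint(1, 9) for _ in range(16)] for _ in range(100)]

# Boundary inputs: lengths chosen around the tile size.
boundary_lengths = [0, 1, BLOCK - 1, BLOCK, BLOCK + 1, 2 * BLOCK - 1]
boundary_inputs = [[1] * n for n in boundary_lengths]

uniform_failures = count_failures(uniform_inputs)
boundary_failures = count_failures(boundary_inputs)
print(uniform_failures, boundary_failures)
```

In this sketch the 100 uniform inputs all pass, while 4 of the 6 boundary inputs fail (every length that is not a multiple of `BLOCK`). Real kernels and real bugs are more varied; the point is only that input shape, not input count, decides whether the bug is reachable.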

Question 2

How is this different from the Correctness Illusion paper?

Accepted Answer

The Correctness Illusion paper (arXiv:2606.20128) presents the empirical finding and the 26-op corpus. This paper (arXiv:2606.27396) presents the methodology: what makes a good test input, how to generate it, and how it compares to existing approaches. They are companion papers, published together in June 2026.

Question 3

What is op-schema-aware seeded fuzzing?

Accepted Answer

Op-schema-aware seeded fuzzing is a technique that takes the operator schema (input shapes, dtypes, constraints) and generates seed inputs that exercise the boundaries of that schema. Unlike random fuzzing, the seeds are deterministic and reproducible. Unlike uniform sampling, the seeds target the bugs a test suite is most likely to miss. We compare 4 input-generation strategies across 26 ops and show that op-schema-aware seeded fuzzing finds 4-8x more bugs in less time.
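A minimal sketch of the idea, under stated assumptions: `OpSchema`, `boundary_dims`, `generate_seeds`, the `tile` field, and the choice of edge values are all hypothetical names and choices made for illustration, not the paper's actual implementation. The sketch derives shapes from dimension boundaries and fill values from dtype extremes, and fixes the random seed so the same schema always yields the same seed list:

```python
import itertools
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class OpSchema:
    """Hypothetical operator schema; not the paper's data structure."""
    name: str
    rank: int        # number of input dimensions
    max_dim: int     # constraint: largest allowed size per dimension
    dtype: str       # "float16" or "int8" in this sketch
    tile: int = 32   # assumed tile width used to pick boundary sizes


# Extreme values per dtype (float16 max is 65504, smallest subnormal is 2**-24).
DTYPE_EDGES = {
    "float16": [0.0, -0.0, 65504.0, -65504.0, 2.0 ** -24],
    "int8": [0, -128, 127],
}


def boundary_dims(schema):
    """Dimension sizes at and around the tile boundary, within constraints."""
    candidates = {1, schema.tile - 1, schema.tile, schema.tile + 1, schema.max_dim}
    return sorted(d for d in candidates if 1 <= d <= schema.max_dim)


def generate_seeds(schema, seed=0):
    """Deterministically enumerate seed descriptors for one operator."""
    rng = random.Random(seed)  # fixed seed: same schema, same seed list
    seeds = []
    for shape in itertools.product(boundary_dims(schema), repeat=schema.rank):
        for edge in DTYPE_EDGES[schema.dtype]:
            seeds.append({
                "shape": shape,
                "fill": edge,                     # dtype edge value to inject
                "rng_seed": rng.getrandbits(32),  # for any remaining random fill
            })
    return seeds


schema = OpSchema(name="softmax", rank=2, max_dim=64, dtype="float16")
seeds = generate_seeds(schema)
print(len(seeds), seeds[0]["shape"])
```

Each seed here is a descriptor rather than a materialized tensor, so the list stays cheap to store and replay; a harness would turn a descriptor into an actual input for the kernel under test. Calling `generate_seeds(schema)` twice returns identical lists, which is the reproducibility property the answer above contrasts with random fuzzing.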

Test-Input Generation for Tensor Programs: What Actually Finds Kernel Bugs

Abstract


