Skip to content

LLM-Assisted Test Automation

Coverage Expansion Modes

ModePurposeRepresentative PromptMetricsCaution
Unit CompletionRoutine test generation"Function spec → test case table"Branch coverageSpec error propagation
Spec → TestSpecification-driven"Spec Markdown → pytest"Spec diff detection rateAmbiguous language interpretation
Regression SummaryFailure log summarization"Failure log → failure cause clusters"Cause cluster accuracySensitive log leakage
Property DiscoveryInvariant extraction"Code → invariants to enforce"Invariant detection countOver-generation

Guardrails

Evaluation criteria:
- Execution success rate of generated tests
- PII contamination rate in dummy data
- Flake ratio (failures/successes over N runs)

Workflow Example

  1. Collect function docstrings
  2. LLM: Extract boundary values + error conditions
  3. Convert to property sets (e.g., Hypothesis)
  4. Initial run of generated tests & summarize failure clusters
  5. Manual review confirmation → commit to repository

Next Actions

  • Integrate test diff auto-review with GitHub Actions
  • Flake detection scoring

Back to: index.md