unit-test-coverage-targeter
Builds a "what to test next" recommendation by combining a coverage report (LCOV / Cobertura / coverage.py JSON / Jest JSON / JaCoCo XML) with the PR's `git diff`, ranking uncovered branches by risk × cost - risk weighted by McCabe cyclomatic complexity and code-churn frequency, cost weighted by the unit-test pyramid layer (unit tests cheaper than integration than E2E). Emits a prioritized list with concrete file:line targets and the test layer recommended for each. Use when a team has the budget to write 5 - 10 new tests and needs help picking which uncovered code to target first instead of blindly chasing 100% coverage.
unit-test-coverage-targeter
Overview
100% coverage isn't the goal; risk-adjusted coverage is. A 50-line uncovered helper that hasn't been changed in 4 years is lower priority than a 10-line uncovered branch on a payment-handling function that gets edited every week.
This skill builds a ranked list of uncovered branches, scoring each by:
The output is 5 - 10 specific test targets the team can take in one PR - not a 200-line "everything uncovered" dump.
When to use
Step 1 - Inputs
Two required + two optional inputs:
| Input | Required | Source |
|---|---|---|
| Coverage report | ✓ | LCOV / Cobertura / Jest JSON / coverage.py JSON / JaCoCo XML |
| Source tree | ✓ | Project root checkout |
git log history | opt | For churn weighting (Step 3). |
| PR diff | opt | For PR-touch weighting (Step 4). |
Step 2 - Extract uncovered branches
The shape (after parsing per upstream parser):
{
'path': 'src/checkout/cart.ts',
'uncovered_branches': [
{ 'line': 42, 'condition': 'if (item.stock < quantity)', 'arms_uncovered': 1, 'arms_total': 2 },
{ 'line': 78, 'condition': 'switch (status)', 'arms_uncovered': 3, 'arms_total': 5 },
],
'uncovered_lines': [33, 34, 35, 92, 93],
'covered_pct': 65.4,
}Per language tool:
Step 3 - Compute the risk weight
def risk_weight(file_path, branch):
cyclomatic = mccabe_complexity(file_path, branch.line)
churn = git_churn(file_path, days=90)
return (
normalize(cyclomatic, 1, 20) * 0.5 # 50% complexity weight
+ normalize(churn, 0, 50) * 0.5 # 50% churn weight
)McCabe cyclomatic complexity
Per cyclomatic: McCabe's formula is M = E - N + 2 (edges − nodes + 2). The threshold convention:
"1 - 10: Simple procedure, little risk" "11 - 20: More complex, moderate risk" "21 - 50: Complex, high risk" ">50: Untestable code, very high risk" (cyclomatic)
Compute via per-language tools:
# Python
radon cc src/ -a
# JavaScript / TypeScript
npx complexity-report src/ -f json
# Java
# JaCoCo's COMPLEXITY counter (already in jacoco.xml per jacoco-analysis)Function with cyclomatic >10 + uncovered branch = strong target.
Git churn
# Commits in the last 90 days touching this file
git log --since='90 days ago' --format= -- src/checkout/cart.ts | wc -lHigh-churn files are where bugs accumulate; uncovered branches in high-churn files are the highest-priority targets.
Per cyclomatic: "reducing the cyclomatic complexity of code is not proven to reduce the number of errors or bugs in that code." Use complexity as a risk indicator, not a goal.
Step 4 - PR-touch boost
If the PR diff (Step 1 optional input) changed a file, boost its risk weight by 1.5×. Newly-edited code is strictly higher regression risk than untouched code.
def pr_boost(file_path, pr_changed_files):
return 1.5 if file_path in pr_changed_files else 1.0
final_risk = risk_weight(...) * pr_boost(...)Step 5 - Cost weight (test pyramid layer)
Per test-pyramid, the canonical layers (Cohn 2009):
| Layer | Cost (relative) | Recommend for |
|---|---|---|
| Unit | 1× | Pure functions, business-logic branches. |
| Service / API | 3× | Cross-module integration, controllers, repos. |
| UI / E2E | 10× | User flows, browser interactions. |
Per test-pyramid: "you should have many more low-level UnitTests than high level BroadStackTests running through a GUI ... UI tests are brittle, expensive to write, and time consuming to run."
The targeter classifies each candidate branch by file path heuristic:
def layer(path):
if any(s in path for s in ['/components/', '/views/', '/pages/', '/e2e/']):
return 'ui-or-e2e'
if any(s in path for s in ['/api/', '/routes/', '/controllers/', '/services/']):
return 'service'
return 'unit'Step 6 - Score and rank
COST_BY_LAYER = {'unit': 1, 'service': 3, 'ui-or-e2e': 10}
for candidate in candidates:
candidate['score'] = candidate['risk'] / COST_BY_LAYER[candidate['layer']]
candidates.sort(key=lambda c: c['score'], reverse=True)Score = risk / cost. A unit-layer branch with risk 0.6 (score 0.60) beats a UI-layer branch with risk 0.9 (score 0.09).
Take the top 5 - 10 candidates (configurable). More than 10 is a to-do list, not a recommendation.
Step 7 - Render
## Uncovered branches — recommended targets (top 7)
This PR adds 12 uncovered branches in code touched by the PR. The
prioritized list focuses test budget on high-risk × low-cost
targets per the [test pyramid][tp].
| # | File | Line | Branch | Layer | Risk | Recommendation |
|---|-------------------------------------|-----:|-------------------------------------|-------|-----:|----------------|
| 1 | `src/checkout/promo.ts` | 42 | `if (codeIsValid && !expired)` | unit | 0.91 | Add a unit test for the expired-code path. |
| 2 | `src/checkout/promo.ts` | 78 | `switch (codeType)` arm `'BOGO'` | unit | 0.85 | Add a unit test per arm; BOGO arm uncovered. |
| 3 | `src/api/payments.ts` | 134 | `if (provider === 'stripe')` | service | 0.82 | Add an integration test for the stripe path. |
| 4 | `src/checkout/cart.ts` | 21 | `if (item.stock < quantity)` | unit | 0.78 | Add a unit test for stock-shortfall edge case. |
| 5 | `src/api/payments.ts` | 200 | `try / catch` (catch arm) | service | 0.65 | Add a test that triggers the failure path. |
| 6 | `src/components/CheckoutModal.tsx` | 88 | `if (showPromoBanner)` | unit | 0.55 | Add a snapshot/component test for the banner-hidden case. |
| 7 | `src/api/orders.ts` | 156 | `if (status === 'fulfilled')` | service | 0.48 | Add an integration test for the fulfilled-status path. |
### Skipped (low score)
5 candidates skipped — see `coverage-targets.json` for the full
list. Highest-risk skipped: UI-layer test that would catch a 0.9
risk branch but cost 10× a unit test (score 0.09).
### Approach
For each target above, the recommended test shape:
- **Unit (5)**: Add a per-branch test in the existing `*.spec.ts`;
use `expect.fail` if no path triggers the branch yet.
- **Service (2)**: Use [`testcontainers`](../../qa-test-environment/skills/testcontainers/SKILL.md)
to bring up a real Postgres + the Stripe sandbox; assert the
branch outcome.Step 8 - Wire into CI as advisory
This skill is advisory, not gating. Post the recommendation as a PR comment via coverage-diff-reporter's sticky-comment mechanism (see coverage-diff-reporter Step 7) but don't fail the build on it:
- name: Generate coverage targets
if: github.event_name == 'pull_request'
run: |
python scripts/coverage_targets.py \
--coverage current.json \
--diff <(git diff --name-only origin/${{ github.base_ref }}...HEAD) \
--top 7 \
> targets.md
- name: Post sticky comment
uses: marocchino/sticky-pull-request-comment@v2
with:
header: coverage-targets
path: targets.mdAnti-patterns
| Anti-pattern | Why it fails | Fix |
|---|---|---|
| Recommending coverage targets without considering test layer | Suggests UI tests for branches that should be unit-tested; cost ignored. | Score = risk / cost (Step 6). |
| Listing every uncovered branch | 200-row list; team ignores. | Top 5 - 10 (Step 6). |
| Pure-coverage chase (every uncovered line is a target) | Hits 100% by writing low-value tests; signal-to-noise collapses. | Risk-weight by complexity + churn (Step 3). |
| Treating cyclomatic complexity as a goal | Per cyclomatic: reducing complexity isn't proven to reduce defects. | Use complexity as a risk indicator only (Step 3). |
| Ignoring PR-changed files | A new function added in the PR with 0 coverage isn't surfaced. | PR-touch boost (Step 4). |
Recommending tests for code marked # pragma: no cover / istanbul ignore | The team explicitly excluded that code. | Skip files / lines with explicit ignores. |
| Blocking the build on the recommendation | Recommendation is opinion; gating turns it into bureaucracy. | Advisory comment only (Step 8); the actual gate is lcov-analysis / cobertura-analysis. |