Journal

Removing undated events produces more honest quality scores

careerdata-engreconstructed-from-sessions

2026-03-11

Signal

Removing events without dates from the pipeline before scoring them: rather than including them and flagging low quality: is a scope definition decision: it changes what the audit measures from “all events” to “datable events,” which produces more honest and more actionable quality scores.

Evidence

  • Project: projects/jobs-apply/_index: Run performance summary; reviewed last 3 run results and assessed overall pipeline health
  • Project: internal audit: Removed events missing dates from the pipeline; reviewed all events and articles for client-deliverable alignment
  • Decision rationale: Events without dates can’t be evaluated for temporal relevance, freshness, or scheduling accuracy: including them artificially deflates every temporal metric
  • Downstream impact: Changes the denominator of the quality calculation: a 0.85 score across 1,200 datable events is a different claim than a 0.60 score across 1,500 events where 300 were unevaluatable
  • Strategic: Began consolidation discussion between internal audit pipeline and internal-sync-server projects
  • Volume: 149 automated code-review sessions (internal audit pipeline: 75, autohunt: 66)

So What (Why Should You Care)

When you audit data quality, what you exclude from the audit is as important as what you include. Events without dates aren’t “low quality” records: they’re incomplete records that fall outside the scope of temporal quality evaluation. Scoring them as zero-quality events would drag down every temporal metric for reasons unrelated to the quality of the datable events in the system.

This is the same principle behind well-defined denominators in any metric. A “pass rate” of 60% means very different things depending on whether the denominator includes records that couldn’t be evaluated or only records that were fully evaluated. In quality systems, imprecise denominators create imprecise baselines, which makes improvement tracking unreliable.

The client-deliverable alignment review today also demonstrates the value of periodic manual review passes. Automated quality scores catch systematic patterns, but a human review of all events against client-deliverable’s specific alignment requirements catches the cases where the automation’s judgment diverges from the actual business requirements. Both views are necessary; neither replaces the other.

The internal audit pipeline and internal-sync-server consolidation discussion that began today reflects a common architectural question when two projects share 80% of their domain: is the separation providing value, or is it just creating coordination overhead? Beginning this discussion is itself a quality decision: acknowledging the coordination cost before it compounds.

The run performance summary for projects/jobs-apply/_index demonstrates the value of reviewing the last 3 runs rather than just the most recent one. A single run can have an anomalous result: unusually high success rate on a good day, unusually low success rate due to a transient platform issue. Looking at three runs gives you a trend line. Is the success rate improving, declining, or flat? That trend is what drives the next iteration decision: fix the current approach, try a new approach, or scale the current approach.

Reviewing docs and Claude files before starting work on a familiar project is also a discipline worth noting explicitly. Even when you wrote the code last week, the state of a rapidly-changing project can shift significantly in a few days. The Claude file captures the current context that wasn’t obvious from the code alone. Reading it before starting prevents work that duplicates something already done or contradicts a decision already made.

149 automated code-review sessions split roughly evenly between the two projects (internal audit pipeline: 75, autohunt: 66) suggests similar levels of active development across both codebases on this day: a useful proxy for development activity when git commit logs aren’t directly available.

What’s Next

  • Finalize internal audit pipeline and internal-sync-server consolidation decision
  • Continue client-deliverable alignment review for remaining articles and events
  • Validate that datable-event filter doesn’t exclude events with partial dates

Log

  • projects/jobs-apply/_index: summarized current state and last 3 run results; assessed pipeline health
  • Reviewed docs and Claude files for current project state
  • internal audit: removed events missing dates from the pipeline
  • Reviewed all events and articles for client-deliverable alignment
  • Began consolidation discussion between internal audit pipeline and internal-sync-server: single-repo evaluation started
  • 149 automated code-review sessions (internal audit pipeline: 75, autohunt: 66)