A computable voice profile distilled from a decade of tweets can produce engagement recommendations that match the author's authentic voice better than generic prompts
Voice profile distilled 8 measurable dimensions from 1,149 engagement-filtered tweets. Multi-agent pipeline (3 Sonnet generators + scoring + remix) pr
HypothesisA computable voice profile distilled from a decade of tweets can produce engagement recommendations that match the author's authentic voice better than generic prompts
Voice profile distilled 8 measurable dimensions from 1,149 engagement-filtered tweets. Multi-agent pipeline (3 Sonnet generators + scoring + remix) produced 10 recommendations per run. First posting round (8 replies) maintained voice authenticity. Sub-50 char constraint confirmed as 5x engagement multiplier.

Voice Distillation from Tweet Archive
Hypothesis
A computable voice profile distilled from a decade of tweets can produce engagement recommendations that match the author’s authentic voice better than generic prompts.
Method
- Parse archive: 12,459 tweets (2016-2026) from X/Twitter data export
- Filter corpus: Remove retweets, low-engagement noise, scrub targets. Result: 1,149 tweets
- Distill profile: Measure 8 dimensions deterministically (no LLM):
- Lane distribution (Geo/OSINT 45%, Humor 30%, Culture 20%, Tech 5%)
- Brevity profile (median 86 chars, sub-50 = 5x engagement)
- Punctuation habits (86.7% no terminal punctuation)
- Reply ratio (99.6% replies vs originals)
- Formality level, emoji usage, hashtag frequency
- Multi-agent generation: 3 parallel Sonnet agents (Sniper: brevity-first, Builder: thread starters, Strategist: high-value targets) generate raw recommendations
- 5-dimension scoring: Brevity (0.25), Punctuation (0.15), Register (0.15), Voice Authenticity (0.25, LLM-scored against 100-tweet sample), Engagement Potential (0.20)
- Remix: Top candidates remixed for punchiness by a fourth agent
- Post: Top 10 recommendations posted via Chrome CDP to real X account
Results
- Voice profile: 4-lane archetype “The Dry Observer” with 8 measured dimensions
- Corpus quality: 1,149 tweets after filtering (9.2% of archive), 753 old tweets scrubbed/deleted
- Pipeline throughput: ~10 scored recommendations per run, ~3 minutes total
- First posting round: 8 replies posted, voice authenticity scores 0.7-0.9 on the 100-tweet ground truth sample
- Key finding: Sub-50 character constraint is the single strongest engagement predictor (165 avg likes vs 33 for longer tweets)
So What
This confirms that voice can be imported, not hand-crafted. The 8 measured dimensions are more useful than any prompt could be because they’re grounded in actual engagement data. The multi-agent architecture (generate -> score -> remix) separates creativity from quality control, which prevents the common failure mode of a single agent trying to be both creative and constrained.
The next experiment should test whether the voice profile improves engagement rates compared to a baseline of generic “write like a tech influencer” prompting.