Apple's pre-computed ML metadata (scenes, aesthetics, OCR) on personal photos reveals measurable media personality dimensions without requiring custom image classification.
personalitymediaapple-photospublic-lab
HypothesisApple's pre-computed ML metadata (scenes, aesthetics, OCR) on personal photos reveals measurable media personality dimensions without requiring custom image classification.
Result: pending

Apple Photos Media Personality Distillation
Experiment 4 in The Public Lab personality series. Each photo becomes a searchable vault frame with Apple’s ML metadata and a pointer to the original file.
Method
- Ingest: Rust parser reads Photos.sqlite (read-only), extracts ZASSET + computed attributes + scene classifications + OCR text
- Frame generation: One markdown frame per asset at
~/vault/media/apple-photos/{uuid}.md - OCR extraction: bplist → LZFSE decompress → CRWordOutputRegion text extraction
- Vault integration:
rv indexpicks up frames, making photos searchable by content, scene, or aesthetic score
Corpus
- 1,299 assets (1,248 photos, 51 videos, 1 screenshot)
- Date range: 2020-2026 (partial library, bulk import)
- 879 scene classifications, 18 OCR-bearing assets
- 100% GPS location coverage (where valid)
- 16 Apple aesthetic scores per asset
Key Innovation
No media duplication: media_path field points to original. Agents access photos via vault search, not file browsing. OCR text as frame body makes document photos findable by content.
Results
- 1,299 vault frames generated in ~500ms
- 18 assets with extractable OCR text (documents, receipts, legal papers)
- Pipeline integrated as Step 15d of BloomNet ingest
- Frames searchable via
rv search