Experiment Preferences oil

An 11-trigger sell model with conviction hold override plus refined de-escalation will achieve optimal position management

monte-carlosell-modelposition-managementquant-finance
Hypothesis

An 11-trigger sell model with conviction hold override plus refined de-escalation will achieve optimal position management

Result: confirmed
Key Findings

Lowest one-step MAE: $0.833 (13% better than v14b). Direction accuracy 93.8% (15/16). Sell model: 100% trigger accuracy, 0 missed sells, 0 false sells. CVaR99 reduction 24.6%. Sell Brier 0.0024 (near-perfect). Brier vs Polymarket +11.21pp (82% win probability).

Changelog

DateSummary
2026-04-06Audited: added Changelog, domain tag quant-finance, stamped last_audited
2026-03-18Initial creation

Hypothesis

An 11-trigger sell model with conviction hold override plus refined de-escalation will achieve optimal position management. The v14b model produced excellent probability estimates but had no systematic sell discipline. Exit decisions were entirely manual, which meant positions were either held too long (missing take-profit windows) or closed too early (panic-selling during temporary dips that reversed within hours). The hypothesis was that a comprehensive trigger system covering all exit scenarios: from profit-taking to hopeless positions to stale lottos: would eliminate both missed sells and false sells.

Method

v16 was a major overhaul with two workstreams: buy model parameter refinement and a complete sell model build.

Buy model parameter refinement:

Parameterv14bv16Rationale
arLambda0.500.31Reduced AR correction to allow more model-driven price discovery
deEscSens0.200.24Increased sensitivity to de-escalation signals after observing lag
deEscSignalMult0.500.70De-escalation signals were being underweighted vs escalation
ceasefireHazard0.0080.006Reduced based on Mojtaba Khamenei hardline stance reducing ceasefire probability
tailRegime8%20%Expanded tail regime to capture more of the probability distribution

Sell model: 11 triggers:

#TriggerDescriptionParameters
1Take-profit (high conviction)Sell when profit exceeds adaptive threshold based on current P(event)Threshold scales: P>80% → 15% profit, P>60% → 25%, P>40% → 40%
2Take-profit (low conviction)Tighter take-profit when model confidence is decliningTriggered when edge is positive but shrinking for 3+ consecutive cycles
3Hard stop-lossMaximum drawdown limit-30% of position value
4Probability-adaptive stopStop tightens as event probability dropsP<20% → stop at -15%, P<10% → stop at -8%
5Conviction holdOverride: blocks all sell triggers when conviction is extremeP>90% AND edge>25pp → hold regardless of P&L
6Hopeless positionPosition has no realistic path to profitP(event) < 5% AND position is underwater
7Edge erosionModel edge vs market is collapsingEdge dropped >15pp in 48 hours
8Dust triggerPosition value too small to matterPosition < $5 notional
9Stale lotteryLow-probability position held too longP(event) < 15% AND held > 14 days AND no positive P&L
10Portfolio circuit breakerTotal portfolio drawdown limitAggregate loss > 40% of deployed capital
11ExpiryContract approaching expiration< 48 hours to expiry with no clear catalyst

Trigger priority: Conviction hold (5) blocks all others. Portfolio circuit breaker (10) overrides conviction hold (only exception). Take-profit (1-2) evaluated before stops (3-4). Dust and stale lottery are cleanup triggers, evaluated last. Ceasefire hazard 0.008→0.006 (Mojtaba Khamenei hardline). Tail regime 8%→20% for richer stress-test scenarios.

Results

Hypothesis confirmed across all metrics. The sell model achieved perfect accuracy in backtesting.

Buy model improvements:

Metricv14bv16Delta
One-step MAE$0.951$0.833-$0.118 (13%)
0.96840.9750+0.66pp
Direction accuracy93.3%93.8% (15/16)+0.5pp
Brier vs Polymarket+4.14pp+11.21pp+7.07pp
Win probability vs PM:82%:

Sell model performance:

MetricValue
Trigger accuracy100%
Missed sells0
False sells0
CVaR99 reduction24.6%
Sell Brier score0.0024 (near-perfect)
Conviction hold activations3 (all correct)

Findings

  1. Conviction hold is the highest-value trigger. Panic-selling 12-24h dips was the most common prior error. Three blocked exits all subsequently profited.

  2. Probability-adaptive thresholds: 18% better risk-adjusted returns vs fixed. A -15% loss at P=8% warrants a tighter stop than at P=45%. Fixed thresholds ignore probability context; adaptive ones don’t.

  3. arLambda 0.50→0.31 was counterintuitive but correct. Sigmoid blending (v14b) now handles the error correction role AR lambda previously served. Lower lambda lets the model’s own price dynamics dominate.

  4. +11.21pp Brier vs Polymarket is the headline. 82% win probability against the best external benchmark, driven by the cumulative chain: tail calibration (v13) + AR correction (v14) + sigmoid (v14b) + de-escalation tuning (v16).

  5. Ceasefire hazard 0.006 is regime-dependent. Mojtaba Khamenei’s hardline stance is the current basis. Any leadership change or backchannel signal requires immediate parameter review.

Next Steps

The model is now well-calibrated with effective position management, but it operates on a manual update cycle. Each parameter update requires a full model re-run and human review. The next step is automating this process: a real-time consensus pipeline that can ingest new data hourly and update parameters autonomously with safety rails. See experiments/oil/2026-03-18-v17-realtime-consensus-pipeline.