Back to News
Market Impact: 0.15

GitHub Copilot will use your data for AI training by default, but you can opt out

MSFT
Artificial IntelligenceTechnology & InnovationCybersecurity & Data PrivacyRegulation & LegislationPatents & Intellectual Property
GitHub Copilot will use your data for AI training by default, but you can opt out

On April 24 GitHub will begin using interaction data from Copilot Free, Pro, and Pro+ users to train AI models by default for individual accounts, prompting user backlash (117 thumbs-down on the announcement). Copilot Business and Enterprise accounts are excluded and GitHub says private repositories at rest won't be used, but session data while using Copilot inside private repos may be collected unless users opt out via the Copilot Privacy setting. Data may be shared with affiliates including Microsoft but not third-party model providers. The change is causing negative developer sentiment and potential trust/usage risk, though it is unlikely to move markets materially in the near term.

Analysis

The immediate second-order impact is behavioral: developer trust, not raw capability, will determine training data velocity. If a conservative 40-60% of paying individual users disable data-sharing, the effective signal fed back into model updates could drop by roughly one-third within 1-3 months, slowing cadence of incremental accuracy gains and increasing MSFT’s marginal cost per improvement (more compute/annotation per percentage point of acceptance-rate lift). Regulatory and legal exposure is low-probability but high-consequence. Expect a wave of state-level privacy inquiries and a handful of class actions within 3-18 months; even a modest GDPR-style enforcement or settlement (0.5-1% of Microsoft annual revenue analogously) would be immaterial to top-line but could materially increase compliance costs and slow enterprise conversion of new AI features for 12-24 months. Competitive dynamics create both a leak and an upsell path. Distrust among individual devs drives uptake of self-hosted or rival IDE plugins (GitLab, JetBrains integrations) and accelerates enterprise procurement of “safer” paid tiers; this bifurcation favors vendors that can credibly promise data isolation and auditors, boosting TAM capture for compliance/security vendors while compressing MSFT’s consumer ARPU but supporting enterprise ARR. For MSFT equity, the net is modestly negative near-term (headline-driven volatility) but structurally neutral-long if Microsoft successfully converts churn into enterprise upsells. The key watchables are opt-out rates by cohort, any regulatory letters in 3-6 months, and migration metrics from individual to business accounts over the next 6-12 months.