Back to News
Market Impact: 0.28

Anthropic Knew the Public Would Be Disgusted by How It Was Destroying Physical Books, Secret Documents Reveal

META
Artificial IntelligenceTechnology & InnovationLegal & LitigationPatents & Intellectual PropertyRegulation & LegislationMedia & EntertainmentManagement & Governance

Anthropic ran a secret initiative called Project Panama in which it bought, shredded and scanned millions of used books to train its Claude AI, a practice internal documents show leadership knew looked bad; a judge deemed the physical shredding arguably fair use but the company settled a separate author suit for $1.5 billion over use of pirated texts. The disclosures — including co-founder downloads from LibGen and internal warnings about reputational fallout — heighten legal, regulatory and reputational risk for Anthropic and may draw increased scrutiny of data-sourcing practices across AI firms (with Meta also implicated), potentially affecting valuations, deal terms and regulatory negotiations in the sector.

Analysis

Market structure: Rights-holders and large cloud/compute incumbents (MSFT, GOOGL, AMZN) are likely winners as legal exposure raises the price of clean, licensed training data and amplifies demand for compliant storage/compute; small AI startups that relied on low-cost scraped/pirated corpora face margin pressure and higher unit data costs (low double-digits to high double-digits percentage increases within 12–24 months). Competitive dynamics favor firms with legal/compliance teams and scale to absorb licensing (increases pricing power for integrated players), while fragmentation among pure-play model vendors will accelerate M&A or churn. Risk assessment: Tail risks include rapid regulatory action or precedent-setting rulings (US/EU) that impose industry-wide licensing fees or punitive damages (>$1B aggregate for a cohort), materially raising compliance costs over 6–24 months. Immediate risk (days-weeks) is reputational headlines that widen implied volatility in tech names; medium-term (3–12 months) risk is wave of settlements and insurance hits; hidden dependency is the shadow supply-chain (LibGen/mirror sites) — enforcement there would spike data sourcing costs and push training to licensed/synthetic alternatives. Trade implications: Tactical trades should favor large-cap cloud/AI incumbents and content owners while hedging or shorting reputationally exposed social/ad platforms. Use options to express asymmetric views: buy 3–6 month put spreads on META (10–15% OTM) as a low-cost protection while building 1–2% longs in MSFT/GOOGL and 1% in News Corp (NWSA) for content licensing upside (12-month hold). Reduce 30–50% exposure to pure-play, small/mid AI vendors without disclosed licensing deals and shift into infrastructure providers. Contrarian angles: The market underestimates the monetization opportunity for publishers — licensing could create a new ~$500M–$2B recurring revenue pool across major publishers within 24 months, which would rerate content owners and select cloud partners. Conversely, aggressive enforcement could accelerate open-source/synthetic-data adoption, benefiting low-cost model providers and cloud hosts; watch for this second-order shift before oversizing shorts on big tech.