Shayne Longpre

Member of Technical Staff, Anthropic

PhD, MIT · Founder, Data Provenance Initiative

I study the data behind AI systems and the public systems AI is reshaping: the web, markets, science, governance, and the information commons.

Email CV Scholar X LinkedIn YouTube GitHub

Bio

Researcher, builder, and public technologist.

I am a Member of Technical Staff at Anthropic and an MIT PhD studying data for AI systems, evaluation, open models, and AI's public impact.

I founded the Data Provenance Initiative; more broadly, my work moves between technical AI research, empirical audits, and public arguments about how AI should be built, measured, and governed, with six best or outstanding paper awards and coverage in NYT, WaPo, The Atlantic, and MIT Tech Review.

Stanford2012-2018Apple2018-2021MIT2021-2026Google Brain2023, 2024Anthropic2026-

Recent Updates

2026FLARE-AI accepted to ICML and featured in Wired; ThoughtTrace received Best Paper at the ICML RLxF Workshop.

2026Joined Anthropic as Member of Technical Staff.

2026Open model ecosystem data featured in the Stanford AI Index Report.

2025ATLAS released, with practical scaling laws for multilingual transfer.

2025FlexOlmo accepted to NeurIPS as a Spotlight and featured in Wired.

2025Leaderboard Illusion accepted to NeurIPS and covered by TechCrunch, Ars Technica, 404 Media, and others.

Featured Work

Dashboards, audits, reports, and papers built for technical scrutiny and public use.

Research collective · infrastructure · public data audit

Data Provenance Initiative

A 50+ member research initiative auditing the licensing, attribution, consent, and transparency of the data that powers AI systems.

Project Explorer Nature MI paper

Live dashboards · open intelligence · concentration of power

Open Model Ecosystems

Empirical work on open model economies, open-weight model diffusion, and the institutions shaping global AI capability access.

Open model dashboard Economies report Stanford AI Index

Flaw disclosure · safe harbor · accountability

Third-Party AI Evaluation

Research and policy work arguing for robust independent AI evaluation, coordinated disclosure, and legal protections for public-interest auditing.

ICML paper Safe Harbor Wired