About NewsDiff

News doesn't end when it's published. Articles are quietly edited — headlines softened, paragraphs rewritten, context added or removed. NewsDiff makes that process visible.

Why this exists

The newsroom is never really closed. Stories evolve as situations develop, sources respond, and editorial judgment shifts. Sometimes changes reflect genuine corrections. Sometimes they reflect something else — political pressure, legal threats, or simply a different editorial wind.

Whatever the reason, the public deserves to see how the record changes. NewsDiff monitors RSS feeds from news outlets, extracts article content, and tracks every edit. When something changes, it shows you exactly what — word by word.

How it works

  1. Monitor — RSS feeds are polled regularly. New articles are fetched and their content extracted automatically.
  2. Compare — Each time an article is checked, its content is compared against the previous version using word-level diffing.
  3. Display — Changes are shown inline: removed text and added text, just like track changes in a document.
  4. Share — Every diff can be exported as an image or shared directly to social media. A NewsDiff bot also posts each diff to the fediverse automatically.
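
To make the Compare and Display steps concrete, here is a minimal sketch of word-level diffing with inline markers. It is an illustration only, not NewsDiff's actual code: the credits list jsdiff for diff computation, whereas this example swaps in a small hand-written longest-common-subsequence routine so it runs without dependencies. The names `tokenize`, `diffWords`, `renderInline`, and the `[-…-]` / `{+…+}` markers are all assumptions made for the example.

```typescript
// Hypothetical sketch: a dependency-free word-level diff using a
// longest-common-subsequence (LCS) table. NewsDiff itself uses jsdiff.
type Op = { kind: "same" | "removed" | "added"; word: string };

// Split on whitespace so that each word is one diff unit.
function tokenize(text: string): string[] {
  return text.split(/\s+/).filter((w) => w.length > 0);
}

function diffWords(oldText: string, newText: string): Op[] {
  const a = tokenize(oldText);
  const b = tokenize(newText);

  // lcs[i][j] = length of the LCS of a[i..] and b[j..]
  const lcs: number[][] = Array.from({ length: a.length + 1 }, () =>
    new Array<number>(b.length + 1).fill(0)
  );
  for (let i = a.length - 1; i >= 0; i--) {
    for (let j = b.length - 1; j >= 0; j--) {
      lcs[i][j] =
        a[i] === b[j]
          ? lcs[i + 1][j + 1] + 1
          : Math.max(lcs[i + 1][j], lcs[i][j + 1]);
    }
  }

  // Walk the table from the start, emitting one operation per word.
  const ops: Op[] = [];
  let i = 0;
  let j = 0;
  while (i < a.length && j < b.length) {
    if (a[i] === b[j]) {
      ops.push({ kind: "same", word: a[i] });
      i++;
      j++;
    } else if (lcs[i + 1][j] >= lcs[i][j + 1]) {
      ops.push({ kind: "removed", word: a[i] });
      i++;
    } else {
      ops.push({ kind: "added", word: b[j] });
      j++;
    }
  }
  while (i < a.length) ops.push({ kind: "removed", word: a[i++] });
  while (j < b.length) ops.push({ kind: "added", word: b[j++] });
  return ops;
}

// Render inline, track-changes style: [-removed-] and {+added+}.
function renderInline(ops: Op[]): string {
  return ops
    .map((op) =>
      op.kind === "removed"
        ? `[-${op.word}-]`
        : op.kind === "added"
          ? `{+${op.word}+}`
          : op.word
    )
    .join(" ");
}

const before = "Police clashed with protesters on Monday";
const after = "Police dispersed protesters on Monday";
console.log(renderInline(diffWords(before, after)));
// prints "Police [-clashed-] [-with-] {+dispersed+} protesters on Monday"
```

A real deployment would more likely call a maintained diff library and render the result as styled HTML rather than bracket markers. The sketch also discards the original whitespace, which is fine for illustration but loses paragraph breaks.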

Standing on the shoulders of

This project draws inspiration from three pioneering news diff trackers:

  • NewsDiffs (2012) — Built at a Knight-Mozilla hackathon by Eric Price, Greg Price, and Jennifer 8. Lee. The original vision: full article body diffing with a web UI. It tracked NYT, CNN, BBC, Politico, and the Washington Post. The code is Python 2 and hasn't run in years, but the idea was right.
  • diffengine (~2017) — Ed Summers took a different approach: monitor any RSS feed, use Mozilla's Readability to extract content automatically (no per-site parsers), and submit every version to the Internet Archive. The key insight: per-site HTML parsing is a losing game.
  • NYTdiff (~2020) — Focused on the NYT's metadata (headlines, abstracts, kickers) and pioneered the social-first approach: generate visual diff images and post them as threaded replies on social media, building a chronological edit history for each article.

All three projects are now effectively abandoned — broken by API changes, platform shifts, and the steady rot of unmaintained dependencies. NewsDiff carries their ideas forward with modern technology.

The open newsroom

There's a tension at the heart of online news. Digital publishing means articles can be improved — errors corrected, context added, clarity sharpened. That's genuinely good. But it also means the record can be quietly altered, and unless someone is watching, no one notices.

Transparency doesn't mean suspicion. Most edits are routine. But the ability to see what changed, and when, is a small piece of accountability infrastructure that should exist. Not as a gotcha, but as a record.

Consider it a glass newsroom.

Credits

NewsDiff is open source software built on the work of others.

Inspiration
  • NewsDiffs by Eric Price, Greg Price & Jennifer 8. Lee
  • diffengine by Ed Summers
  • NYTdiff by Jason Norwood-Young

Core technologies
  • SvelteKit — web framework
  • Drizzle ORM — database
  • BullMQ — job queue
  • @mozilla/readability — content extraction
  • jsdiff — diff computation

Federation
  • Botkit by Fedify — ActivityPub bot framework
  • @atproto/api — Bluesky integration

Image generation
  • Satori by Vercel — HTML to SVG
  • sharp — image processing

Hosting
  • Cloudron — self-hosted app platform