Software Heritage Co-Founder & Former Debian Leader | Stefano Zacchiroli

Tech Over Tea | 1h 15m | May 8, 2026

AI-Generated Summary

In this episode of Tech Over Tea, host Brody Robertson interviews Stefano Zacchiroli, co-founder of Software Heritage and former Debian Project Leader. The conversation dives deep into the mission and mechanics of Software Heritage, a non-profit initiative dedicated to collecting, preserving, and sharing all publicly available software source code for future generations. Stefano explains that the archive goes beyond simple storage by preserving full version control histories (Git, Subversion, and more), creating a global, immutable record of software evolution. He emphasizes the cultural, historical, and scientific value of source code, citing examples like Apollo mission code and Quake’s fast inverse square root algorithm. The discussion also covers the technical and ethical challenges of archiving, including AI scraping, platform centralization risks, and the importance of integrity through the standardized SWHID identifier. Stefano highlights the project’s growth from a two-person research effort to a 20+ person team with diverse funding from public institutions and major tech companies like IBM, Microsoft, and AWS. He also touches on the broader implications of digital sovereignty, open science, and the need for research infrastructure to analyze the archive at scale. The episode concludes with a call to action for developers and enthusiasts to contribute through donations, code contributions, advocacy, or requesting specific repositories be archived.
Key takeaways include: (1) Software Heritage preserves not just code, but the full development history, enabling future reconstruction and verification; (2) The SWHID standard provides cryptographic integrity guarantees, critical for scientific reproducibility and security; (3) Centralized platforms like GitHub are fragile—archiving is essential to prevent digital obsolescence; (4) The archive enables large-scale research on open source diversity, language evolution, and community dynamics; (5) Anyone can help by donating, contributing code, promoting the project, or requesting specific repositories be archived. The overall tone is optimistic and mission-driven, underscoring the importance of collective stewardship of digital heritage.
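
The integrity guarantee described in the summary can be illustrated with a minimal Python sketch. It assumes the simplest case, a single file, whose SWHID has the form swh:1:cnt:<hash> and uses the same hash as a Git blob. The function names swhid_for_content and is_unmodified are hypothetical helpers for illustration, not part of any Software Heritage library, and identifiers for directories, revisions, and other object types are not covered here.

```python
import hashlib


def swhid_for_content(data: bytes) -> str:
    """Return the SWHID of a file's raw bytes (a "content" object).

    Hypothetical helper: content identifiers reuse Git's blob hashing,
    i.e. SHA-1 over the header "blob <length>\\0" followed by the bytes.
    """
    header = b"blob %d\x00" % len(data)
    digest = hashlib.sha1(header + data).hexdigest()
    return f"swh:1:cnt:{digest}"


def is_unmodified(recorded_swhid: str, retrieved: bytes) -> bool:
    """Recompute the identifier of retrieved bytes and compare it to the recorded one."""
    return swhid_for_content(retrieved) == recorded_swhid


# Record the identifier at archival time...
original = b"hello world\n"
recorded = swhid_for_content(original)
print(recorded)

# ...and check it again when the file is retrieved later.
print(is_unmodified(recorded, original))            # identical bytes
print(is_unmodified(recorded, b"hello world!\n"))   # altered bytes
```

Because the identifier is derived only from the bytes themselves, anyone can recompute it independently; a mismatch between the recorded and recomputed values signals that the content has changed.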

Key Takeaways
1. Software Heritage preserves the full history of software development, not just code, enabling future reconstruction and verification.
2. The SWHID identifier provides cryptographic integrity, ensuring archived software hasn't been altered over time.
3. Centralized platforms like GitHub are fragile; archiving is essential to prevent digital obsolescence and loss of historical context.
4. The archive enables large-scale research on open source diversity, language evolution, and community dynamics.
5. Anyone can help by donating, contributing code, advocating, or requesting specific repositories be archived.

Chapters
0:00 (2 min): Introduction to Stefano Zacchiroli and Software Heritage

Brody introduces Stefano Zacchiroli, co-founder of Software Heritage and former Debian Project Leader, and sets the stage for a deep dive into the mission and impact of the organization.

2:00 (3 min): What Is Software Heritage and Why It Matters [Highlight]

"There is value in software source code. There is cultural value. There is technical value. There is a lot of effort that went into creating that software, and we don't want that value to be lost for future generations."

5:00 (5 min): Preserving Development History and the SWHID Standard [Highlight]

"The idea is that when you archive something in Software Heritage, you have received a different granularity... and later on, 10 years from now, you will retrieve the same thing... and verify that they match; if they don't match, something has been modified."

10:00 (5 min): The Scale, Infrastructure, and Sustainability of the Archive

Stefano details the archive’s current size (2 petabytes), infrastructure (on-premise servers, cloud mirrors, and independent mirrors), and funding model, emphasizing its non-profit, diversified, and sustainable approach.

15:00 (5 min): Why Archive Software? Historical, Scientific, and Practical Value [Highlight]

"If we archive scientific papers because it's useful for the historical record and for the scientific record, then we should archive those pieces of source code."

High-Impact Quotes
We are not just archiving software—we are archiving the collective intelligence of humanity’s digital evolution.
Stefano Zacchiroli | 123:20
Viral: 92.0
Digital objects last forever or five years, whichever occurs first.
XKCD (referenced by Stefano) | 58:04
Viral: 90.0
The idea is that when you archive something in Software Heritage, you have received a different granularity... and later on, 10 years from now, you will retrieve the same thing... and verify that they match; if they don't match, something has been modified.
Stefano Zacchiroli | 5:30
Viral: 88.0
Speakers

Host: Brody Robertson
Guest: Stefano Zacchiroli

Topics Discussed
software heritage: 98%
software preservation: 95%
version control history: 92%
open source software: 90%
scientific reproducibility: 88%
open science: 87%
digital sovereignty: 85%
digital obsolescence: 83%

People & Brands

Stefano Zacchiroli | person | 120x | Positive
Software Heritage | organization | 85x | Positive
GitHub | organization | 50x | Neutral
Git | other | 40x | Positive
Debian | organization | 35x | Positive
SWHID | other | 25x | Positive
Internet Archive | organization | 20x | Positive
INRIA | organization | 15x | Positive
Roberto Di Cosmo | person | 15x | Positive
UNESCO | organization | 10x | Positive
