Go to article URL

The schedule for the Deep Dive: Data Governance virtual conference is now live. Taking place October 1–3, 2025 this premier three-day event will bring together industry leaders and world-class experts to explore the latest advancements in Data Governance and Open Source AI.

From a strong pool of 50 proposals, we’ve curated 12 standout sessions across three key themes:

Explore the preliminary schedule below or access it via our mobile website:

Time (EDT, UTC -4)SessionSpeaker
October 1stStewards of the data commons
12:00 PMOpening Keynote: Data is the key to Open Source AIStefano Maffulli
12:15 PMA data pathway to building public AIAlek Tarkowski
1:00 PMGovernments as data providers for AINeil Majithia
1:45 PMCopycats and the Commons: Governing Open Data for Trustworthy AINatalia-Rozalia
2:30 PMSovereign by Design: A Blueprint for Federated, Consent-Based AI SystemsSal Kimmich
3:15 PMWrap-Up + Live Q&ANick Vidal
October 2ndFrameworks for data governance
12:00 PMKeynote: Trends and Insights of China Open Source Ecosystem in AI EraNadia Jiang, Emily Chen
12:15 PMNew licensing initiatives for AI training dataRamya Chandrasekhar
1:00:00 PMHow Data Provenance Powers Trustworthy AILisa Bobbitt
1:45 PMThe CLeAR Documentation Framework for AI TransparencyKasia Chmielinski
2:30 PMBias Transparency in Human-AI Systems: Open Data Governance Frameworks for AIEDChaeyeon Lim
3:15 PMWrap-Up + Live Q&ANick Vidal
October 3rdBuilding and preserving public datasets
12:00 PMKeynote: What should open source AI aspire to be?Stefan Baack, Kasia Odrozek
12:15 PMBuilding Public Data for LLMsStella Biderman
1:00 PMA new paradigm for publishing library collections: Institutional Books 1.0, a 242B token datasetGreg Leppert, Matteo Cargnelutti, Catherine Brobston
1:45 PMBeyond Extraction: Building Community-Centered Speech DataJessica Rose
2:30 PMSaving What’s Ours: The Data Rescue Project and the Fight for Public DataLynda Kellam, Mikala Narlock
3:15 PMLive Q&A + Closing RemarksStefano Maffulli

The Deep Dive: Data Governance conference builds on the momentum of past events organized by the OSI, including the Deep Dive: AI webinars held in 2023, the Data in Open Source AI workshop held in 2024, and the early-2025 white paper “Data Governance in Open Source AI: enabling responsible and systematic access.”

Data governance and Open Source AI are evolving rapidly, and this event is your opportunity to stay at the forefront. OSI’s Deep Dive brings together leading experts to share practical insights, emerging trends, and proven strategies that organizations of all sizes can apply. Registration is free and we invite you to join us.

We would like to thank the authors who have submitted their proposals and the Program Committee: Alek Tarkowski (Open Future), Anna Tumadóttir (Creative Commons), Carlo Piana (Open Source Initiative), Julie Hunter (Linagora), Masayuki Hatta (Surugadai University), Maximilian Gahntz (Mozilla Foundation), Nick Vidal (Open Source Initiative), Ramya Chandrasekhar (CNRS – Centre national de la recherche scientifique), Stefano Maffulli (Open Source Initiative), Shane Coughlan (OpenChain), and Malcolm Bain (Across Legal).

blog.opensource.org/feed/
internet-npo | reporting