Members Newsletter – June 2024

In our efforts to establish a stable Open Source AI Definition, one of the most challenging aspects has been data and its availability. There are some people arguing that the original datasets used for training must be made available, with very stringent conditions that allow to download the precise, exact set used for the original training. There is another group that is agreeing with the definition of “Data information” which doesn’t require the full dataset, because that’s burdensome and unnecessary (as voted by the working groups in March.) 

The debate on the forum seems to be stalling, although some are starting to understand that legal ramifications of distributing large amounts of data are deep and uncertain across legislations.

Meanwhile, the validation phase revealed that the systems we expected to be Open Source AI actually are (OLMo and Pythia), while the systems that we expected to fail are failing (LLama, Falcon, Grok, Mistral.) The draft Definition seems to be working as expected. We’d love to see more systems evaluated: Respond to this thread to volunteer.

We want to hear your informed thoughts on the data topic. Follow this link, read carefully the past messages and lend your voice to the conversation!

Stefano Maffulli
Executive Director, OSI 

I hold weekly office hours on Fridays with OSI members: book time if you want to chat about OSI’s activities, if you want to volunteer or have suggestions.

News from the OSI

Contributions of Open Source to AI: a panel discussion at CPDP-ai conference

From the Outreach and Advocacy program

Stefano Maffulli discussed the challenges of Open Source AI when it comes to data, hardware, big tech companies and government regulations as a panelist at the CPDP-ai conference in Brussels. Read more.

OSI at PyCon: engaging with AI practitioners and developers as we reach OSAID’s first release candidate

From the Outreach and Advocacy program

As part of the Open Source AI Definition roadshow and as we approach the first release candidate of the draft, the Open Source Initiative (OSI) participated at PyCon US 2024, the annual gathering of the Python community. Read more.

Exploring openness in AI: Insights from the Columbia Convening

From the Outreach and Advocacy program

A framework to discuss openness and AI published by Columbia Institute of Global Politics and Mozilla, in collaboration with OSI and leading AI scholars and practitioners. Read more.

Practical Open Source 2024

If you run a business producing Open Source products or your company’s revenue depends on Open Source in any way, we want to hear your insights! Submit your proposal.

Job opening at the OSI

As a result of the European Digital Agenda, a wave of policy and legislation proposals affecting Open Source has arisen. The OSI seeks an experienced Policy Analyst to guide the positions taken and the delivery of education to legislators, their staff and the wider public. Submit your CV.

OSI in the news

Openwashing: An accusation against some A.I. companies that they are using the “open source” label too loosely.

OSI at The New York Times

Efforts to create a clearer definition for open source A.I. are underway. Researchers at the Linux Foundation in March published a framework that places open source A.I. models into various categories. And the Open Source Initiative, another nonprofit, is trying to draft a definition. Read more.

Open Source AI: OSI Wrestles With a Definition

OSI at The New Stack

At PyCon US, Open Source Initiative enlisted help in hammering out a FAQ for its open source AI definition. Some very sticky sticking points remain. Read more.

Open Source Initiative tries to define Open Source AI

OSI at The Register

The Open Source Initiative – the non-profit overseeing the Open Source Definition, which lays out the requirements for software licenses – is taking its effort to define Open Source AI to the wisdom of the crowds. Read more.

The debate over ‘open source AI’ has reached boiling point — this new OSI initiative looks to set the record straight

OSI at ITPro

With the debate over what constitutes ‘open source AI’ still raging, the OSI looks to create a clear-cut definition through a new initiative. Read more.

Open Source Initiative is close to coming up with a definition for Open Source AI

OSI at SD Times

The Open Source Initiative (OSI) is on a mission to define “Open Source AI.” It wants to be able to provide a framework that can be used to determine if an AI system is open source or not, as there is not currently a framework for doing so. Read more.

OSI affiliates in the news

Nerdearla opens up their Call For Papers

Sysarmy at Nerdearla

Sysarmy is celebrating the 10th anniversary of Nerdearla, one of Latin America’s biggest tech conferences. In 2023, it attracted over 10,000 in-person and 30,000 online attendees. The CFP offers both virtual and in-person speaking slots. Submit your proposal today! Read more.

AI training data has a price tag that only Big Tech can afford

EleutherAI at TechCrunch

If there’s a ray of sunshine through the gloom, it’s the few independent, not-for-profit efforts to create massive datasets anyone can use to train a generative AI model. EleutherAI, a grassroots nonprofit research group that began as a loose-knit Discord collective in 2020, is working with the University of Toronto, AI2 and independent researchers to create The Pile v2. Read more.

How Kubernetes succeeded

CNCF at InfoWorld

Kubernetes started as one of many tools for container orchestration. Ten years later it’s the leading platform for cloud-native applications. Read more.

Upcoming events

OW2con 2024

June 11-12, 2024 – Paris, France

OpenExpo Europe

June 13, 2024 – Madrid, Spain

Open Source AI Definition Town Hall

June 14, 2024 – Online

OSPO4Good at the United Nations

July 9-10, 2024 – New York City, New York

Thanks to our sponsors

New members and renewals

  • Sloan Foundation
  • Cisco
  • SAS
  • Intel

Interested in sponsoring, or partnering with, the OSI? Please see our Sponsorship Prospectus and our Annual Report. We also have a dedicated prospectus for the Deep Dive: Defining Open Source AI. Please contact the OSI to find out more about how your company can promote open source development, communities and software