Foundation Models

Open Foundation Models and Datasets

Enabling an ecosystem of open foundation models, including those with multilingual and multi-modal capabilities, and open datasets.

We are responsibly enhancing the ecosystem of open foundation models and datasets. We are embracing multilingual and multimodal models, as well as science models tackling broad societal issues like climate change and education.

To aid AI model builders and application developers, we’re collaborating to develop and promote open-source tools for model training, tuning, and inference. We are also launching programs to foster the open development of AI in safe and beneficial ways, and hosting events to explore AI use cases.

Without good datasets, model training and tuning would be impossible. We are promoting the development of open datasets with clear governance and provenance controls so they can be used without concerns for legal and other risks.

Projects

Time Series Data and Model Initiative

Time-series applications are an important target for AI. In addition to gathering high-quality and fully-governed time series datasets as part of the Open Trusted Data Initiative, Alliance members are collaborating on new and improved time series models (as part of the Industry Open FMs Initiative and benchmarks, both general-purpose and application-specific.

Please join us. We need time series and domain experts, including especially subject matter experts and use case and product owners who would like to apply emerging time series foundation models to new applications. There is an acute shortage of good, open datasets for time series and data specially benchmarks and evaluation methods for various use cases. Contributions are especially welcome here.

More details are coming soon. If you are interested in participating, use our contact form to let us know of your interest.

Open Trusted Data Initiative

Time Series Data and Model Initiative

A current challenge in AI is the “murky” provenance of many datasets used for training and tuning large language models (LLMs), which raises concerns for model developers and users of the potential for models to output private, confidential, and copyrighted information that might have been part of the training dataset, among other concerns.

OTDI aims to address these concerns with an industry wide effort to gather and process data fully in the open, allowing model developers and users to have full confidence in the provenance and governance of the data they use.

Industry Open FMs Initiative

Time Series Data and Model Initiative

Industry Open FMs Initiative

We have seen rapid progress in building and releasing highly-capable and open foundation models for general language, coding, scientific discovery, and multi-modal scenarios.

A key development in model strategies is a focus on building smaller, more specialized models.

More details are coming soon, but we would love for you to join us. We need both model-building and domain experts.

Foundation Models

Open Foundation Models and Datasets

Projects

Time Series Data and Model Initiative

Time Series Data and Model Initiative

Time Series Data and Model Initiative

Open Trusted Data Initiative

Time Series Data and Model Initiative

Time Series Data and Model Initiative

Industry Open FMs Initiative

Time Series Data and Model Initiative

Industry Open FMs Initiative

This website uses cookies.