Last week, Microsoft researchers introduced an innovative artificial intelligence (AI) model designed to create new inorganic materials with specific properties. Named MatterGen, this open-source large language model (LLM) is accessible to the public. The team has also released a paper indicating that the technology has the potential to expedite the development of new energy sources, semiconductors, and carbon capture materials. MatterGen employs a diffusion-based architecture, which also underpins well-known AI systems such as OpenAI’s Dall-E and Stability AI’s Stable Image Ultra.
Microsoft Launches MatterGen AI Model
While generative AI is often linked to the creation of text, imagery, audio, and video via user prompts, MatterGen distinguishes itself from traditional large language models, as reflected in a blog post shared last week. This model can interpret user requirements and generate a diverse array of inorganic material designs.
At present, material design involves a labor-intensive process wherein scientists rely on their expertise and intuition to create improved materials. A recent example of such innovation includes the application of lithium carbide batteries in smartphones, which enhances capacity while minimizing space. Nevertheless, a notable limitation of human-designed materials is the lengthy experimentation and creation period involved.
Contrastingly, MatterGen swiftly generates crystalline structures using various elements from the periodic table. The model operates at the atomic level, permitting the refinement of atom types, coordinates, and periodic lattice structures. It not only produces material designs rapidly but also conducts simulation-based experiments to evaluate the viability, efficiency, and durability of each design.
The researchers explained that the AI model’s diffusion architecture is typically found in image and video generation models, providing enhanced spatial and geometric comprehension of shapes and structures.
A study published in the journal Nature details that MatterGen’s base model was trained using a substantial dataset exceeding 600,000 stable inorganic crystal structures, sourced from the Materials Project and Alexandria databases. Following this, adapter modules were integrated to facilitate fine-tuning for specific attributes such as chemical composition or magnetic density.
Currently, the source code for MatterGen is available for download and further development via a GitHub repository. The model is distributed under an MIT license, allowing for both academic and commercial applications. The researchers are optimistic that this AI model could play a pivotal role in accelerating the discovery of new materials across various fields.