The Allen Institute for AI (AI2) has announced the event of a groundbreaking open language model called AI2 OLMo (Open Language Model). OLMo can be a state-of-the-art generative language model with a scale of 70 billion parameters, comparable to other large language models. The Project is predicted to finish by 2024. It goals to offer the research community with access to all elements of model creation, fostering collaboration and advancing the science of language models.
AI2 is partnering with leading technology corporations, including AMD and CSC, to develop OLMo. The collaboration involves utilizing the GPU capabilities of the AMD-powered LUMI pre-exascale supercomputer, known for its energy efficiency. By leveraging the ability of this eco-friendly supercomputer, AI2 goals to create a novel and open language model that may allow researchers to work directly on language models for the primary time.
A key aspect of OLMo is its openness and accessibility to the research community. AI2 plans to make all elements of the Project openly available, including data, code, training curves, evaluation benchmarks, and ethical considerations surrounding the model’s development. By providing complete transparency, AI2 intends to empower researchers to construct upon and enhance OLMo, enabling faster and safer progress in the sector. The goal is to develop one of the best open language model globally collaboratively.
The AI2 team ensures that OLMo becomes a genuinely open model that gives unique value to the AI research community. Every component created for OLMo, including training data, code, model weights, intermediate checkpoints, and ablations, can be openly available, well-documented, and reproducible, with few exceptions and suitable licensing. The discharge strategy for the model and its artifacts is currently being developed. Moreover, AI2 plans to create a demo and release interaction data from consenting users.
In parallel with the model’s development, AI2 will make decisions to maximise the model’s usability and efficiency without compromising performance. The goal is to make OLMo accessible to a big selection of AI researchers, fostering diversity of perspectives and accelerating improvements in language model development. AI2 also intends to create and release a meticulously studied and documented model training dataset, encompassing pre-training data, instruction data, and human interaction data.
Recognizing the importance of ethical considerations, AI2 takes a realistic approach to ethics and openness throughout the OLMo project. The team will document the choices, concerns, and trade-offs regarding the moral and societal impacts of making and releasing the OLMo model. AI2 promotes AI knowledge and understanding by sharing progress, challenges, and discoveries. Legal experts, each internal and external, are actively involved within the model-building process to evaluate privacy and mental property rights issues at multiple checkpoints.
AI2 has partnered with organizations akin to Surge AI and MosaicML to collaborate on data and training code for OLMo. An ethics review committee comprising internal and external advisors has been established to offer feedback through the Project. The OLMo model and API will function beneficial resources for the broader community, enabling higher understanding and engagement within the generative AI revolution. AI2 welcomes support and partnerships from organizations aligned with their values of AI for traditional, reasonable and responsible, useful AI technologies.
Try the Reference Article. Don’t forget to affix our 21k+ ML SubReddit, Discord Channel, and Email Newsletter, where we share the newest AI research news, cool AI projects, and more. If you could have any questions regarding the above article or if we missed anything, be at liberty to email us at Asif@marktechpost.com
🚀 Check Out 100’s AI Tools in AI Tools Club
Niharika
” data-medium-file=”https://www.marktechpost.com/wp-content/uploads/2023/01/1674480782181-Niharika-Singh-264×300.jpg” data-large-file=”https://www.marktechpost.com/wp-content/uploads/2023/01/1674480782181-Niharika-Singh-902×1024.jpg”>
Niharika is a Technical consulting intern at Marktechpost. She is a 3rd yr undergraduate, currently pursuing her B.Tech from Indian Institute of Technology(IIT), Kharagpur. She is a highly enthusiastic individual with a keen interest in Machine learning, Data science and AI and an avid reader of the newest developments in these fields.