AI2 is developing a large language model optimized for science

Kyle Wiggers

Updated 11 May 2023 at 11:00 am·4-min read

PaLM 2. GPT-4. The list of text-generating AI practically grows by the day.

Most of these models are walled behind APIs, making it impossible for researchers to see exactly what makes them tick. But increasingly, community efforts are yielding open source AI that's as sophisticated, if not more so, than their commercial counterparts.

The latest of these efforts is the Open Language Model, a large language model set to be released by the nonprofit Allen Institute for AI Research (AI2) sometime in 2024. Open Language Model, or OLMo for short, is being developed in collaboration with AMD and the Large Unified Modern Infrastructure consortium, which provides supercomputing power for training and education, as well as Surge AI and MosaicML (which are providing data and training code).

"The research and technology communities need access to open language models to advance this science," Hanna Hajishirzi, the senior director of NLP research at AI2, told TechCrunch in an email interview. "With OLMo, we are working to close the gap between public and private research capabilities and knowledge by building a competitive language model."

One might wonder -- including this reporter -- why AI2 felt the need to develop an open language model when there's already several to choose from (see Bloom, Meta's LLaMA, etc.). The way Hajishirzi sees it, while the open source releases to date have been valuable and even boundary-pushing, they've missed the mark in various ways.

AI2 sees OLMo as a platform, not just a model -- one that'll allow the research community to take each component AI2 creates and either use it themselves or seek to improve it. Everything AI2 makes for OLMo will be openly available, Hajishirzi says, including a public demo, training dataset and API, and documented with "very limited" exceptions under "suitable" licensing.

"We’re building OLMo to create greater access for the AI research community to work directly on language models," Hajishirzi said. "We believe the broad availability of all aspects of OLMo will enable the research community to take what we are creating and work to improve it. Our ultimate goal is to collaboratively build the best open language model in the world."

OLMo's other differentiator, according to Noah Smith, senior director of NLP research at AI2, is a focus on enabling the model to better leverage and understand textbooks and academic papers as opposed to, say, code. There's been other attempts at this, like Meta's infamous Galactica model. But Hajishirzi believes that AI2's work in academia and the tools it's developed for research, like Semantic Scholar, will help make OLMo "uniquely suited" for scientific and academic applications.

"We believe OLMo has the potential to be something really special in the field, especially in a landscape where many are rushing to cash in on interest in generative AI models," Smith said. "AI2’s unique ability to act as third-party experts gives us an opportunity to work not only with our own world-class expertise but collaborate with the strongest minds in the industry. As a result, we think our rigorous, documented approach will set the stage for building the next generation of safe, effective AI technologies."

That's a nice sentiment, to be sure. But what about the thorny ethical and legal issues around training -- and releasing -- generative AI? The debates raging around the rights of content owners (among other affected stakeholders), and countless nagging issues, have yet to be settled in the courts.

To allay concerns, the OLMo team plans to work with AI2's legal department and to-be-determined outside experts, stopping at "checkpoints" in the model-building process to reassess privacy and intellectual property rights issues.

"We hope that through an open and transparent dialogue about the model and its intended use, we can better understand how to mitigate bias, toxicity, and shine a light on outstanding research questions within the community, ultimately resulting in one of the strongest models available," Smith said.

What about the potential for misuse? Models, which are often toxic and biased to begin with, are ripe for bad actors intent on spreading disinformation and generating malicious code.

Hajishirzi said that AI2 will use a combination of licensing, model design and selective access to the underlying components to "maximize the scientific benefits while reducing the risk of harmful use." To guide policy, OLMo has an ethics review committee with internal and external advisors (AI2 wouldn't say who, exactly) that'll provide feedback throughout the model creation process.

We'll see to what extent that makes a difference. For now, a lot's up in the air -- including most of the model's technical specs. (AI2 did reveal that it'll have around 70 billion parameters, parameters being the parts of the model learned from historical training data.) Training's set to begin on LUMI's supercomputer in Finland -- the fastest supercomputer in Europe, as of January -- in the coming months.

AI2 is inviting collaborators to help contribute to -- and critique -- the model development process. Those interested can contact the OLMo project organizers here.

HuffPost
‘I Approve This Message’: Kamala Harris Instantly Uses Trump’s Own Words Against Him
That didn’t take long.
Cosmo
Rosalía goes braless and *almost* frees the nip in a lace naked dress
Rosalía stepped out wearing a breathtaking naked dress at the Prelude to the Olympics in Paris. The design was a nude coloured see-through lace gown by Dior.
NY Daily News
Harris campaign roasts Trump as ‘old and quite weird’ after Fox News insults
Republican presidential candidate Donald Trump called in to Fox News Thursday, where he told supporters that presumptive Democratic nominee Kamala Harris is a “radical left, not very smart person” who’s part of a massive conspiracy to weaponize the nation’s legal system against him. Harris’ campaign fired back mere minutes later with an email blasting the “78-year-old convicted criminal’s Fox ...
SETHLUI.COM
Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon
The post Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon appeared first on SETHLUI.com.
Evening Standard
Arne Slot hails double new Liverpool addition with vital Premier League experience secured
New Reds boss hails latest Anfield arrivals as plans take shape
The Independent
Stranded Boeing astronauts are stuck on International Space Station, Nasa says in urgent update
The astronauts stranded on the International Space Station are still not able to come home, Nasa has said. Two astronauts went to the space station almost 50 days ago as part of a test of Boeing’s Starliner capsule. Test pilots Butch Wilmore and Suni Williams were supposed to visit the orbiting lab for about a week and return in mid-June, but thruster failures and helium leaks on Boeing‘s new Starliner capsule prompted Nasa and Boeing to keep them up longer.
Fortune
Want to get a job at Meta? It doesn’t matter what you study—as long as you can ‘do one thing really well,’ Mark Zuckerberg says
Meta CEO Mark Zuckerberg says what matters most in his hiring philosophy is people being able to do one thing really well.
People
Vanessa Williams, 61, Refuses to Get Botox, Fillers or a Facelift: ‘I Want to Look Like Myself’ (Exclusive)
The former beauty-queen-turned-Hollywood-star gets candid about what she has and hasn't done amid the aging process
The Telegraph
How Gerald Ford predicted Kamala Harris’s presidential run
Almost 35 years ago, Gerald Ford predicted that America would get its first female president only when a male incumbent could no longer continue.
The Telegraph
Manchester United staff ‘shocked, upset and angry’ as long-serving academy coaches face cull
Manchester United’s academy staff have been left “shocked”, “upset” and in some cases “angry” at the news that several respected, long-serving coaches could lose their jobs in the cost-cutting drive at Old Trafford.
Cosmo
JLo's plunging white swimsuit ticks off so many summer trends
Jennifer Lopez celebrated her 55th birthday wearing a Gooseberry Intimates plunging white one-piece. Shop her exact swimsuit plus more affordable look-a-likes.
The Independent
Police officer stood down after ‘truly shocking’ video shows man kicked in face at Manchester Airport
Hundreds of protesters chanted ‘shame on you’ at a protest at Manchester airport following the incident captured on camera
NextShark
Asian teen stomped on head during Bay Area basketball game
A police investigation is underway following a violent incident during a youth basketball game where a 13-year-old player stomped on an opponent's head, leading to a concussion. The game, held at the College of Alameda on Sunday, involved the Filipino American Tumakbo United team and Payton's Place team, both from the Bay Area. What happened: The now-viral video of the incident shows a scuffle over the ball, during which the Filipino boy falls to the ground before his 13-year-old opponent stomps on his head.
Evening Standard
Elderly woman was 'rammed with trolley' sparking Manchester airport police 'stamping' incident
Brothers confronted man who had argued with their mother on flight before pushing trolley into her, it is claimed
HuffPost
'How Dare You?': Whoopi Goldberg Drops Fiery Response To JD Vance's 'Childless' Dig
"The View" co-host went after Vance, who once likened Kamala Harris and Pete Buttigieg to "cat ladies."
SETHLUI.COM
Kiang Kiang Taiwan Teppanyaki: Ex-hotel chef from Taipei serves sizzling hotplate pasta with ribeye steak, basil pork & halibut
The post Kiang Kiang Taiwan Teppanyaki: Ex-hotel chef from Taipei serves sizzling hotplate pasta with ribeye steak, basil pork & halibut appeared first on SETHLUI.com.
INSIDER
Trump picking JD Vance was a 'really bad decision' and he would have been better off with Nikki Haley, ex-Trump official says
Anthony Scaramucci said Trump made a "really bad decision" choosing JD Vance as his VP, although Trump has said that Vance is "doing a fantastic job."
People
Was Trump Struck By a Bullet or Shrapnel? FBI Director Testifies There's 'Some Question' Around Injury
"There's some question about whether or not it's a bullet or shrapnel that hit his ear," FBI Director Christopher Wray said
Associated Press
China issues rare praise to Philippine president for his ban on Chinese online gambling operators
China issued a rare compliment to the administration of Philippine President Ferdinand Marcos Jr. Marcos accused some of venturing into crimes including financial scams, human trafficking, kidnappings, torture and murder. Relations between China and the Philippines under Marcos have been strained since he allowed an expanded U.S. military presence in the country under a 2014 defense pact and hostilities between their forces started to flare in the disputed South China Sea last year.
Hello!
Rita Ora just styled bedazzled latex lingerie with sheer tights
Rita Ora just made a case for latex lingerie while performing to 50 thousand people. See photos

AI2 is developing a large language model optimized for science

Latest stories

‘I Approve This Message’: Kamala Harris Instantly Uses Trump’s Own Words Against Him

Rosalía goes braless and almost frees the nip in a lace naked dress

Harris campaign roasts Trump as ‘old and quite weird’ after Fox News insults

Ru Yi Yuan: Rude, stingy & unhygenic auntie has hour-long queue for vegetarian bee hoon

Arne Slot hails double new Liverpool addition with vital Premier League experience secured

Stranded Boeing astronauts are stuck on International Space Station, Nasa says in urgent update

Want to get a job at Meta? It doesn’t matter what you study—as long as you can ‘do one thing really well,’ Mark Zuckerberg says

Vanessa Williams, 61, Refuses to Get Botox, Fillers or a Facelift: ‘I Want to Look Like Myself’ (Exclusive)

How Gerald Ford predicted Kamala Harris’s presidential run

Manchester United staff ‘shocked, upset and angry’ as long-serving academy coaches face cull

JLo's plunging white swimsuit ticks off so many summer trends

Police officer stood down after ‘truly shocking’ video shows man kicked in face at Manchester Airport

Asian teen stomped on head during Bay Area basketball game

Elderly woman was 'rammed with trolley' sparking Manchester airport police 'stamping' incident

'How Dare You?': Whoopi Goldberg Drops Fiery Response To JD Vance's 'Childless' Dig

Kiang Kiang Taiwan Teppanyaki: Ex-hotel chef from Taipei serves sizzling hotplate pasta with ribeye steak, basil pork & halibut

Trump picking JD Vance was a 'really bad decision' and he would have been better off with Nikki Haley, ex-Trump official says

Was Trump Struck By a Bullet or Shrapnel? FBI Director Testifies There's 'Some Question' Around Injury

China issues rare praise to Philippine president for his ban on Chinese online gambling operators

Rita Ora just styled bedazzled latex lingerie with sheer tights