Military Space News
Why tech firms are aiming for smaller, leaner AI models
By Daxia ROJAS
Paris (AFP) Dec 3, 2024
AI firms have long boasted about the enormous size and capabilities of their products, but they are increasingly looking at leaner, smaller models that they say will save on energy and cost.

Programs like ChatGPT are underpinned by algorithms known as "large language models", and the chatbot's creator bragged last year that its GPT-4 model had nearly two trillion "parameters" -- the building blocks of the models.

The vast size of GPT-4 allows ChatGPT to handle queries about anything from astrophysics to zoology.

But if a company needs a program with knowledge only of, say, tigers, the algorithm can be much smaller.

"You don't need to know the terms of the Treaty of Versailles to answer a question about a particular element of engineering," said Laurent Felix of Ekimetrics, a firm that advises companies on AI and sustainability.

Google, Microsoft, Meta and OpenAI have all started offering smaller models.

Amazon, too, offers models of all sizes on its cloud platform.

Kara Hurst, Amazon's chief sustainability officer, said at a recent event in Paris that the trend showed the tech industry was moving towards "sobriety and frugality".

- Energy needs -

Smaller models are better for simple tasks like summarising and indexing documents or searching an internal database.

US pharmaceutical company Merck, for example, is developing a model with Boston Consulting Group (BCG) to understand the impact of certain diseases on genes.

"It will be a very small model, between a few hundred million and a few billion parameters," said Nicolas de Bellefonds, head of AI at BCG.

Laurent Daudet, head of French AI startup LightOn, which specialises in smaller models, said they had several advantages over their larger siblings.

They were often faster and able to "respond to more queries and more users simultaneously", he said.

He also pointed out that they were less energy hungry -- the potential climate impact being one of the major concerns over AI.

Huge arrays of servers are needed to "train" the AI programs and then to process queries.

These servers -- made up of highly advanced chips -- require vast amounts of electricity both to fuel their operation and to cool them down.

Daudet explained that the smaller models needed far fewer chips, making them cheaper and more energy efficient.

- Multi-model future -

Other proponents point out that smaller models can run without data centres altogether by being installed directly on devices.

"This is one of the ways to reduce the carbon footprint of our models," Arthur Mensch, head of French start-up Mistral AI, told the Liberation newspaper in October.

Laurent Felix pointed out that direct use on a device also meant more "security and confidentiality of data".

The programs could potentially be trained on proprietary data without fear of it being compromised.

The larger programs, though, still have the edge for solving complex problems and accessing wide ranges of data.

De Bellefonds said the future was likely to involve both kinds of models talking to each other.

"There will be a small model that will understand the question and send this information to several models of different sizes depending on the complexity of the question," he said.

"Otherwise, we will have solutions that are either too expensive, too slow, or both."

