Meta presents an AI model that can handle challenging arithmetic issues
By MYBRANDBOOK
Meta Platforms has announced its latest Llama 3 AI models, including a major release with 405 billion parameters. This model approaches the performance of competitors like GPT-4 and Claude 3.5 Sonnet in multilingual communication and difficult math problem solving. To further improve its overall functionality and accuracy, the Llama 3 also has a larger context window and improved coding capabilities. The new Llama 3 model can converse in eight languages, write higher-quality computer code and solve more complex math problems than previous versions.
With 405 billion parameters, or variables that the algorithm takes into account to generate responses to user queries, it dwarfs the previous version released last year though is still smaller than leading models offered by competitors.
OpenAI's GPT-4 model, by contrast, is reported to have one trillion parameters and Amazon is preparing a model with 2 trillion parameters.
Promoting Llama 3 across multiple channels, Chief Executive Mark Zuckerberg said he expected future Llama models would overtake proprietary competitors by next year. The Meta AI chatbot powered by those models was on track to become the most popular AI assistant by the end of this year, with hundreds of millions of people using it already.
In addition to its flagship 405 billion parameter model, Meta is also releasing updated versions of its lighter-weight 8 billion and 70 billion parameter Llama 3 models initially introduced in the spring, the company said.
All three new models are multilingual and can handle larger user requests via an expanded "context window," which Meta's head of generative AI, Ahmad Al-Dahle, said would improve the experience of generating computer code in particular.
Meta releases its Llama models largely free-of-charge for use by developers, a strategy Zuckerberg says will pay off in the form of innovative products, less dependence on would-be competitors and greater engagement on the company's core social networks. Some investors have raised their eyebrows at the costs entailed, however.
The company also stands to benefit if developers opt to use its free models over paid ones, which would undercut the business models of its rivals. With its announcement, Meta touted gains on key math and knowledge tests that may make that prospect more appealing.
Although measuring progress on AI development is notoriously difficult, test results provided by Meta appeared to suggest that its largest Llama 3 model was nearly matching and in some cases besting Anthropic's Claude 3.5 Sonnet and OpenAI's GPT-4o, which are widely regarded as the two most powerful frontier models on the market.
On the MATH benchmark of competition level math word problems, for example, Meta's model posted a score of 73.8, compared to GPT-4o's 76.6 and Claude 3.5 Sonnet's 71.1.
The model scored 88.6 on MMLU, a benchmark that covers dozens of subjects across math, science and the humanities, while GPT-4o scored 88.7 and Claude 3.5 Sonnet scored 88.3.
In their paper, Meta researchers also teased upcoming "multimodal" versions of the models due out later this year that layer image, video and speech capabilities on top of the core Llama 3 text model.
Early experiments indicate those models can perform "competitively" with other multimodal models such as Google's Gemini 1.5 and Anthropic's Claude 3.5 Sonnet, they said.
Nazara and ONDC set to transform in-game monetization with ‘
Nazara Technologies has teamed up with the Open Network for Digital Comme...
Jio Platforms and NICSI to offer cloud services to government
In a collaborative initiative, the National Informatics Centre Services In...
BSNL awards ₹5,000 Cr Project to RVNL-Led Consortium
A syndicate led by Rail Vikas Nigam Limited (abbreviated as RVNL), along wi...
Pinterest tracks users without consent, alleges complaint
A recent complaint alleges that Pinterest, the popular image-sharing platf...
VVDN TECHNOLOGIES
INFOSYS TECHNOLOGIES PVT. LTD.
LUMINOUS POWER TECHNOLOGIES PVT. LTD.
FIRE BOLTT
Icons Of India : PRATIVA MOHAPATRA
Prativa is a transformational leader with an incredible breadth of exp...
Icons Of India : ASHISH KUMAR CHAUHAN
Ashish kumar Chauhan, an Indian business executive and administrator, ...
Icons Of India : CP Gurnani
Former Managing Director and CEO of the well-known IT service company ...
BSE - Bombay Stock Exchange
The Bombay Stock Exchange (BSE) is one of India’s largest and oldest...
IREDA - Indian Renewable Energy Development Agency Limited
IREDA is a specialized financial institution in India that facilitates...
EESL - Energy Efficiency Services Limited
EESL is uniquely positioned in India’s energy sector to address ener...
Indian Tech Talent Excelling The Tech World - Thomas Kurian, CEO- Google Cloud
Thomas Kurian, the CEO of Google Cloud, has been instrumental in expan...
Indian Tech Talent Excelling The Tech World - AJAY BANGA, President - World Bank
Ajay Banga is an Indian-born American business executive who currently...
Indian Tech Talent Excelling The Tech World - REVATHI ADVAITHI, CEO- Flex
Revathi Advaithi, the CEO of Flex, is a dynamic leader driving growth ...