Tuesday, October 21, 2025
HomeNewMeta launches Llama 3.2 open-source model with image processing

Meta launches Llama 3.2 open-source model with image processing

Published on

spot_img


FILE PHOTO: Meta has released its first open-source model with both image and text processing abilities, two months after the release of its last big AI model.
| Photo Credit: Reuters

Meta has released its first open-source model with both image and text processing abilities, two months after the release of its last big AI model. The Llama 3.2 model includes both small and medium-sized variants at 11-billion and 90-billion parameters as well as more lightweight text-only models at 1-billion and 3-billion parameters that fit into select mobile and edge devices. 

The models will help developers create more advanced AI applications like AR apps with a real-time understanding of video, visual search engines that distribute images based on content, or document analysis tools that can summarise large portions of text.

Out of the different Llama 3.2 variants, the 11-billion parameter one and the 90-billion parameter one are vision models and can understand charts and graphs, caption images and locate objects from natural language prompts. The bigger model can also pinpoint details from images to create captions. 

While the lightweight models are text-only meant to work on phones with Qualcomm, MediaTek, and other Arm hardware. These are meant for summarising recent messages, sending calendar invites for meetings, and for developers to build personalised agentic apps on them. 

Although Meta is playing catch-up with multimodal models from rival AI companies, the Llama 3.2 is comparable with Anthropic’s Claude 3 Haiku and OpenAI’s GPT4o-mini at image recognition and visual understanding tasks, said Meta. 

The company said Llama 3.2 outperformed Gemma and Phi 3.5-mini at certain tasks like prompt rewriting, instruction following and summarisation. 



Source link

Latest articles

What to know about the Amazon Web Services outage

Internet disruptions tied to Amazon's cloud computing service affected people around the...

Meta Employee Creates AI App That Deepfakes the Dream Vacation You Couldn’t Afford

Endless Summer (Screenshots) Are you one of the millions of workers who’ve recently...

Microsoft hints at first-party Xbox handheld, but gamers must demand it

“If there’s demand, we’re going to build it” – that’s the message from...

Claude Code comes to web and mobile, letting devs launch parallel jobs on Anthropic’s managed infra

Vibe coding is evolving and with it are the leading AI-powered coding services...

More like this

What to know about the Amazon Web Services outage

Internet disruptions tied to Amazon's cloud computing service affected people around the...

Meta Employee Creates AI App That Deepfakes the Dream Vacation You Couldn’t Afford

Endless Summer (Screenshots) Are you one of the millions of workers who’ve recently...

Microsoft hints at first-party Xbox handheld, but gamers must demand it

“If there’s demand, we’re going to build it” – that’s the message from...