🙌 Congrats to Google DeepMind and Google AI for Developers on the release of your Gemma 4 models! 🎉 The new multimodal and multilingual models are built for fast, efficient, and secure AI across devices – and are optimized to run locally on NVIDIA RTX, RTX PRO, DGX Spark, and Jetson. 👉 Prototype the 31B model and start experimenting for free at https://lnkd.in/gttfrsCb 🔗 Check out the details and get started with our Technical Blog: https://lnkd.in/gC8iTd2m
Gemma 4 is here. 💻 We’ve built a new family of open models based on the same world-class research and tech as Gemini 3. “Open” means the model weights are yours to download, customize, and run on your own hardware.
⚖️ Four sizes: high-performance versions for workstations (31B dense & 26B MoE) and highly optimized “Edge” versions (E4B & E2B) built specifically for mobile.
🧠 Advanced reasoning: capable of multi-step planning and deep logic, with native vision and audio support.
🤖 Built for agents: native tool use lets you build autonomous systems that can actually do things, like search databases or trigger APIs.
🔒 Apache 2.0 license: complete flexibility to build, fine-tune, and deploy however you want.
Start building with Gemma 4 now in Google AI Studio. You can also download the model weights from Hugging Face, Kaggle, or Ollama. Find out more → https://goo.gle/4cb8LBE
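To make the "native tool use" point concrete: a minimal sketch of the dispatch side of an agent loop, assuming the model emits tool calls as JSON objects with a `name` and `args`. The tool name `search_db` and the JSON shape are illustrative assumptions, not part of any official Gemma API.

```python
import json

# Hypothetical tool an agent might expose; the name and data are
# illustrative stand-ins, not an official Gemma interface.
def search_db(query: str) -> list[str]:
    """Stand-in for a real database lookup."""
    records = {"gemma": ["Gemma 4 release notes"]}
    return records.get(query.lower(), [])

# Registry mapping tool names the model may emit to callables.
TOOLS = {"search_db": search_db}

def dispatch(tool_call_json: str):
    """Parse a model-emitted call like
    {"name": "search_db", "args": {"query": "gemma"}} and run it."""
    call = json.loads(tool_call_json)
    fn = TOOLS[call["name"]]
    return fn(**call["args"])

result = dispatch('{"name": "search_db", "args": {"query": "gemma"}}')
print(result)  # → ['Gemma 4 release notes']
```

In a real pipeline, the result would be fed back to the model as a tool response for the next turn; here the model side is elided so the dispatch logic stays self-contained.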
Everyone is focusing on “Open” and model size, but the more interesting signal is native tool use plus local deployment. That combo pushes these models much closer to actual agent infrastructure instead of just another benchmark event. The hard part now isn’t access, it’s whether teams have the evals and guardrails to stop a capable local model from creating very confident chaos.
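One cheap guardrail of the kind this comment is pointing at is a tool allowlist plus a per-turn call budget, checked before anything executes. A minimal sketch, assuming the agent proposes calls as `(name, args)` pairs; the tool names and limits here are hypothetical.

```python
# Hypothetical policy: which tools a local agent may invoke, and how
# many calls it gets per turn (to limit runaway loops).
ALLOWED_TOOLS = {"search_db", "read_doc"}
MAX_CALLS_PER_TURN = 3

def guard(proposed_calls):
    """Reject any batch that uses an unlisted tool or exceeds the budget.

    Returns (allowed, reason)."""
    if len(proposed_calls) > MAX_CALLS_PER_TURN:
        return False, "too many tool calls in one turn"
    for name, _args in proposed_calls:
        if name not in ALLOWED_TOOLS:
            return False, f"tool '{name}' not on allowlist"
    return True, "ok"

print(guard([("search_db", {"q": "gemma"})]))        # (True, 'ok')
print(guard([("delete_table", {"t": "users"})]))     # blocked: not allowlisted
```

This is policy enforcement, not evaluation; the eval half would be a regression suite of agent tasks run against the guarded loop, which is out of scope for a sketch this short.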
Thank you so much for sharing! 🇵🇰🇺🇸🇹🇼🇻🇳🇫🇷🇮🇳
Native tool use on a 26B MoE that runs locally is the real unlock here. That moves open models from "good for chat" to "usable in actual agent pipelines" territory.
What makes releases like this matter isn’t just the model, it’s the deployment flexibility. Once teams can run multimodal models locally across very different hardware tiers, AI stops being a central platform privilege and starts becoming an operating layer inside real workflows. That’s when you see the biggest shift, not in demos, but in all the boring internal processes that suddenly become automatable.