Microsoft's Magma-8B: A Free AI Model for Text and Image Tasks
- Magma-8B is an open-source AI model with 8 billion parameters.
- It processes text images and videos for tasks like UI navigation and robotics.
- Uses Set-of-Mark and Trace-of-Mark techniques for better action planning.
- Outperforms many models in benchmarks for UI and robotic tasks.
- Free to use under the MIT license.

Microsoft Research has launched Magma-8B a strong AI model packing 8 billion parameters. It blends text image and video processing to handle tasks from UI navigation to robotic control. The best part? It’s open-source under the MIT license meaning anyone can use or build on it.
What Does Magma-8B Do?
Magma-8B is built to process and generate text using both words and visual input. That means it can look at images or videos and describe them in detail—perfect for things like automated UI testing or guiding robots through real-world tasks.
A team of researchers including Jianwei Yang Reuben Tan and Qianhui Wu developed the model using a mix of images videos and robotics data. This broad training gives it the power to interpret complex scenes and make smart action plans.
Smart Techniques for Better Performance
One of the coolest things about Magma-8B is how it plans actions. It uses two key techniques:
- Set-of-Mark. Helps the model recognize and label important objects in images.
- Trace-of-Mark. Lets it track object movements improving planning and execution.
These features make it especially good at UI-based tasks and robotic control since it can pinpoint objects and decide how to interact with them.
How Good Is It?
Magma-8B has already proven itself in multiple tests performing better than many existing models in UI navigation and robotic manipulation. It understands user interfaces well enough to click buttons fill out forms and even automate complex workflows.
Where to Get It
Want to try it out? Unlike some AI models that come with restrictions Magma-8B is free under the MIT license. That means developers and researchers can tweak it for their own projects whether it’s building a better chatbot improving automation or experimenting with AI-driven robotics.
You can check out Magma-8B here:
Published: Feb 26, 2025 at 4:12 PM