MIT and Microsoft Develop Murakkab to Optimize AI Workflows

MIT and Microsoft Develop Murakkab to Optimize AI Workflows

Laurent Giraid is a technologist who stands at the forefront of the artificial intelligence revolution, specializing in the intricate balance between machine learning performance and the ethical implications of resource consumption. With a career spanning the evolution of natural language processing, Giraid has witnessed firsthand the shift from simple algorithms to the complex, multi-agent ecosystems that now define modern cloud computing. As we move into an era where AI “agents” collaborate to solve multifaceted problems, the hidden costs—both financial and environmental—of these sprawling workflows have become impossible to ignore. Our conversation explores a groundbreaking approach to streamlining these systems, moving away from the rigid, manual labor of the past toward a future of autonomous, intent-based optimization.

The following discussion explores the evolution of agentic workflows and the development of the Murakkab system, an intelligent framework designed to automate the configuration of AI-powered software. We examine the transition from “hard-coded” development to plain-language intent specifications, which allow for dynamic model selection and hardware allocation. Furthermore, the dialogue highlights the dramatic efficiency gains achieved through this method, demonstrating how cutting-edge research from MIT and Microsoft can significantly reduce energy consumption and operational costs without sacrificing the accuracy that users demand.

Traditional AI development often requires developers to hard-code every technical choice, including specific models and tool sequences. Why has this rigid approach become such a significant bottleneck for the next generation of agentic workflows?

It’s a massive logistical hurdle because today’s agentic workflows aren’t just single models; they are fragmented ecosystems where multiple autonomous agents, Python scripts, and databases must collaborate. When a developer has to manually specify the sequence of every tool and the exact hardware configuration, they are essentially trying to navigate a search space that is far too large for a human to optimize. If a superior GPU or a more efficient model is released, the current standard requires the developer to basically start from scratch to reintegrate it. This rigidity doesn’t just slow down innovation; it creates a “black-box” scenario where cloud providers can’t see the internal logic of the workflow, leading to massive over-allocation of resources and a palpable sense of wasted potential.

By contrast, the Murakkab system allows developers to describe their goals in high-level terms rather than detailing every component. How does this shift to intent-based design fundamentally change the workflow for a technologist?

It completely transforms the role of the developer from a micro-manager to a high-level architect. Instead of spending hours or days specifying which exact tool should handle frame extraction and which should generate a transcript for a video Q&A app, you simply state your intent in plain language. Murakkab takes that straightforward description and identifies the best available models and tools to execute the request, determining which can run in parallel to save time and which must run sequentially. There is a profound sense of freedom in this because it allows the system to make configuration decisions dynamically; if a new GPU accelerator hits the market tomorrow, the system integrates it automatically. It removes the “conundrum” of choice and lets developers focus on the creative aspects of their applications rather than the underlying plumbing.

Sustainability in AI is a hot-button issue, particularly regarding the energy used by data centers. Looking at the recent data, what were the specific findings regarding energy and cost when this new system was put to the test?

The results were quite remarkable and honestly feel like a breath of fresh air for those of us worried about the carbon footprint of the cloud. When the system was tested on diverse workflows like code generation and video analysis, it met all user requirements while using only about 35 percent of the computation required by conventional methods. That efficiency translated into a dramatic reduction in the environmental footprint, consuming only 27 percent as much energy for less than 25 percent of the usual cost. There is a certain sensory relief in knowing that we can achieve the same level of performance while putting much less strain on our power grids. By providing cloud providers with visibility into these workloads, it prevents the over-allocation of hardware that often leaves expensive processors humming idly.

There is often a fear that cutting energy means sacrificing quality. How does the system navigate the delicate balance between high accuracy and resource conservation during a user’s request?

The beauty of this system is its ability to find ideal configurations that would be nearly impossible for a human developer to do manually. In one instance, the system lowered the energy consumption of a workflow by more than an order of magnitude, yet the drop in accuracy for the user was a mere 2 percent. We even saw cases where the system identified an unexpectedly ideal configuration for a model that selects video frames, optimizing performance in ways we hadn’t anticipated. It gives the user the power to set their own constraints—prioritizing accuracy while meeting a latency requirement, for example—and then it adaptively identifies the hardware and schedules to meet those goals in real time. It proves that we don’t have to burn through excessive power to get high-quality results; we just need to be smarter about how we orchestrate our tools.

What is your forecast for the future of agentic workflows as they become the backbone of cloud services?

I believe we are entering an era where AI will be responsible for its own structural optimization at a massive scale. We will see these workflows move beyond simple tasks into complex clusters that handle global-scale logistics, and the current manual way of building them will eventually feel as archaic as setting every bit in a computer’s memory by hand. There is immense potential to make these workflows more resource-optimal so they consume far less energy, but we need to start thinking about this at the scale of major cloud platforms immediately. My forecast is that within a few years, any cloud provider that doesn’t use an intelligent orchestration system like Murakkab will simply be unable to compete on cost or sustainability. The future of AI is not just about being “smarter,” but about being significantly more responsible with the physical resources of our planet.

Subscribe to our weekly news digest.

Join now and become a part of our fast-growing community.

Invalid Email Address
Thanks for Subscribing!
We'll be sending you our best soon!
Something went wrong, please try again later