Wondering how to measure or improve your team’s performance?
In this article, we will explore our measurement approach and evidence-based metrics to use to improve your team's performance.
But, before diving into our approach, it’s important to highlight that the bedrock to a high-performing engineering team is having a trust-based environment. Without that, measuring engineering team performance is pointless, as metrics may be misused or gamed and the team won't trust them (or their leaders) enough to take action on them.
It’s Goodhart’s Law in action: “When a measure becomes a target, it ceases to be a good measure.”
Sections:
Here is our 6 step approach to measuring your engineering team performance:
From there, it's rinse and repeat for steps 4-6. Choose one area to improve, then run an experiment, track progress and iterate.
This approach will help build a culture of continuous improvement and blameless experimentation. Building the habit of experimentation lowers the barriers for teams to change their ways of working, which really helps when tackling new challenges and constantly shifting priorities.
Once you’ve decided that we’d like to measure engineering team performance, there are endless debates across the internet about what metrics to use.
At Multitudes, our philosophy towards metrics is:
Among the recognized frameworks used for measuring engineering team performance are the DORA metrics, the SPACE (Satisfaction, Performance, Activity, Communication, and Efficiency) framework, and the Developer Experience (DevEx) framework.
You can read more about what we measure and why here.
Each framework presents its own lens through which engineering team performance can be viewed:
DORA metrics, when first introduced by Google, focused on 4 key metrics (”the four keys”) that are strong indicators of software delivery performance. This evolved over time, with updates to metrics and introduction of a 5th:
DORA metrics focus on performance aspects, which have correlations with customer value and financial performance of companies.
The SPACE framework takes a big picture view of developer productivity by considering 5 key dimensions:
By integrating these dimensions into consideration, the SPACE framework gives managers a holistic view of engineering team performance that enables them to make better decisions. At Multitudes, we like the SPACE framework because it encapsulates the performance aspects of the DORA framework while also acknowledging the importance of a psychologically-safe and trust-based working environment.
The Developer Experience (DevEx) framework focuses on the lived experience of developers and the points of friction they encounter in their everyday work. The DevEx framework aims to provide a holistic view of the developer experience by considering 3 key dimensions:
In addition to these dimensions, the DevEx framework also emphasizes the importance of measuring overall KPIs, such as developer satisfaction and engagement, to gain a comprehensive view of the developer experience and guide improvement efforts.
Software development is a type of knowledge work, so it’s not as simple to measure as inputs in leading to widgets out. On top of that, the rise in remote and hybrid work has further complicated this scenario. Given these challenges, how can we approach measuring engineering teams in a way that's both meaningful and beneficial?
Instead of fixating on effort or output, prioritize the right engineering metrics that reflect the real value delivered to customers and the business. While these are highly specific to each teams use-case, some examples include:
We recommend deciding on these metrics as a group with relevant stakeholders (e.g., ICs, Team Leads and Product Managers).
It can be easy to think of performance only as what work gets done. But since software development is knowledge work, it can only be done well if the people doing it are well. This is why looking at things like Wellbeing and Collaboration are important human factors to round out the performance metrics.
By examining communication patterns and interactions, organizations can identify factors that boost or hinder team performance. For example, Multitudes can help you measure:
Combining performance and human metrics gives companies a more comprehensive view of their development teams including insights into workplace stress, psychological safety, and team cohesion.
For instance, analyzing how team members communicate can reveal bottlenecks or effective practices that aren't apparent from performance data alone. To provide a real-world example, this is how Multitudes helped Octopus Deploy ship 47% more PRs by reducing the feedback burden on a principal engineer by 89%.
Numbers don't tell the whole story. What they can do is give you a great starting point for conversations. Regular check-ins, retrospectives, and peer feedback can provide valuable insights into team dynamics, individual growth, and areas for improvement that metrics alone might miss.
We also recommend leaving room for annotations and context notes alongside the data, so the human context stays next to the numbers (which you can easily do inside the Multitudes platform).
While individual contributions are important, we recommend emphasizing team-level metrics to encourage collaboration and shared responsibility. The reality is that the best software is built by cohesive, high-performing teams. PRs are a team sport.
When scrutinizing the output of single developers, it becomes difficult to assign credit or blame because software development inherently depends on teamwork. In fact, excessive focus on individual performance can actually make the overall performance worse – imagine the brilliant jerk who gets their work done quickly, but at a cost to the rest of the team.
That’s why it’s so important to look at team performance – because that’s what we want to optimize for. In fact, a two-year study by Google called Project Aristotle found that the number one thing for improving performance is building strong team factors, particularly psychological safety.
For all these reasons, we focus on team performance over individual performance.
Instead of using productivity metrics as a tool for reward or punishment, use them as a starting point for conversations about process improvements, resource allocation, and professional development.
However, it is important to note that metrics only capture a portion of the big picture. A big problem we see in the industry is when leaders use reductive measures to evaluate engineering performance, which tends to cause harm. We’ve all read about the CEO who stack-ranked engineers on lines of code and fired the bottom quartile.
That’s why Multitudes puts limits on what we measure in line with our data ethics principles. Ultimately, improving engineering team performance requires a holistic approach that goes beyond what can be measured alone. Clear communication, effective prioritization, knowledge sharing, targeted training, and maintaining a psychological safety all cultivate high-performing teams.
To effectively measure Engineering Team Performance, teams can use Multitudes, which is an engineering insights platform for sustainable delivery. Multitudes integrates with your existing development tools, such as GitHub and Jira, to provide insights into your team's productivity and collaboration patterns.
With Multitudes, you can:
By leveraging Multitudes, teams can spend more time acting on insights to improve their productivity and satisfaction.
Ready to unlock happier, higher-performing teams?