Providence Salumu
This article is the first in a series of articles discussing NES, the New Edit System for MusicBrainz. In this post, I’m going to explain briefly why we need NES and motivations to move away from the current edit system, and then look at some of the features that NES will bring.
MusicBrainz has been running for over a decade, and in that time has stuck with the same model for an editing system. It’s done well, given what it does, but it’s no surprise that in that time things have changed. If you’re unaware of how editing in MusicBrainz works, users make edits which then have to be peer-reviewed and voted on by other editors. Edits have 2 weeks in this voting period phase, when they are said to be open. If an edit has not received sufficient votes to be applied in this period, it will be applied by default after 2 weeks.
This graph shows the total amount of edits in the MusicBrainz database - roughly matching exponential growth, with a worrying knee around the start of 2012. This graph would be ok, if it wasn’t accompanied by this one:
This graph shows the amount of open edits at any point in time. It’s noisy, but we can roughly say there are at least 50,000 edits that need to be peer reviewed at any point in time. Finally, one last graph to really clarify the problem:
This shows the amount of votes a day - generally around 6000. An impressive number, but it’s not inline with the size of the open edit queue. The mere fact that open edits is growing is cause for alarm enough. The edits will eventually get applied - as mentioned above after the 2 week voting phase they get applied anyway. However, and this is the crucial problem, they are being applied without sufficient peer review. We pride ourselves on data quality, but without ample peer review, we risk losing this magnificant quality.
It’s clear that there is a problem with an excessive amount of open edits, but it’s also important to understand why that is. The amount of active editors in MusicBrainz has sadly gone down, year-on-year, so it’s not due to more editors. The real problem is that the edit system does not match how people actually edit.
Edits are currently very granular, almost corresponding directly to single database operations. There are edits to create a release, edits to add mediums, edits to add relationships, and so on. From a technical perspective, this makes sense - we have lots of small operations that can be sequenced to create bigger changes, but it doesn’t make sense socially. A single edit doesn’t have the context of what an editor was actually doing, and for voting people want to see the bigger picture. I think we can agree that the goal of peer review is to ensure that the end result is to assure that an editor’s work is correct as a whole, the individual steps are not as important.
The MusicBrainz edit system offers little for editors. There are plenty of things that people have been asking for, yet the design of the edit system makes it very difficult for us to implement them. Amongst all the various requests for new features, some of those that stand out are:
Wikipedia’s article on technical debt says:
Technical debt (also known as design debt or code debt) is a neologistic metaphor referring to the eventual consequences of poor or evolving software architecture and software development within a codebase.
The edit system was designed a long time ago, and has grown in a very ad-hoc fashion since then. This has led to bad design decisions, poor reliability, a large maintainence burden and a lack of trust. As a developer, it’s difficult for me to trust changes I make to the edit system, and for users it’s worse - they risk losing changes that they’ve spent time working on.
NES has been designed without the constraints of the current edit system, and has been thought out to make it either extremely difficult or impossible to introduce the critical bugs we have now. We’ll look at how NES is really designed in a future article, but lets move on to see the high level picture of the new edit system.
As NES is fresh start, it’s a different model from the current edit system, so it will take a little bit of time to understand how everything fits together. I’ve discussed with people the technical implementation of NES (which is important), but before I go into that lets take a look at the birds eye view of NES.
There are a few moving parts in NES, so lets have a look at each of these in turn.
Entities are roughly the same as what we have at the moment in MusicBrainz. They are the core data of the project, and MusicBrainz has 7 of them: artists, labels, recordings, releases, release groups, urls and works. An entity is something that has MBIDs, maintains a history of edits and can be edited via the new edit system. Entities have both versioned data, data that will show up in the history, but can also have non-versioned data such as tags and ratings.
When a user makes changes to an entity, they don’t change the data of the entity directly, instead they create revisions. This is handled transparently by the edit system, but revisions work a bit like the undo log in other applications. As you make changes to a release, for example, you will build up various revisions.
When an editor is happy with their changes, they need to publish these revisions for peer review, which brings us onto the next concept.
An editor publishes their changes under a merge request, which is the closest NES gets to the old edit system. A merge request groups all changes together and presents these changes for other editors to review and vote on. A merge request can be small, such as just correcting the case of a track on a release, or it can be large such as entering a classical box set with works, artist relationships, and more. While there is no limit to the size of merge requests in the system itself, I expect that the community will quickly find out what a manageable merge request looks like, and more editing editique will emerge.
It’s hard to enumerate everything that NES can do, but here are some of the features that I think are significant:
In the next post I’ll explain how NES is structured internally, and look at a rough plan of action and where we are now. If you want a sneak peak, you can have a look at the presentation I gave about NES, Towards a New Edit System (note that sadly this presentation only seems to work if you have a Webkit based browser. Sorry!).
You can contact me via email at ollie@ocharles.org.uk or tweet to me @acid2. I share almost all of my work at GitHub. This post is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.
I accept Bitcoin donations: 14SsYeM3dmcUxj3cLz7JBQnhNdhg7dUiJn
. Alternatively, please consider leaving a tip on