Several years ago I was having a conversation with one of our clients: a buy-side firm that takes the same creative approach to unearthing and analyzing unique data as we do, and has been doing so since well before Majestic was founded. The particular person I was talking to is a brilliant computer scientist whom I have the utmost respect for, and who has demonstrated on numerous times a profound ability to understand (at least from a technology perspective) how the world is evolving. What he mentioned to me took me aback as it struck at the core of what we were trying to build. He said, “Don’t you think all or most of this data we are working so hard to gather and acquire will be readily available to anyone in the not-so-distant future?” Given much of our value proposition is largely focused on the uniqueness and proprietary nature of our information, it is easy to see why his remark had me concerned. Fast-forward to today, and there is ample evidence that he was correct in his thinking, and loads of data seem headed on a path to becoming a commodity, but it turns out that with respect to Majestic I was wrong to be so apprehensive.
Commodity businesses are challenging ones. When all you are competing on is price, margins are difficult to maintain and price wars can wreak havoc on a company’s value. Just look at the history of flash memory, Website hosting, or what is currently occurring with telecom service providers. Once consumers focus solely on price it becomes a fight to the bottom. Ultimately, commodity providers need to layer something on top of the commodity product to differentiate themselves. For instance, consider the crowded space of solid-state drives (SSDs), which many people speculate will grow tremendously into notebooks and enterprise computers over the next couple of years. While the NAND chips inside the SSD might be considered the commodity, the controller that manages the memory can make a world of performance difference and allow a company to gain a tremendous edge. So the layers on top of the commodity become the differentiators.
Back in the early days of Majestic, we thought it was imperative to demand exclusivity around all of the data deals that we signed, and we routinely paid a high premium for such exclusivity borne from a fear that without the exclusivity we would lose a considerable amount of our unique value proposition. Over time, however, it became apparent that the value we added on top of the data through our analysis, triangulation across multiple datasets (including those we gather internally), and ability to distill insight from and bring life and color to the data in a way that was most relevant to our clients became our true value proposition and was also something that other companies would be challenged to replicate. We realized that it was our ability to answer our clients’ most challenging questions, identify large points of inflection early, and solve our clients’ problems through our triangulation across many datasets of both the proprietary and non-proprietary sort that is what we were being paid for by the majority of our clients.
We therefore decided to reserve exclusivity covenants only for special cases where we determined the data was just too unique and value-added not to do so. But data that is truly unique or difficult to access is getting harder and harder to find, thus shifting the value from the data to the process and end-product. Companies that make their primary business simply packaging, providing a front-end into, and reselling data and information may find themselves in trouble.
Data and information are freeing themselves and disseminating at a frantic pace, and the concept of paying large or even very small amounts of money for information is becoming less and less appetizing. Just ask the newspaper publishers. As the raw data itself becomes more and more readily available and easily accessibly across numerous platforms, the value will shift from being a provider or platform to simply deliver the data to being a platform whereby the maximum value is extracted from large amounts of data. Let’s take a look at some examples that illustrate this point.
Consider XBRL. XBRL is a markup language (a child of XML), meant to standardize the way businesses report data. Importantly, it is an open standard, free of any license fees. With respect to financial data, an SEC mandate went into effect nearly a year ago requiring companies to “tag” financial data in XBRL, and there are many related initiatives currently underway, including a bill in Congress that would require XBRL tagging for all bailout requests and expenditures, as well as a bill meant to accelerate acceptance and broader use of interactive data. Ultimately, in the not-too-distant future, nearly every business metric for public companies will be readily and freely available to anyone whether they want to use the data for their own work or build an application on top of the data in much the same way so many programmers have created brilliant and useful apps for the iPhone and now for Android phones. Here is a rudimentary example and starting block of what can ultimately be accomplished–a not-for-profit project called Freerisk. A few other companies looking to build businesses around making data open, free, or highly affordable can be found here, here, and even Amazon is in the mix, as can be seen here.
So are data businesses becoming commodity businesses? Not just yet, but it is beginning to happen, and I believe the trends here will accelerate, and this will ultimately happen quickly. When it does, companies that make their livelihood selling in raw form large amounts of data will see their margins severely compressed–as with flash memory, the consumer will focus only on price. That is what happens in commodity businesses. The value will transfer to business models that leverage creativity, analysis, and the ability to drive insight from information in scalable ways. Just as Apple knew that its hardware edge in the mobile space would not be long-lived so it developed the app store by opening up its SDK in a way that shook up the industry and will allow it to maintain its edge into the foreseeable future, so too must information companies think of creative ways to protect their platforms.
Consider two titans in the information business, Thomson Reuters and Bloomberg. Bloomberg recently took an amazing initiative that should have received a lot more publicity than it did by opening up its proprietary security codes to anyone. The interface can be found here. In the meantime, Reuters was recently questioned by the European Commission on terms of use relating to its own proprietary codes (the RIC codes), and S&P is currently being investigated by the EC as to whether its codes breach any European antitrust laws. So while the competitors are questioned and investigated, Bloomberg, by opening the codes up to everyone, will have lots of people creating tremendous value–and saving untold amounts of time–for lots of other people, and the codes will become more and more pervasive as a result. In fact, they will likely become the currency that will be a large one of the many ways Bloomberg will benefit from this decision. I expect Bloomberg’s bold initiative will be a paradigm shifter for the space, as others will be forced to ultimately scramble in response. But for now Bloomberg is the one that seems to “get it,” and it will be interesting to see what Bloomberg’s next move is.
As far as Majestic goes, we will continue our mission of unearthing new and novel data sets whose analysis can drive important investment and business decisions, and we will continue to increase our focus on knowing which datasets to triangulate across to answer different types of questions, and how best to present the insight. As more and more data becomes available to us and to the public at large, we will understand better than anyone else how to combine it with other datasets to answer the questions that drive investment and business decisions. A tremendous amount of future value will be in both identifying the signals and filtering out the noise, and we’ve invested over 7 years in doing just that. Our triangulation, testing and filtering process is the analog to the controller of flash memory. I used to listen to arguments that packaging and reselling the raw content again and again to multiple different parties was the only way to get true value from the business, and I worried that what we were focusing on might not effectively scale. Ironically, what once troubled me as not effectively scalable I now think of as providing us the opportunity to become the market leader of firms that can add value in the burgeoning world of pervasive and ubiquitous data–a world that is coming faster than most people think.




Recent Comments