Artificial intelligence has become the biggest talking point. It’s not just in Silicon Valley, either. The promise – and threat – of the technology is on the agenda on Wall Street and in the halls of power of Washington, DC. Proponents of AI believe it will touch every facet of human life, whereas critics believe it’s somewhat overblown. The truth is probably somewhere in between. Still, it’s unarguable that AI won’t grow in influence, and that includes areas like sports predictions.
Yet, we wanted to comment a bit on two areas that will impact the future of sports modeling – the rush for ‘good’ data and the concept of competing models. We know that AI is going to be useful for everything ranging from making fantasy sports picks to spotting value in online betting on the NBA, but there are hurdles to overcome, and that’s the area we want to overcome here.
Huge Data Deals Signed By AI Companies
The first is the access to data. If you follow tech news, you’ll have noted that there was a bit of a scramble across 2024 for AI companies like OpenAI and Google to strike deals with publishing companies for access to their data. OpenAI, for example, signed an exclusive deal with Condé Nast, which publishes The New Yorker, Vogue, Wired, GQ, and other important ‘cultural’ publications.
Now, OpenAI’s deal with Condé Nast has little to do with sports, but it does help us illustrate our point. OpenAI will have access to that vast catalog of reporting going back decades from those magazines, whereas other AI models will not, at least not legally. In short, OpenAI’s ChatGPT will know some things that other chatbots do not. On the other hand, Google, Anthropic, Meta, and other companies with AI bots have signed their own exclusive deals with publications or have access to their own data pools (like Meta and Facebook) that OpenAI does not have, so they will ‘know’ things that others do not.
The point for sports predictions is that the internet is not some vast open receptacle with data ready to be plucked at whim. Moreover, companies that hold the data – sports analytics companies – are not acutely aware that the data they hold is hugely valuable for AI training. Many have been taking steps to actively block AI from accessing those websites. Even if web crawlers somehow bypass the blocks, the website owners can ‘poison’ the data to ensure it’s unusable.
Sports Analytics Companies Will Want Paid-For Data Access
You might say that there are many data sites for sports modeling on the open web that are publicly available. That is true, but it’s only up to a point. Anyone can look at NBA reference sites and see historical results and statistics to come up with an algorithm, but the whole point of AI is to go the extra mile. If the model does not have access to vast amounts of ‘good’ data, then its ability to predict is negligibly better than that of a basic algorithm. If, for example, you fed an AI model the season’s college football results without any additional data, any human with a good knowledge of football could make more informed predictions.
None of this is to say that AI won’t have a prominent role in sports modeling. In fact, we started this piece saying that it would. The purpose of the argument is to point out that AI bots are not omnipotent tools with access to all knowledge. Moreover, we are moving into a future where some AI models will have access to data that others do not, and that will include sports data. It’s something to be wary of.
In the end, we predict that there will be bespoke sports predictions AI models, either something akin to what you would find in OpenAI’s GPT Store or simply a release from an AI company that has built their model to focus on analyzing sports. Yet, it is always worth remembering that AI does not think, nor can it truly reason. It is only as good as the data fed into it, and that data has started to become very valuable indeed.