History, Data, and the Role of AI in Research

Nikita Balabanov

Author

•

min read

September 16, 2025

Share this article on:

Introduction

History is unique as a science. It is, at its core, data and its interpretations. Sometimes it looks like a coherent story, sometimes not. Other scientific fields often operate with standardized data, measurements, and formats. History, however, either lacks such data standards or sees them constantly questioned by other historians.

For example, just the classification of European swords of the early Middle Ages has at least two different systems of classifications—by Oakeshott and by Kirpichnikov. And this is just one of many examples.

It’s often hard to gather together data from different sources: on a base level, all archaeological data requires professional interpretation and often comparing to other data or interpretation from similar archaeological sites.

At a broader level, the challenge remains the same: data from multiple large-scale research articles are often presented in disparate formats, necessitating a lengthy and resource-intensive process of merging. Whichever part of historical science we turn to, the same issues are rooted in the informal nature of human communication.

Here, AI emerges as a tool that can help us build interfaces or unifications of historical data at a low level. AI comes with real limitations and is often viewed with suspicion by scientists—and that tension is precisely what this article explores.

AI as a tool

Of course, AI is not a magical solution to all problems, despite what some boldly claim. And AI is not an autonomous actor capable of replacing real scientists or researchers. As a tool, AI is not so different from the printing press, from paper, or—if we allow ourselves some irony—from clay tablets.

The current state of AI is at its local optimum—big, very intelligent-looking statistical models. Until a fundamental revolution in the concept of AI occurs, improvements to existing AI models are likely to remain incremental.

Once this is clarified, we can begin discussing the applications and limitations of AI in history and science. First, we need to make it clear that using AI as a decision tool serves little purpose. As a statistical model, AI exhibits what is commonly referred to as the 'average problem.' Because AI responses are based on a given amount of data, they are typically an average of all the data used to train the model. What if the training dataset for the model contains some distorted data on a topic that developers of these models have a special interest in?

This and the fact that training datasets are usually kept secret makes AI very untrustworthy, especially with data of any political value in it. Moreover, LLM models tend to reinforce connections between similar data, so these small changes in datasets can lead to unpredictable distortions of responses of the model.

Even taking this into account, AI performs remarkably well on narrowly defined tasks. The tasks that cannot be solved by usual linear algorithms. For example, parse and extract data from articles and papers, and transform data from different sources to one single format, or respond to small, atomized questions to check the correctness of data or find mutually exclusive elements in data. This is what makes AI great for historical research.

Now humanity cannot just gather and store incredible amounts of data about all sorts of things—from historical sites, property owners, and populations to climate and other contexts that have surrounded humanity throughout history. What we lacked only 20 years ago was a tool to standardize and process it. Of course, errors can still occur, so any research that heavily relies on AI must be supported by robust evidence from non-AI sources.

In our work at Historica, AI is not used to create a vague reflection or model of human history but to gather data, build interfaces, and help us connect different sources of data.

No existing model truly comprehends the concepts of history, human behavior, or spatial representations such as maps. We build historical maps based not only on proximity to points of interest but also on the Earth's geographical attributes. Mountains were typically natural barriers for ancient civilizations, and the same applies to deserts and jungles. For such features, we do not use AI, relying instead on straightforward linear mathematical methods.

The Future

Perhaps one day, with the development of true strong AI, it will be possible to have a single AI entity capable not only of containing the entirety of human history but also of actively processing it. Such an AI could discern hidden correlations between historical processes—not just related to climate, but across a broader spectrum of planetary and extra-planetary phenomena: from climate cycles and solar activity to biological changes, such as the emergence of new diseases or the spread of species, whether natural or artificial.

It could generate theories and compare different interpretations within the full context of history, without the limitations of the human brain. However, the existence of such AI remains uncertain, particularly in the near future.

‍

Nikita Balabanov

Honours Master's degree of Applied Informatics. Full stack software developer and Data Science Researcher

Meet on:

Don't miss out on the latest news!

Oops! Something went wrong while submitting the form.

Contribute to Historica's blog!

Learn guidelines, requirements, and join our history-loving community.

Become an author

FAQs

How can I contribute to or collaborate with the Historica project?

If you're interested in contributing to or collaborating with Historica, you can use the contact form on the Historica website to express your interest and detail how you would like to be involved. The Historica team will then be able to guide you through the process.

What role does Historica play in the promotion of culture?

Historica acts as a platform for promoting cultural objects and events by local communities. It presents these in great detail, from previously inaccessible perspectives, and in fresh contexts.

How does Historica support educational endeavors?

Historica serves as a powerful tool for research and education. It can be used in school curricula, scientific projects, educational software development, and the organization of educational events.

What benefits does Historica offer to local cultural entities and events?

Historica provides a global platform for local communities and cultural events to display their cultural artifacts and historical events. It offers detailed presentations from unique perspectives and in fresh contexts.

Can you give a brief overview of Historica?

Historica is an initiative that uses artificial intelligence to build a digital map of human history. It combines different data types to portray the progression of civilization from its inception to the present day.

What is the meaning of Historica's principles?

The principles of Historica represent its methodological, organizational, and technological foundations: Methodological principle of interdisciplinarity: This principle involves integrating knowledge from various fields to provide a comprehensive and scientifically grounded view of history. Organizational principle of decentralization: This principle encourages open collaboration from a global community, allowing everyone to contribute to the digital depiction of human history. Technological principle of reliance on AI: This principle focuses on extensively using AI to handle large data sets, reconcile different scientific domains, and continuously enrich the historical model.

Who are the intended users of Historica?

Historica is beneficial to a diverse range of users. In academia, it's valuable for educators, students, and policymakers. Culturally, it aids workers in museums, heritage conservation, tourism, and cultural event organization. For recreational purposes, it serves gamers, history enthusiasts, authors, and participants in historical reenactments.

How does Historica use artificial intelligence?

Historica uses AI to process and manage vast amounts of data from various scientific fields. This technology allows for the constant addition of new facts to the historical model and aids in resolving disagreements and contradictions in interpretation across different scientific fields.

Can anyone participate in the Historica project?

Yes, Historica encourages wide-ranging collaboration. Scholars, researchers, AI specialists, bloggers and all history enthusiasts are all welcome to contribute to the project.

History, Data, and the Role of AI in Research

Introduction

AI as a tool

The Future

People also read

Artificial intelligence and Indian Epigraphy: Problems and Promises

Mapping History with AI: Highlights from the Historica.org Conference Q&A

When Machines Learn to Write: Artificial Intelligence and the Limits of Human-Like Creativity

Contribute to Historica's blog!

FAQs