Using Deep Research in Historical Work: Its Impact and Challenges

On 2 February 2025, OpenAI released a new tool called Deep Research. The tool is designed to help users quickly analyze huge amounts of data, identify connections between sources, create summaries, and suggest new directions for research on various topics.

It will also undoubtedly impact historians' work by speeding up the search for sources, identifying patterns, analyzing texts, and collaborating on complex projects. The tool doesn’t just speed up research; it opens up new avenues for historical discovery and interpretation.

Source: OpenAI

Advancements of Deep Research

​​Recently, Humanity’s Last Exam—a global benchmark created by nearly 1,000 experts from over 500 institutions across 50 countries—conducted a rigorous comparison of AI tools, evaluating their reasoning and subject expertise across more than 100 disciplines.

In this challenging 3,000-question expert-level test, the model powering Deep Research set a new record for accuracy at 26.6%.

Beyond the numbers, it stood out for its human-like reasoning—actively seeking out and integrating specialized knowledge rather than relying on rote memory.

Source: Humanity’s Last Exam

Using Deep Research to Explore Historical Contexts

In “I just tried ChatGPT deep research to dive into my family history — here’s what happened”, Amanda Caswell explored how ChatGPT's deep research can benefit users outside of academic and professional work. She tested it on her family genealogy to see how the average person could benefit from a more thorough exploration than typical web searches.

To get the best results, she advises gathering key family details like names, dates, and regions (there is no need for addresses). The more precise your data and queries are, the better the outcome. Rather than asking basic questions focused on facts or dates, deep research tools enable historians to address more complex, interpretive questions.

Source: Tom’s Guide

Deep Research Challenges and Limitations

Despite its advanced capabilities, Deep Research faces several challenges. The model may be susceptible to malicious inputs that could influence its outputs. Like many AI models, Deep Research can exhibit biases or generate inaccurate information "hallucinations".

Nathan Lambert’s “Deep Research, Information vs. Understanding, and the Nature of Science” explores the evolving role of AI tools like OpenAI’s Deep Research in scientific discovery. Lambert argues that while AI can greatly accelerate the processing and synthesis of information, acting as a powerful “engine of understanding,” it currently lacks the ability to independently generate new scientific insights. He emphasizes that AI excels at organizing and synthesizing existing knowledge, but does not replicate the human ability to make groundbreaking discoveries.

In “Is This the Last Generation of Historians?” Mark Humphries raises several questions about the work of Deep Research. His initial attempt was to commission Deep Research to undertake a historiographical analysis of the evolution of fur trade scholarship, in particular to compare and contrast Canadian and American academic approaches. However, since most of the sources are freely available and there is no paywall, the resulting bibliography is somewhat limited in scope.

The author also asked him to find Alexander Henry’s unpublished letters, a complex task that requires navigating outdated archival sites and searching catalogues. Deep Research independently combed the LAC catalogue and 21 other archives, including ArchiveGrid, producing a robust list that closely matched his own earlier findings. Not perfect, but on par with what a research assistant could produce—something unthinkable just a few years ago.

Source: https://generativehistory.substack.com/

Deep Research has its limitations. Since it operates autonomously, a poorly worded prompt or misleading follow-up can send it off track without a way to correct itself.

In one of Mark Humphries’s tests, the model briefly descended into gibberish before quickly regaining coherence. Its most intriguing output came when asked to compare Alexander Henry’s 1809 Travels with earlier travelogues. Deep Research returned a 6,165-word analysis, claiming to use computational methods and close reading to identify stylistic parallels.

However, the code provided didn’t match its results, and many quotes were fabricated or misattributed. This suggests the model may simulate capabilities it doesn’t fully possess, possibly due to a lack of access to actual texts.When asked for a qualitative analysis without code, however, the model was more accurate.

Conclusion

AI now can address complex research challenges with unprecedented speed, posing a direct challenge to traditional academic methodologies. Historians, in particular, stand to benefit from these tools, using them to investigate detailed questions and uncover new insights.

However, the ongoing issue of hallucinations—especially when dealing with historical facts—remains a significant concern. While debates surrounding ethics and quality persist, the most pressing challenge is ensuring the accuracy and reliability of AI-generated research in the context of historical scholarship.

Don't miss out on the latest news!
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

People also read

ai in archaeology

Artificial Intelligence in Archaeology: Unveiling the Past with Technology

AI helps archaeologists analyze satellite images to find hidden sites, predict where to dig, classify artifacts, decipher ancient writings, and reconstruct ruins. It speeds up research and identifies patterns while still relying on human expertise.
andre gomes mit specialist
Andre Gomes
May 28, 2025
6
min read
Archaeology
Generative AI
ai ontology

Is Creating a Common Ontology for Human History Feasible?

Ontologies already structure the digital world — but applying them to human history poses deeper challenges. This article explores the promise of a global historical ontology, the political and cultural roadblocks, and whether AI can help us build a shared, inclusive narrative of the past.
Sofia Di Bella
Sofia Di Bella
May 6, 2025
7
min read
Digital Humanities
Generative AI
Ontological Bridges
AI and archives

AI’s Role in Preserving Digital Archives

Digital archives are growing fast, and traditional methods can’t keep up. AI offers new ways to manage, restore, and search vast collections with more speed and accuracy. Projects like the Time Machine and BBC Archives show how powerful these tools can be. But with that power come questions: Who owns the data? What voices are missing? And who gets access to the tech? This piece looks at how AI is changing archiving—and what we need to watch out for.
Spencer Johnson
Spencer Johnson
May 2, 2025
6
min read
Generative AI
Historical Research

Contribute to Historica's blog!

Learn guidelines, requirements, and join our history-loving community.

Become an author

FAQs

How can I contribute to or collaborate with the Historica project?
If you're interested in contributing to or collaborating with Historica, you can use the contact form on the Historica website to express your interest and detail how you would like to be involved. The Historica team will then be able to guide you through the process.
What role does Historica play in the promotion of culture?
Historica acts as a platform for promoting cultural objects and events by local communities. It presents these in great detail, from previously inaccessible perspectives, and in fresh contexts.
How does Historica support educational endeavors?
Historica serves as a powerful tool for research and education. It can be used in school curricula, scientific projects, educational software development, and the organization of educational events.
What benefits does Historica offer to local cultural entities and events?
Historica provides a global platform for local communities and cultural events to display their cultural artifacts and historical events. It offers detailed presentations from unique perspectives and in fresh contexts.
Can you give a brief overview of Historica?
Historica is an initiative that uses artificial intelligence to build a digital map of human history. It combines different data types to portray the progression of civilization from its inception to the present day.
What is the meaning of Historica's principles?
The principles of Historica represent its methodological, organizational, and technological foundations: Methodological principle of interdisciplinarity: This principle involves integrating knowledge from various fields to provide a comprehensive and scientifically grounded view of history. Organizational principle of decentralization: This principle encourages open collaboration from a global community, allowing everyone to contribute to the digital depiction of human history. Technological principle of reliance on AI: This principle focuses on extensively using AI to handle large data sets, reconcile different scientific domains, and continuously enrich the historical model.
Who are the intended users of Historica?
Historica is beneficial to a diverse range of users. In academia, it's valuable for educators, students, and policymakers. Culturally, it aids workers in museums, heritage conservation, tourism, and cultural event organization. For recreational purposes, it serves gamers, history enthusiasts, authors, and participants in historical reenactments.
How does Historica use artificial intelligence?
Historica uses AI to process and manage vast amounts of data from various scientific fields. This technology allows for the constant addition of new facts to the historical model and aids in resolving disagreements and contradictions in interpretation across different scientific fields.
Can anyone participate in the Historica project?
Yes, Historica encourages wide-ranging collaboration. Scholars, researchers, AI specialists, bloggers and all history enthusiasts are all welcome to contribute to the project.