Using Deep Research in Historical Work: Its Impact and Challenges

On 2 February 2025, OpenAI released a new tool called Deep Research. The tool is designed to help users quickly analyze huge amounts of data, identify connections between sources, create summaries, and suggest new directions for research on various topics.

It will also undoubtedly impact historians' work by speeding up the search for sources, identifying patterns, analyzing texts, and collaborating on complex projects. The tool doesn’t just speed up research; it opens up new avenues for historical discovery and interpretation.

Source: OpenAI

Advancements of Deep Research

​​Recently, Humanity’s Last Exam—a global benchmark created by nearly 1,000 experts from over 500 institutions across 50 countries—conducted a rigorous comparison of AI tools, evaluating their reasoning and subject expertise across more than 100 disciplines.

In this challenging 3,000-question expert-level test, the model powering Deep Research set a new record for accuracy at 26.6%.

Beyond the numbers, it stood out for its human-like reasoning—actively seeking out and integrating specialized knowledge rather than relying on rote memory.

Source: Humanity’s Last Exam

Using Deep Research to Explore Historical Contexts

In “I just tried ChatGPT deep research to dive into my family history — here’s what happened”, Amanda Caswell explored how ChatGPT's deep research can benefit users outside of academic and professional work. She tested it on her family genealogy to see how the average person could benefit from a more thorough exploration than typical web searches.

To get the best results, she advises gathering key family details like names, dates, and regions (there is no need for addresses). The more precise your data and queries are, the better the outcome. Rather than asking basic questions focused on facts or dates, deep research tools enable historians to address more complex, interpretive questions.

Source: Tom’s Guide

Deep Research Challenges and Limitations

Despite its advanced capabilities, Deep Research faces several challenges. The model may be susceptible to malicious inputs that could influence its outputs. Like many AI models, Deep Research can exhibit biases or generate inaccurate information "hallucinations".

Nathan Lambert’s “Deep Research, Information vs. Understanding, and the Nature of Science” explores the evolving role of AI tools like OpenAI’s Deep Research in scientific discovery. Lambert argues that while AI can greatly accelerate the processing and synthesis of information, acting as a powerful “engine of understanding,” it currently lacks the ability to independently generate new scientific insights. He emphasizes that AI excels at organizing and synthesizing existing knowledge, but does not replicate the human ability to make groundbreaking discoveries.

In “Is This the Last Generation of Historians?” Mark Humphries raises several questions about the work of Deep Research. His initial attempt was to commission Deep Research to undertake a historiographical analysis of the evolution of fur trade scholarship, in particular to compare and contrast Canadian and American academic approaches. However, since most of the sources are freely available and there is no paywall, the resulting bibliography is somewhat limited in scope.

The author also asked him to find Alexander Henry’s unpublished letters, a complex task that requires navigating outdated archival sites and searching catalogues. Deep Research independently combed the LAC catalogue and 21 other archives, including ArchiveGrid, producing a robust list that closely matched his own earlier findings. Not perfect, but on par with what a research assistant could produce—something unthinkable just a few years ago.

Source: https://generativehistory.substack.com/

Deep Research has its limitations. Since it operates autonomously, a poorly worded prompt or misleading follow-up can send it off track without a way to correct itself.

In one of Mark Humphries’s tests, the model briefly descended into gibberish before quickly regaining coherence. Its most intriguing output came when asked to compare Alexander Henry’s 1809 Travels with earlier travelogues. Deep Research returned a 6,165-word analysis, claiming to use computational methods and close reading to identify stylistic parallels.

However, the code provided didn’t match its results, and many quotes were fabricated or misattributed. This suggests the model may simulate capabilities it doesn’t fully possess, possibly due to a lack of access to actual texts.When asked for a qualitative analysis without code, however, the model was more accurate.

Conclusion

AI now can address complex research challenges with unprecedented speed, posing a direct challenge to traditional academic methodologies. Historians, in particular, stand to benefit from these tools, using them to investigate detailed questions and uncover new insights.

However, the ongoing issue of hallucinations—especially when dealing with historical facts—remains a significant concern. While debates surrounding ethics and quality persist, the most pressing challenge is ensuring the accuracy and reliability of AI-generated research in the context of historical scholarship.

Don't miss out on the latest news!
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

People also read

AI and the Future of the Museum Experience

AI is entering museums in practical ways: systems that register visitor behaviour, adjust content, and reorganize the experience in real time. This article looks at how these tools change the museum visit and how attention, interaction, and personalization are evolving.
Ludovica Russo
Ludovica Russo
November 18, 2025
5
min read
Generative AI

Teaching Machines Empathy: The Human Lessons in War Literature

When war reduces people to numbers, literature restores their faces. This essay reflects on how novels such as The Nightingale, Gone with the Wind, and Half of a Yellow Sun preserve empathy in times when conflict erodes it. Moving from the moral weight of storytelling to the rise of artificial intelligence, it asks whether machines trained on data can ever understand the human cost of suffering. In an age of algorithmic precision, it’s a reminder that intelligence without empathy risks repeating the same blindness that war reveals.
Warda Khan
Warda Khan
November 4, 2025
7
min read
Generative AI
Historical Events
History

When AI ‘Fictions’ Redirect History: Generative Models, Historiography and Misinformation

This article investigates the epistemic risks generative AI poses for historical research and education. It shows how large language models routinely produce hallucinatory outputs—fabricated references, misattributed quotations, and conflated events—presented with authoritative confidence, potentially misleading scholars and the public. By analyzing these dangers alongside AI’s affordances, it underscores the need for historians to exercise critical verification, methodological caution, and transparency when using AI in historiography and pedagogy.
Eleni Paipeti
Eleni Paipeti
October 29, 2025
7
min read
Generative AI
AI Bias
Ethics

Contribute to Historica's blog!

Learn guidelines, requirements, and join our history-loving community.

Become an author

FAQs

How can I contribute to or collaborate with the Historica project?
If you're interested in contributing to or collaborating with Historica, you can use the contact form on the Historica website to express your interest and detail how you would like to be involved. The Historica team will then be able to guide you through the process.
What role does Historica play in the promotion of culture?
Historica acts as a platform for promoting cultural objects and events by local communities. It presents these in great detail, from previously inaccessible perspectives, and in fresh contexts.
How does Historica support educational endeavors?
Historica serves as a powerful tool for research and education. It can be used in school curricula, scientific projects, educational software development, and the organization of educational events.
What benefits does Historica offer to local cultural entities and events?
Historica provides a global platform for local communities and cultural events to display their cultural artifacts and historical events. It offers detailed presentations from unique perspectives and in fresh contexts.
Can you give a brief overview of Historica?
Historica is an initiative that uses artificial intelligence to build a digital map of human history. It combines different data types to portray the progression of civilization from its inception to the present day.
What is the meaning of Historica's principles?
The principles of Historica represent its methodological, organizational, and technological foundations: Methodological principle of interdisciplinarity: This principle involves integrating knowledge from various fields to provide a comprehensive and scientifically grounded view of history. Organizational principle of decentralization: This principle encourages open collaboration from a global community, allowing everyone to contribute to the digital depiction of human history. Technological principle of reliance on AI: This principle focuses on extensively using AI to handle large data sets, reconcile different scientific domains, and continuously enrich the historical model.
Who are the intended users of Historica?
Historica is beneficial to a diverse range of users. In academia, it's valuable for educators, students, and policymakers. Culturally, it aids workers in museums, heritage conservation, tourism, and cultural event organization. For recreational purposes, it serves gamers, history enthusiasts, authors, and participants in historical reenactments.
How does Historica use artificial intelligence?
Historica uses AI to process and manage vast amounts of data from various scientific fields. This technology allows for the constant addition of new facts to the historical model and aids in resolving disagreements and contradictions in interpretation across different scientific fields.
Can anyone participate in the Historica project?
Yes, Historica encourages wide-ranging collaboration. Scholars, researchers, AI specialists, bloggers and all history enthusiasts are all welcome to contribute to the project.