Using Deep Research in Historical Work: Its Impact and Challenges

On 2 February 2025, OpenAI released a new tool called Deep Research. The tool is designed to help users quickly analyze huge amounts of data, identify connections between sources, create summaries, and suggest new directions for research on various topics.

It will also undoubtedly impact historians' work by speeding up the search for sources, identifying patterns, analyzing texts, and collaborating on complex projects. The tool doesn’t just speed up research; it opens up new avenues for historical discovery and interpretation.

Source: OpenAI

Advancements of Deep Research

​​Recently, Humanity’s Last Exam—a global benchmark created by nearly 1,000 experts from over 500 institutions across 50 countries—conducted a rigorous comparison of AI tools, evaluating their reasoning and subject expertise across more than 100 disciplines.

In this challenging 3,000-question expert-level test, the model powering Deep Research set a new record for accuracy at 26.6%.

Beyond the numbers, it stood out for its human-like reasoning—actively seeking out and integrating specialized knowledge rather than relying on rote memory.

Source: Humanity’s Last Exam

Using Deep Research to Explore Historical Contexts

In “I just tried ChatGPT deep research to dive into my family history — here’s what happened”, Amanda Caswell explored how ChatGPT's deep research can benefit users outside of academic and professional work. She tested it on her family genealogy to see how the average person could benefit from a more thorough exploration than typical web searches.

To get the best results, she advises gathering key family details like names, dates, and regions (there is no need for addresses). The more precise your data and queries are, the better the outcome. Rather than asking basic questions focused on facts or dates, deep research tools enable historians to address more complex, interpretive questions.

Source: Tom’s Guide

Deep Research Challenges and Limitations

Despite its advanced capabilities, Deep Research faces several challenges. The model may be susceptible to malicious inputs that could influence its outputs. Like many AI models, Deep Research can exhibit biases or generate inaccurate information "hallucinations".

Nathan Lambert’s “Deep Research, Information vs. Understanding, and the Nature of Science” explores the evolving role of AI tools like OpenAI’s Deep Research in scientific discovery. Lambert argues that while AI can greatly accelerate the processing and synthesis of information, acting as a powerful “engine of understanding,” it currently lacks the ability to independently generate new scientific insights. He emphasizes that AI excels at organizing and synthesizing existing knowledge, but does not replicate the human ability to make groundbreaking discoveries.

In “Is This the Last Generation of Historians?” Mark Humphries raises several questions about the work of Deep Research. His initial attempt was to commission Deep Research to undertake a historiographical analysis of the evolution of fur trade scholarship, in particular to compare and contrast Canadian and American academic approaches. However, since most of the sources are freely available and there is no paywall, the resulting bibliography is somewhat limited in scope.

The author also asked him to find Alexander Henry’s unpublished letters, a complex task that requires navigating outdated archival sites and searching catalogues. Deep Research independently combed the LAC catalogue and 21 other archives, including ArchiveGrid, producing a robust list that closely matched his own earlier findings. Not perfect, but on par with what a research assistant could produce—something unthinkable just a few years ago.

Source: https://generativehistory.substack.com/

Deep Research has its limitations. Since it operates autonomously, a poorly worded prompt or misleading follow-up can send it off track without a way to correct itself.

In one of Mark Humphries’s tests, the model briefly descended into gibberish before quickly regaining coherence. Its most intriguing output came when asked to compare Alexander Henry’s 1809 Travels with earlier travelogues. Deep Research returned a 6,165-word analysis, claiming to use computational methods and close reading to identify stylistic parallels.

However, the code provided didn’t match its results, and many quotes were fabricated or misattributed. This suggests the model may simulate capabilities it doesn’t fully possess, possibly due to a lack of access to actual texts.When asked for a qualitative analysis without code, however, the model was more accurate.

Conclusion

AI now can address complex research challenges with unprecedented speed, posing a direct challenge to traditional academic methodologies. Historians, in particular, stand to benefit from these tools, using them to investigate detailed questions and uncover new insights.

However, the ongoing issue of hallucinations—especially when dealing with historical facts—remains a significant concern. While debates surrounding ethics and quality persist, the most pressing challenge is ensuring the accuracy and reliability of AI-generated research in the context of historical scholarship.

Don't miss out on the latest news!
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

People also read

From Stone to Stylus: The Evolution of Handwriting, Its Role in building Civilizations, and Its Relevance in the Era of Artificial Intelligence

This article examines the relevance of handwriting in the era of artificial intelligence, its impact on learning and memory, and the transition from essential to optional skills.
Dr. D. Shravani
Dr. D. Shravani
April 9, 2025
6
min read
Generative AI
AI bias

The (im)possibility of technological neutrality

Is technology ever truly neutral? This article dismantles the illusion of objectivity in AI, exposing how systems like DALL-E and the US COMPAS algorithm reinforce racial biases. From erasure to hypervisibility, technology encodes historical inequalities, making neutrality not just improbable but impossible.
Sara Badran
Sara Badran
March 18, 2025
7
min read
AI Bias
AI Ethics

AI’s Role in Shaping Future Ethics: Who Teaches AI Right from Wrong?

AI is increasingly making life-changing decisions—from hiring to healthcare and criminal justice—but who decides what it considers right and wrong? This article explores the hidden power struggle over AI ethics, the risks of bias, and whether AI can ever develop a truly fair moral framework.
Ravi Kumar
Ravi Kumar
March 13, 2025
7
min read
Generative AI
Ethics

Contribute to Historica's blog!

Learn guidelines, requirements, and join our history-loving community.

Become an author

FAQs

How can I contribute to or collaborate with the Historica project?
If you're interested in contributing to or collaborating with Historica, you can use the contact form on the Historica website to express your interest and detail how you would like to be involved. The Historica team will then be able to guide you through the process.
What role does Historica play in the promotion of culture?
Historica acts as a platform for promoting cultural objects and events by local communities. It presents these in great detail, from previously inaccessible perspectives, and in fresh contexts.
How does Historica support educational endeavors?
Historica serves as a powerful tool for research and education. It can be used in school curricula, scientific projects, educational software development, and the organization of educational events.
What benefits does Historica offer to local cultural entities and events?
Historica provides a global platform for local communities and cultural events to display their cultural artifacts and historical events. It offers detailed presentations from unique perspectives and in fresh contexts.
Can you give a brief overview of Historica?
Historica is an initiative that uses artificial intelligence to build a digital map of human history. It combines different data types to portray the progression of civilization from its inception to the present day.
What is the meaning of Historica's principles?
The principles of Historica represent its methodological, organizational, and technological foundations: Methodological principle of interdisciplinarity: This principle involves integrating knowledge from various fields to provide a comprehensive and scientifically grounded view of history. Organizational principle of decentralization: This principle encourages open collaboration from a global community, allowing everyone to contribute to the digital depiction of human history. Technological principle of reliance on AI: This principle focuses on extensively using AI to handle large data sets, reconcile different scientific domains, and continuously enrich the historical model.
Who are the intended users of Historica?
Historica is beneficial to a diverse range of users. In academia, it's valuable for educators, students, and policymakers. Culturally, it aids workers in museums, heritage conservation, tourism, and cultural event organization. For recreational purposes, it serves gamers, history enthusiasts, authors, and participants in historical reenactments.
How does Historica use artificial intelligence?
Historica uses AI to process and manage vast amounts of data from various scientific fields. This technology allows for the constant addition of new facts to the historical model and aids in resolving disagreements and contradictions in interpretation across different scientific fields.
Can anyone participate in the Historica project?
Yes, Historica encourages wide-ranging collaboration. Scholars, researchers, AI specialists, bloggers and all history enthusiasts are all welcome to contribute to the project.