Machine Learning Accelerates Ancestry’s Record Digitization

Machine Learning Accelerates Ancestry’s Record Digitization

Machine Learning Accelerates Ancestry’s Record Digitization

https://www.businessinsider.com/ancestry-ai-llm-record-transcription-digitization-facial-recognition-2026-6

Publish Date: 2026-06-30 11:56:00

Source Domain: www.businessinsider.com

Certainly! Here’s an unordered list with 6 key points summarizing the article:

  • Expanse of Family Tree Data: Ancestry has painstakingly collected over 71 billion records from 88 countries, building 148 million family trees over 42 years, primarily through manual transcription processes.
  • International Expansion Costs: International market expansion became expensive and time-intensive, primarily due to the manual digitization of records.
  • Leadership in AI Adoption: Sriram Thiagarajan has spearheaded Ancestry’s investment in machine learning and artificial intelligence since 2020 to speed up the digitization process.
  • AI Training Efforts: Ancestry, led by Jackson Reese, developed proprietary machine learning models to digitize documents since the early 2010s using technologies like BERT and advanced image recognition.
  • Advances in Automation: The advent of large language models like those from ChatGPT has greatly accelerated the digitization process, allowing Ancestry to handle over 200 languages more efficiently.
  • New AI Tools and Historical Records Growth: AI is now used to generate over half of Ancestry’s historical records, significantly increasing the content growth from millions to billions of records per year.