Machine Learning Accelerates Ancestry’s Record Digitization
Publish Date: 2026-06-30 11:56:00
Source Domain: www.businessinsider.com
Key points from the article:
- Expanse of Family Tree Data: Ancestry has painstakingly collected over 71 billion records from 88 countries, building 148 million family trees over 42 years, primarily through manual transcription processes.
- International Expansion Costs: International market expansion became expensive and time-intensive, primarily due to the manual digitization of records.
- Leadership in AI Adoption: Sriram Thiagarajan has spearheaded Ancestry’s investment in machine learning and artificial intelligence since 2020 to speed up the digitization process.
- AI Training Efforts: Ancestry, led by Jackson Reese, has been developing proprietary machine learning models to digitize documents since the early 2010s, using technologies such as BERT and advanced image recognition.
- Advances in Automation: The advent of large language models such as those behind ChatGPT has greatly accelerated digitization, allowing Ancestry to handle more than 200 languages more efficiently.
- New AI Tools and Historical Records Growth: AI now generates more than half of Ancestry’s historical records, raising content growth from millions to billions of records per year.