The HathiTrust full-text search application provides search services over the full text of more than 16 million volumes. This is about 5 billion pages and about 16 Terabytes (TB) of text created by optical character recognition (OCR.). There are a number of characteristics of the data and metadata in the HathiTrust repository that make providing good search services difficult.
Large-scale Search
Our Research Center
- Governance
- Access and Use
- News and Updates
- Background and Activities
- Papers and Presentations
- Partnering in Research
- Collections and Tools
- HathiTrust Reseach Center Workshops
- Non-Consumptive Use Policy
- Call for Proposals
- Data Capsule Terms of Use
- Advanced Collaborative Support Program
- Conferences and Meetings