OCR and other tools for Wikisource
From Ewan McAndrew
An overview of recent developments with tools for Wikisource, including the new OCR tool.
Wikisource is a project that makes use of several technicalities to alleviate the user's job in reading texts. Optical character recognition (OCR) in this sense is central for the project, and is currently the object of several improvements. The Wikisource community follows these improvements with attention, as any other tool that can be integrated in the multi-language project. This presentation is an opportunity to have a glimpse on the Wishlist selection process done by Community Tech, and the new tools developped for Wikisource, with a particular focus on OCR.
- OCR Tool (web interface; usual interaction is via Page editing on Wikisources)
- WS Export
- IA Upload
- Community Tech team
- Tesseract OCR