In Codice Ratio: Machine Transcription in the Vatican Secret Archive

Description

In Codice Ratio (ICR) is a research project that studies tools and techniques for analyzing the contents of digitized historical documents from the Vatican Secret Archives (VSA). Because the documents are digitized as images, their text content remains inaccessible without expert human intervention; transcription is therefore a key enabler for search and automated knowledge discovery on such large collections. Handwritten documents are particularly challenging: traditional OCR does not apply, and state-of-the-art handwritten text recognition systems require very large datasets that are expensive to obtain. ICR’s transcription system is based on convolutional neural networks and statistical language models, and requires minimal dataset collection effort.

Feedback form: https://python.it/feedback-1797

Saturday 4 May at 18:45
