CERberus -- guardian against character errors 🐶🐶🐶
-
Updated
Feb 15, 2024 - HTML
CERberus -- guardian against character errors 🐶🐶🐶
A pipeline to transfer ground truth from Transkribus to eScriptorium.
This repository provides German documentation relating to the text recognition and transcription platform eScriptorium. The documentation was created in the context of the OCR-BW project.
Create ready-to-use Label Studio pre-populated JSON files from popular OCR formats.
Post-process PageXMLs to improve their region reading order
A Clojure client for Transkribus
The following HTR model is available inside Transkribus platform (public model section). The HTR model dataset is available on Zenodo: https://doi.org/10.5281/zenodo.4889217
A python package providing some utility functions for interacting with the Transkribus-API
Custom named entity recognition (persons, locations) using spaCy for German texts annotated in Transkribus
Sample data set exemplifying an idealized data processing pipeline for didactic purposes
Collection of Early Modern Danish postils
Add transcriptions to items in Tropy using the Transkribus metagrapho API
The following HTR model is available inside Transkribus platform (public model section). The HTR model dataset is available on Zenodo: https://doi.org/10.5281/zenodo.4888926
Comparing OCR models: Tesseract and Transkribus for Devanagari script.
django app to interact with Transkribus-API
Mémoire pour le Master TNAH de l'ENC (2020)
Script to automate the process of updating a wiki page with the remaining amount of Transkribus credits left for the Wikimedia account
Dataset of the University of Basel's research seminar "Indexing and Digital Processing of a Historical Image Collection on the Appropriation of Buddhism in the West"
Add a description, image, and links to the transkribus topic page so that developers can more easily learn about it.
To associate your repository with the transkribus topic, visit your repo's landing page and select "manage topics."