Tools and data services

The FID4SA supports research processes in the field of South Asian Studies and Indology by developing open digital working tools and infrastructures. The following pages present tools and services developed within the framework of, or with the participation of, the FID4SA, which use innovative Digital Humanities methods. Our aim is to provide basic information about these developments. At the same time, we encourage developers and researchers to reuse and collaborate.


 

Text recognition for South Asian scripts

AI-based approaches have taken text recognition for historical documents, especially non-Latin scripts, a big step forward. Since autumn 2018, the FID4SA has used the Transkribus platform, which was developed as part of the READ project, to recognise the Devanagari text in the Naval Kishore Press collection. Various data models for text recognition have been trained on this collection using Transkribus.

Digital Collections by Region

To offer regional access to the digitised historical literature made available via Literature on South Asia - digital, this collection was classified in more detail by using geocoding methods. Geographical designations in digitised historical printed works, such as countries, places or temples, were given geo-coordinates. Users can therefore access the materials via interactive maps.