Aktuelles

CrossAsia DH Lunchtalks – AI for the Humanities: A Case of Manchu OCR

Dear users,

On February 3rd at 12:30 pm (CET), we are pleased to host the first session of the CrossAsia DH Lunchtalks 2026. The talk will be given by Dr. Yan Hon Michael Chung and is titled “AI for the Humanities: A Case of Manchu OCR.” Dr. Chung will introduce the development pipeline for creating an OCR model for Manchu-language documents and share his reflections on applying AI to humanities research.

Manchu, today an endangered language, was once the official language of China’s last imperial dynasty, the Qing (1644–1911). The Qing state produced an enormous corpus of Manchu-language documents, many of which have been digitized and made publicly available by archives and libraries worldwide. Despite this abundance of scanned materials, there is still no reliable, publicly accessible optical character recognition (OCR) system for Manchu, posing a major bottleneck for historical research.

This presentation introduces an end-to-end Manchu OCR system developed by fine-tuning a vision–language model (VLM), and uses it as a case study to reflect on the broader challenges of applying AI to humanities research. It identifies three structural constraints that distinguish humanities-oriented AI development from commercial or industrial settings: the scarcity of labeled training data, the unusually high accuracy requirements demanded by scholarly research, and the limited computational resources available to most humanities scholars.

To address these constraints, the project adopts a small-model, data-centric strategy. The OCR model is trained using a combination of large-scale synthetic data and carefully curated historical samples. Specifically, a LLaMA-3.2-11B Vision model is fine-tuned using approximately 60,000 synthetic Manchu images alongside 20,000 Manchu word images extracted from real Qing-era documents. The resulting model achieves up to 96% accuracy on unseen, real-world scanned Manchu sources.

The OCR pipeline is further enhanced through a custom Manchu word detection and segmentation model, combined with a post-processing large language model for typographical correction. Together, these components form a complete, practical Manchu OCR system built with state-of-the-art vision–language and language models. Beyond presenting technical results, this presentation argues that carefully constrained, accuracy-driven AI systems offer a viable and sustainable path for AI research in the humanities.

About the speaker:

Dr. Michael Chung is an Assistant Professor in Digital Humanities at the Hong Kong University of Science and Technology. Chung received his PhD in history from Emory University in 2025, and his BA and MPhil from the Chinese University of Hong Kong in 2012 and 2016 respectively. Chung’s research centers on the early Qing dynasty, with a focus on the transfer of European artillery technology and the formation of the Hanjun Eight Banners. As a digital humanist, Chung is currently developing a Manchu OCR system based on a fine-tuned vision-language model.

 

The lecture will be held in English. If you have any questions, please contact us at ostasienabt@sbb.spk-berlin.de.

The lecture will be streamed and recorded via Webex. You can take part in the lecture using your browser without having to install a special software. Please click on the respective button “To the lecture” below, follow the link “join via browser,” and enter your name.

You can find the full programm of CrossAsia DH Lunchtalks 2026 here. Further talks will also be announced on our blog as well as on Mastodon and BlueSky.

 

Yours,

CrossAsia Team

CrossAsia DH Lunchtalks Launching in February 2026

Dear colleagues,

We are delighted to announce that the CrossAsia DH Lunchtalks will return in February 2026.

Originally launched between winter 2023 and spring 2024, the first DH Lunchtalk Series was warmly received by our community. Building on this success, the CrossAsia team and the Max Planck Institute for the History of Science (MPIWG) went on to co-host the international conference “Charting the European D-SEA: Digital Scholarship in East Asian Studies” in Berlin from 8–12 July 2024, bringing together around 120 participants from 19 countries and regions (read more).

In light of this strong engagement and our ongoing commitment to digital scholarship, we are pleased to relaunch the Lunchtalks as an online forum where scholars can share project updates, present new tools and methods, offer methodological insights, and showcase innovative research in Digital Asian Studies.

Between February and June 2026, the DH Lunchtalks will take place monthly. While the 2023–2024 season focused primarily on training in digital tools and platforms, the upcoming series will feature 60-minute lunchtime talks (including Q&A) by distinguished speakers presenting their latest digital research projects. The currently confirmed programme is as follows:

  1. February 3
    Prof. Michael Yan Hon CHUNG (Hong Kong University of Science and Technology)
    AI for Endangered Documentary Archives: Manchu OCR
  2. March 24
    Dr. Franz Xaver Erhard (Leipzig University)
    Getting the Lines Right: Layout Analysis as the Critical First Step for Tibetan Newspaper HTR
  3. April 21
    Prof. ZHAN Beibei (Hunan University)
    Digital Analysis for Confucian Academies in East Asia
  4. May 21
    Dr. CHEN Shih-Pei (Max Planck Institute for the History of Science) & Prof. Mariana Favila-Vázquez (CIESAS–Unidad Ciudad de México)
    Treating a Genre as a Knowledge System: A Digital Research Methodology for Studying Chinese Local Gazetteers
  5. June (TBC)
    Dr. CHOI Donghyeok (Hong Kong Baptist University)
    AI Methods to Construct and Analyze Large-Scale Historical Databases
  6. May or June (TBC)
    Dr. Rafał Jan Felbur (Heidelberg University)
    Born-digital Dictionary of Early Chinese Buddhist Translations

 

All DH Lunchtalks will take place from 12:30 to 13:30 (Central European Time) and will be held online via Webex. Further details for each session, including abstracts and access links, will be announced in advance on the CrossAsia blog. The first talk, by Prof. Michael Yan Hon Chung, will be announced shortly on CrossAsia.

If you have any questions about the DH Lunchtalks, or if you are interested in proposing a future talk and sharing your own digital research, please contact Dr. Jing Hu at jing.hu@sbb.spk-berlin.de.

We look forward to welcoming many of you to the CrossAsia DH Lunchtalks 2026!

 

Yours,

CrossAsia Team

 

Wartungsarbeiten auf CrossAsia am 14.01.2026

Liebe Nutzer:innen,

am 14.01.2026 finden zwischen 06:00 und 07:30 Uhr Wartungsarbeiten an der CrossAsia-Seite statt. In dieser Zeit werden keine Anmeldungen an den CrossAsia-Diensten möglich sein.

Bitte entschuldigen Sie die Unannehmlichkeiten.

***

Dear users,

On January 14, 2026, between 6:00 and 7:30 a.m. maintenance work will be carried out on the CrossAsia website. During this time, registrations on CrossAsia services will be interrupted.

Please excuse the inconvenience.

Jetzt lizenziert: Contemporary Chinese Newspaper Full-Text Database mit 737 Zeitungen

Liebe Nutzer:innen,

 

kurz vor dem Jahresende haben wir die seit Oktober im Testzugriff verfügbare Contemporary Chinese Newspaper Full-Text Database 当代中文数字报纸数据库  nun dauerhaft lizenziert. Die Datenbank enthält einen Fundus von 737 regionalen und überregionalen Tages-, Abend- und Wochenzeitungen. Mit der Lizenzierung der Datenbank stehen nun – soweit seitens der jeweiligen Verlage freigegeben – auch die Archive der Zeitungen zur Verfügung. Sie können nach Zeitungen stöbern oder aber gezielt nach Artikeln, Seiten und Bildern (mithilfe der Bildunterschriften) suchen.

Sie finden die Datenbank auf unserer Datenbankenseite indem Sie den Titel der Datenbank direkt in den Suchschlitz eingeben oder aber durch Anklicken der Kategorie „Zeitungen & Magazine“ und der Sprache „Chinesisch“ bzw. des Regionalen Clusters „Chinesische Sprachregionen“.

Mit den besten Wünschen zum Neuen Jahr,

Ihr CrossAsia Team

 

Neuer Testzugang: 中国共产党思想理论资源数据库

Testen Sie die Datenbank Zhongguo Gongchandang sixiang lilun ziyuan shujuku 中国共产党思想理论资源数据库 und senden Sie uns Ihr Feedback! Der Testzugang ist bis zum 12. Februar 2026 aktiv.

Die Datenbank enthält in 14 Subdatenbanken nahezu 19.000 Werke und Dokumente zur Ideologie und Theorie der Kommunistischen Partei Chinas seit ihrer Gründung im Jahr 1921, darunter u.a. 1.650 Titel aus der Zeit der Kulturrevolution. Die Titel erschienen überwiegend im Verlag 人民出版社. Als besonderes Tool bietet die Datenbank unter dem Reiter 经典著作引文比对 die Möglichkeit zum Abgleich von Zitaten aus den „Klassischen Schriften“ des Marxismus bzw. Sozialismus chinesischer Prägung.

Die Datenbank ist im Volltext durchsuchbar.

Wir wünschen Ihnen viel Spaß beim Ausprobieren und freuen uns über Ihre Rückmeldungen an x-asia!

Ihr CrossAsia-Team

 

Kuzushiji Workshop 2026

Vom 25.-27. Februar 2026 findet ein gemeinsam vom National Institute of Japanese Literature (NIJL) und der European Association of Japanese Resource Specialists (EAJRS) in Kooperation mit der Ostasienabteilung der Staatsbibliothek zu Berlin (SBB) organisierter Workshop zur Einführung in das Lesen kursiv geschriebener bzw. gedruckter vormoderner japanischer Texte (kuzushiji) mit Materialien aus der Sammlung der SBB statt.

Der Workshop wird in Präsenz im Haus an der Potsdamer Straße der SBB in Berlin stattfinden, ein Hybridformat ist nicht vorgesehen.

Er ist offen für Bibliothekar:innen, Kurator:innen, Wissenschaftler:innen und postgraduierte Studierende auch mit wenig bis gar keinen Erfahrungen im Lesen dieser Materialien.

Die Teilnahme ist kostenfrei, die Zahl der zur Verfügung stehenden Plätze aber auf 25 begrenzt. Ggf. entstehende Kosten für Anreise, Übernachtung und Verpflegung müssen durch die Teilnehmenden selbst getragen werden.

Weitere Informationen zum Workshop und zur Anmeldung finden sich auf der Webseite der EAJRS, Informationen zu den Workshops der letzten Jahre inkl. Verlinkungen zu Videoaufzeichnungen und Arbeitsmaterialien finden sich hier.

CrossAsia Talks: Huiyi Wu, Mackenzie Cooley, Shih-Pei Chen 4. Dezember 2025

(See English below)

Zum Jahresabschluss der CrossAsia Talks am 4. Dezember ab 18 Uhr geben Mackenzie Cooley (Hamilton College, USA), Huiyi Wu (Centre Alexandre Koyré, Frankreich), and Shih-Pei Chen (Max-Planck-Institut für Wissenschaftsgeschichte, Berlin) in ihrem Onlinevortrag „Knowing an Empire: Early Modern Chinese and Spanish Worlds in Dialogue“ einen Einblick in den gleichnamigen Sammelband.

This collective book, edited by Cooley and Wu, brings leading scholars across Latin American and Asian Studies to write about how two early modern, vast empires – the Spanish and the Chinbese, despite being, separated by thousands of miles, developed comparable systems to gather, order, and write knowledge about their local worlds. Through a new methodology of “juxtapositional comparison,” this book reads the difangzhi 地方志 (local gazetteers) of China and the relaciones geográficas of the Spanish world in parallel. Knowing an Empire does not see the conveyance of information across an empire as a top-down process with an active center as a knowledge-maker. Instead, it amplifies a blend of voices that speak as much to imperial bureaucracy as to the rich local and Indigenous cultures, revealing these two early modern empires as diverse polities whose equilibria were constantly rebalanced among local powers.

This talk will also give glimpses into some of the book chapters to demonstrate how the juxtapositional comparison is done.

Die Vortragssprache ist Englisch. Bei Fragen kontaktieren Sie uns unter: ostasienabt@sbb.spk-berlin.de.

Der Vortrag wird via Webex gestreamt*. Sie können am Vortrag über Ihren Browser ohne Installation einer Software teilnehmen. Klicken Sie dazu unten auf „Zum Vortrag“, folgen dem Link „Über Browser teilnehmen“ und geben Ihren Namen ein.

Alle bislang angekündigten Vorträge finden Sie hier. Die weiteren Termine kündigen wir in unserem Blog und auf unserem X-Account, Mastodon und BlueSky an.

To mark the year-end conclusion of the CrossAsia Talks on December 4, at 6 p.m., Mackenzie Cooley (Hamilton College, USA), Huiyi Wu (Centre Alexandre Koyré, France), and Shih-Pei Chen (Max Planck Institute for the History of Science, Berlin) will give in their online lecture entitled „Knowing an Empire: Early Modern Chinese and Spanish Worlds in Dialogue“ and share insights into their recent publication of the same title.

This collective book, edited by Cooley and Wu, brings leading scholars across Latin American and Asian Studies to write about how two early modern, vast empires – the Spanish and the Chinbese, despite being, separated by thousands of miles, developed comparable systems to gather, order, and write knowledge about their local worlds. Through a new methodology of “juxtapositional comparison,” this book reads the difangzhi 地方志 (local gazetteers) of China and the relaciones geográficas of the Spanish world in parallel. Knowing an Empire does not see the conveyance of information across an empire as a top-down process with an active center as a knowledge-maker. Instead, it amplifies a blend of voices that speak as much to imperial bureaucracy as to the rich local and Indigenous cultures, revealing these two early modern empires as diverse polities whose equilibria were constantly rebalanced among local powers.

This talk will also give glimpses into some of the book chapters to demonstrate how the juxtapositional comparison is done.

The lecture will be held in English. If you have any questions, please contact us: ostasienabt@sbb.spk-berlin.de.

The lecture will be streamed via Webex*. You can take part in the lecture using your browser without having to install a special software. Please click on the respective button “To the lecture” below, follow the link “join via browser” (“über Browser teilnehmen”), and enter your name.

You can find all previously announced lectures here. We will announce further dates in our blog and on X, Mastodon and BlueSky.

 

*Mit Ihrer Teilnahme an der Veranstaltung räumen Sie der Stiftung Preußischer Kulturbesitz und ihren nachgeordneten Einrichtungen kostenlos alle Nutzungsrechte an den Bildern/Videos ein, die während der Veranstaltung von Ihnen angefertigt wurden. Dies schließt auch die kommerzielle Nutzung ein. Diese Einverständniserklärung gilt räumlich und zeitlich unbeschränkt und für die Nutzung in allen Medien, sowohl für analoge als auch für digitale Verwendungen. Sie umfasst auch die Bildbearbeitung sowie die Verwendung der Bilder für Montagen. / By participating, you grant the Stiftung Preußischer Kulturbesitz and its subordinate institutions free of charge all rights of usage of pictures and videos taken of you during this lecture presentation. This declaration of consent is valid in terms of time and space without restrictions and for usage in all media, including analogue and digital usage. It includes image processing and the usage of photos in composite illustrations. German law will apply.

Nutzendenvertretung beim CrossAsia Fachbeiratstreffen gesucht

Liebe Nutzer:innen,

wir suchen eine Nutzendenvertretung für das Fachbeiratstreffen des Fachinformationsdienstes (FID) Asien am 23. Januar 2026 in Berlin. In dieser Funktion haben Sie die Chance, die strategische Entwicklung des FID Asien und seiner Angebote mitzugestalten. Ihre Aufgabe wäre es wichtige Standpunkte der Community zu vertreten.

Wenn Sie Interesse haben, als Nutzendenvertretung zu fungieren, dann melden Sie sich gerne bei uns bis zum 15. Dezember mit einem Kurzlebenslauf (max. 150 Wörter) unter Angabe Ihres regionalen Schwerpunkts. Aus allen Bewerbungen wählen wir zwei Personen aus, die die Nutzendenperspektive während des Fachbeirates vertreten werden. Der FID Asien übernimmt die Kosten für An- und Abreise sowie eine Übernachtung in Berlin. Näheres über den FID Asien sowie dessen Fachbeirat erfahren Sie hier.

Sollten Sie Fragen haben, dann schreiben Sie uns gern eine E-Mail unter: x-asia@sbb.spk-berlin.de.

***

Dear users,

We are looking for a user representative for the Specialist Information Service (german: Fachinformationsdienst (FID)) Asia advisory board meeting on 23 January 2026 in Berlin. In this role, you will have the opportunity to help shape the strategic development of FID Asia and its services. Your task would be to represent important points of view from the community.

If you are interested in acting as a user representative, please contact us by 15 December with a short CV (max. 150 words) stating your regional focus. From all applications, we will select two people to represent the user perspective during the advisory board meeting. The FID Asia will cover the costs of travel to and from Berlin and one night’s accommodation in Berlin. You can find out more about FID Asia and its advisory board here.

If you have any questions, please feel free to send us an email at: x-asia@sbb.spk-berlin.de.

 

01.-02.12.2025: Wartungsarbeiten am CrossAsia Forum

Am 01. und 02.12.2025 finden Wartungsarbeiten am CrossAsia Forum statt. In dieser Zeit können nur Beiträge gelesen, aber keine neuen erstellt werden.

 


 

Maintenance work will be carried out on the CrossAsia Forum on December 1 and 2, 2025. During this time, posts can only be read, but no new ones can be created.

 

 

New Database Trial – MKS eBook

Dear CrossAsia users,

We are pleased to announce that we have secured trial access to Media Korean Studies eBook (http://erf.sbb.spk-berlin.de/han/korean-studies-mks-ebooks/), an e-book platform featuring Korean Studies titles published by Kyungin Publishing with a strong focus on the humanities.

All CrossAsia users now have full access to the MKS eBook platform until December 31, 2025.

Please note that, in addition to accessing MKS through CrossAsia, the platform also requires you to create an individual user account. To set up your account for the first time, simply follow the steps below:

1. Access MKS through CrossAsia: http://erf.sbb.spk-berlin.de/han/korean-studies-mks-ebooks/.

2. Click the “user icon“ in the upper right corner and start creating your personal account.

 

3. Click on “Switch to Institutional Member” and sign up for your account.

4. Complete the registration for your individual account.

5. After verifying your account via the provided email address, open your account settings and register your account as an institutional member.

Once these steps are completed, you will have full access to all titles on the MKS eBook platform during the trial period.

After your account has been verified as an institutional member (a CrossAsia member) at MKS, you can access the platform directly with your own account — without logging in via CrossAsia — for up to 90 days.

We strongly encourage you to explore the resources during the trial period (before December 31) and welcome any feedback or comments you may have. Your input will support our evaluation of a potential future subscription.

 

Yours,

CrossAsia Team