AI segmentation for stereoscopic cards

In 2024, my stereoscopy collection went online…

…at least partially. Some parts of the collection are still waiting to be digitised. While the derivatives were originally generated using “classic” computer vision algorithms, I have now revisited the problem using AI for current reasons (more on that soon).

The first step was to obtain training data. My own holdings were not sufficient for this, but fortunately, the cards are usually over a hundred years old and therefore no longer subject to copyright. Various institutions have digitised corresponding collections:

The training data was labelled with Labels Studio and is also freely available.

A YOLO11 image segmentation model was then trained with it.

Example

The results for the entry Reception hall of the Maharajah of Tangore in Calcutta, India:

Download

The model itself is available for download on Hugging Face.