Preparation of training data

GlottisNetV1 and GlottisNetV2

Necessary data for training:
  • Images of the glottis

  • Segmentation of glottal area

  • Annotations of anterior and posterior points

GlottisNetV1 and GlottisNetV2 are trained on the openly available BAGLS data set that can be found in https://github.com/anki-xyz/bagls. The annotations of the anterior and posterior points are provided in https://github.com/anki-xyz/GlottalMidline. The coordinates need to be stored as JSON.

GlottisNetV2 trained on videos

The used videos can be found on Zenodo https://zenodo.org/record/6938457.

The videos are stored as mp4-files and have 30 frames each. For the annotation of the anterior and posterior points, the in <https://github.com/anki-xyz/GlottalMidline>`_ provided tool, was used. The segmentations were created using a trained model of GlottisNetV2 on the frames of the input videos. The corresponding Python script predict_segemntation.py is available in this repository in the directory “Examples”. Those precalculated segmentations were enhanced and revised sing PiPra, provided in https://github.com/anki-xyz/pipra.