• Delineating the Glottal Edge Using Region-Growing Methodfor Proceedings of the International Conference on Bioinformatics and Biomedical Engineering (iCBBE 2014)   [iCBBE 2014]
  • Author(s)
  • Hyun June Kim, Tailong Shi, Yuling Yan
  • High-speed digital Imaging (HSDI) of the larynx can provide more accurate visualization and measurement of the vibratory behavior of the vocal folds than other existing modalities, as it is capable of capturing true intra-cycle vocal fold vibrations. It is especially useful for studying aperiodic vibratory behaviorof the vocal folds. However, due to its large amounts of data, it is difficult to quantitatively analyze the vocal fold motions. Thus, it is important to develop an efficient and effective method to detect vocal fold edges from the images captured from HSDI for more efficient and accurate analysis of vocal fold motions. In this paper, we propose a new approach for the segmentation of glottis that combines defining the region of interest (ROI) and region-growing method. First, we define the ROI by calculating the average of the difference between maximum and minimum intensity values of each pixel and applyingthresholding.Subsequently we apply the region-growing method selectively to frames representing open state of the glottal cycle.The experimental results obtainedby using the proposed method confirm the validity of the proposed method
  • no keyword
  • References
  • [1]
    Mehta D, Hillman R. 2008. "Voice assessment: Updates on perceptual, acoustic, aerodynamic, and endoscopic imaging methods," Current Opinion in Otolaryngology and Head and Neck Surgery, 16(3): 211–215.
    Yamauchi A, Imagawa H, Yokonishi H, et al. 2012. "Evaluation of vocal fold vibration with an assessment form for high-speed digital imaging: comparative study between healthy young and elderly subjects," J Voice, 26(6): 742–750.
    Yan Y. and Izdebski K. 2009. "Integrated Spatio-temporal Analysis of High-Speed Laryngeal Imaging and Abnormal Vocal Functions -Their Role and Applications in the Study of Normal and Abnormal Vocal Functions," in Speech and Language Technology. G. Demenko, ed. Poznan,pp.15-37.
    Wittenberg L, Tigges M, Mergell P, Eysholdt U. 2000. "Functional imaging of vocal fold vibration: digital multislice high speed kymography," J Voice,14(3):422–442.
    Snow, JB, Wackym PA. 2008. Ballenger's otorhinolaryngology head and neck surgery, 17th edition.,). PmphUSA,p. 863-865
    Voigt D., Döllinger M., Eysholdt U., Yang A., Gürlek E. ,Lohscheller J. 2010 "Objective detection and quantification of mucosal wave propagation," J.Acoust. Soc. Am.,128(5) EL347–EL353.
    Skalski A., Zielinski T., Deliyski D. 2008 "Analysis of vocal folds movement in high speed videoendoscopy based on level set segmentation and image registration," in2008 International Conference on Signals and Electronic Systems, IEEE, pp. 223-226.
    Elidan G., Elidan J.2012."Vocal folds analysis using global energy tracking,"J. Voice, 26(6): 760-768.
    Yan Y., Ahmad K., Kunduk M., Bless D. 2005. "Analysis of vocal fold vibrations from high-speed laryngeal images using Hilbert transform-based methodology," J.Voice, 19(2):161-175.
    Deliyski D., Hillman R. 2010. "State of the art laryngeal imaging: research and clinical implications," CurrOpinOtolaryngol Head Neck Surg.18(3):147–152.
    Deliyski D., Petrushev P., Bonilha H., et al. 2007. "Clinical implementation of laryngeal high-speed videoendoscopy: Challenges and evolution,"FoliaPhoniatrica et Logop.,60(1):33–44.
    Schwarz R, Hoppe U, Schuster M, Wurzbacher T, Eysholdt U, Lohscheller J.2006. "Classification of unilateral vocal fold paralysis by endoscopic digital high-speed recordings and inversion of a biomechanical model," IEEE Trans Biomed Eng., 53(6):1099-1108.
    Lohscheller J, Dollinger M, Schuster M, Schwarz R, Eysholdt U, Hoppe U. 2004. "Quantitative investigation of the vibration pattern of the substitute voice generator,"IEEE Trans Biomed Eng.,51(8):1394–1400.
    Neubauer J, Mergell P, Eysholdt U, Herzel H. 2001. "Spatio-temporal analysis of irregular vocal fold oscillations: biphonation due to desynchronization of spatial modes," J AcoustSoc Am.,110(6):3179–3191.
    Döllinger M, Braunschweig T, Lohscheller J, Eysholdt U, Hoppe U.2003. "Normal voice production: computation of driving parameters from endoscopic digital high speed images,"MethodsInf Med., 42(3):271–276.
    Döllinger M, Tayama N, Berry DA. 2005. "Empirical eigenfunctions and medial surface dynamics of a human vocal fold," Methods Inf Med.,44(3):384–391.
    Chen X., Bless D., Yan Y. 2005. "A Segmentation Scheme Based on Rayleigh Distribution Model for Extracting Glottal Waveform from High-speed Laryngeal Images, ," in27th Annual International Conference of theIEEEEngineering in Medicine and Biology Society, I, pp. 6269-6272
    Yan Y., Chen X., and Bless D. 2006. "Automatic Tracing of Vocal Fold Motion from High-SpeedDigital Images," IEEE Transactions on Medical Imaging, 53(7):1394-1400.
    Lohscheller J, Toy H., Rosanowski F., Eysholdt U., Döllinger M. 2007. "Clinically evaluated procedure for the reconstruction of vocal fold vibrations from endoscopic digital high-speed videos," Med. Image Anal.,11(4):400–413.
    Wittenberg T., Mergell P., Tigges M., Eysholdt U. 1997, "Quantitative Characterization of Functional Voice Disorders Using Motion Analysis of High-Speed Video and Modeling." in 1997 IEEE International Conference on Acoustics, Speech and Signal Processing,IEEE, pp. 1663-1666.
    Marendic B., Galatsanos N., Bless D. 2001, "A New Active Contour Algorithm for Tracking Vibrating Vocal Folds," in 2001 IEEE International Conference on Image Processing, IEEE, pp. 397-400.
    Lohscheller J., Döllinger M., Schuster M., Schwarz R., Eysholdt U., Hoppe U. 2004. "Quantitative Investigation of the Vibration Pattern of the Substitute Voice Generator," IEEE Transactions on Biomedical Engineering, 51(8):1394-1400.
    Yan Y., Du G., Zhu C. and Marriott G. 2012. "Snake Based Automatic Tracing of Vocal-fold Motion from High- Speed Digital Imaging," in2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), IEEE, pp. 593-596.
    Karakozoglou S., Henrich N., D‘Alessandro C., Stylianou Y. 2012. "Automatic Glottal Segmentation Using Local-Based Active Contours and Application to Glottovibrography," Speech Communication, 54(5):641-654.
    Manfredi C., Bocchi L., Cantarella G., and Peretti G. 2012. "Videokymographic Image Processing: Objective Parameters and User-Friendly Interface," Biomedical Signal Processing and Control, 7(2):192-201.
    Blanco M., Chen X., Yan Y. 2013. "A Restricted, Adaptive Threshold Segmentation Approach for Processing High-Speed Image Sequences of the Glottis," Engineering, 5(10B):357-362.
    Amaia M. et al. 2010. "Pathological Vocal Folds Features extraction using a modified Active Contour Segmentation," Majlesi Journal of Electrical Engineering, 4(4):55-61.
    Qin X, Wang S, Wan M. 2009. "Improving reliability and accuracy of vibration parameters of vocal folds based on high speed video and electroglottography," IEEE Trans Biomed Eng., 56(6):1744–1754.
    Pinheiro AP., Dajer ME.,Hachiya A., Montagnoli AN., Tsuji D. 2014. "Graphical Evaluation of Vocal Fold Vibratory Patterns by High-Speed Videolaryngoscopy," J Voice, 28(1):106-111.
    Demeyer J., DubuissonT.,Dutoit T., , Gosselin B., Remacle M. 2009. "Glottis Segmentation with a High-Speed Glottography: a Fully Automatic Method", in Proceedings of 3rd Advanced Voice Function Assessment Workshop, J. I. Godino-Llorente, P. G. Vilda, and R. Fraile, eds. Spain: Universidad Politécnica de Madrid, pp.113-116.
    Zhang Y, Bieging E, Henry T, Jiang JJ. 2010. "Efficient and effective extraction of vocal fold vibratory patterns from high-speed digital imaging," J Voice, 24(1):21–29.
    Shi T., Ling G., Yan Y. 2013. "Tracing Vocal-fold Vibrations Using Level-set Segmentation Method." 3rd International Conference on Computational and Mathematical Biomedical Engineering, P. Nithiarasu , R. Löhner , KLiew, eds. CMBE, pp. 63-66
    Kohler, R. 1981. "A segmentation system based on thresholding," Comput. Graph. Image Process. 15:319-338.
    Haralick, R., Shapiro, L. 1985. "Image segmentation techniques," Comput. Graph. Image Process. 29:100-132.
    Mehta, D., Deliyski, D., Quatieri, T., Hillman, R. 2011. "Automated measurement of vocal fold vibratory asymmetry from high-speed videoendoscopyreocordings," J. Speech Lang. Hear Res., 54(1):47-54.
    Mehta, D., Deliyski, D., Zeitels, S., Quatieri, T., Hillman, R. 2010. "Voice production mechanisms following phonosurgical treatment of early glottis cancer," Ann. Otol. Rhinol. Laryngol., 119(1):1-9,
    Burger M. 2001 "A level set method for inverse problems," Inverse Problems,17:1327-1355

Engineering Information Institute is the member of/source content provider to