Evaluation framework for video OCR

作者: Padmanabhan Soundararajan , Matthew Boonstra , Vasant Manohar , Valentina Korzhova , Dmitry Goldgof

DOI: 10.1007/11949619_74

关键词: Domain (software engineering)Evaluation strategyOptical character recognitionComputer scienceNatural language processingWord error rateMetric (unit)Baseline (configuration management)Image processingArtificial intelligenceSpeech recognitionCLIPS

摘要: In this work, we present a recently developed evaluation framework for video OCR specifically English Text but could well be generalized other languages as well. Earlier works include the development of an strategy text detection and tracking in video, work is natural extension. We sucessfully port use ASR metrics used speech community here domain. Further, also show results on small pilot corpus which involves 25 clips. Results obtained are promising believe that good baseline will encourage future participation such evaluations.

参考文章(6)
Hervé Bourlard, Iain A. McCowan, Daniel Gatica-Perez, Pierre Wellner, John Dines, Mike Flynn, Darren Moore, On the Use of Information Retrieval Measures for Speech Recognition Evaluation IDIAP. ,(2004)
Kenneth Steiglitz, Christos H. Papadimitriou, Combinatorial Optimization: Algorithms and Complexity ,(1981)
Michael L. Fredman, Robert Endre Tarjan, Fibonacci heaps and their uses in improved network optimization algorithms Journal of the ACM. ,vol. 34, pp. 596- 615 ,(1987) , 10.1145/28869.28874
D. Doermann, D. Mihalcik, Tools and techniques for video performance evaluation Proceedings 15th International Conference on Pattern Recognition. ICPR-2000. ,vol. 4, pp. 167- 170 ,(2000) , 10.1109/ICPR.2000.902888
James Munkres, Algorithms for the Assignment and Transportation Problems Journal of The Society for Industrial and Applied Mathematics. ,vol. 10, pp. 196- 210 ,(1957) , 10.1137/0105003
Vasant Manohar, Padmanabhan Soundararajan, Matthew Boonstra, Harish Raju, Dmitry Goldgof, Rangachar Kasturi, John Garofolo, Performance evaluation of text detection and tracking in video document analysis systems. pp. 576- 587 ,(2006) , 10.1007/11669487_51