
Please use this identifier to cite or link to this item: https://dspace.ucuenca.edu.ec/handle/123456789/29245
Full metadata record
DC Field | Value | Language
dc.contributor.author | Sigcha, E | -
dc.contributor.author | Espinoza Mejía, Jorge Mauricio | -
dc.contributor.author | Medina, J | -
dc.contributor.author | Saquicela Galarza, Víctor Hugo | -
dc.contributor.author | Vega, F | -
dc.date.accessioned | 2018-01-11T16:47:50Z | -
dc.date.available | 2018-01-11T16:47:50Z | -
dc.date.issued | 2017-09-19 | -
dc.identifier.isbn | 9783319665610 | -
dc.identifier.issn | 18650929 | -
dc.identifier.uri | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85028800153&doi=10.1007%2f978-3-319-66562-7_49&partnerID=40&md5=fc942b108a228279f3e96b2b1984f4d1 | -
dc.identifier.uri | http://dspace.ucuenca.edu.ec/handle/123456789/29245 | -
dc.description.abstract | A key element in enabling the analysis of, and access to, radio broadcast content is the development of automatic speech-to-text systems. Building these systems has become possible thanks to the current availability of speech resources, models, and open source services designed mainly for the English language. However, most of these tools have been migrated to other languages, such as Spanish, to avoid creating such systems from scratch. Despite existing efforts, there is no clear evidence of which tools can be used to convert audio to text in other dialects of Spanish. Also, most of these systems are trained for a specific context; therefore, audio transcription systems tailored to a language and a specific context are needed. This article describes the implementation of an architecture for automatic speech-to-text transcription applied to Ecuadorian radio broadcasters, using freely available tools for audio segmentation and transcription. The selected tools were evaluated by measuring their performance and how easily they could be adapted to the defined architecture. Finally, a Web application was developed and its performance was compared with the IBM Watson Speech to Text service; the results show that the proposed system improves accuracy and achieves a Word Error Rate of around 10%. The results suggest that a set of free tools can be used to train models for specific speech-to-text transcription scenarios. | -
dc.language.iso | en_US | -
dc.publisher | SPRINGER VERLAG | -
dc.source | Communications in Computer and Information Science | -
dc.subject | Audio Content Analysis | -
dc.subject | Automatic Audio Segmentation | -
dc.subject | Automatic Speech Recognition | -
dc.subject | Python | -
dc.subject | Speech To Text | -
dc.title | Automatic speech-to-text transcription in an Ecuadorian radio broadcast context | -
dc.type | Article | -
dc.description.city | Cali | -
dc.ucuenca.idautor | 0102778818 | -
dc.ucuenca.idautor | 0103599577 | -
dc.identifier.doi | 10.1007/978-3-319-66562-7_49 | -
dc.ucuenca.embargoend | 2022-01-01 0:00 | -
dc.ucuenca.afiliacion | Sigcha, E., School of Systems Engineering, University of Cuenca, Cuenca, Ecuador | -
dc.ucuenca.afiliacion | Espinoza, M., Computer Science Department, University of Cuenca, Cuenca, Ecuador | -
dc.ucuenca.afiliacion | Medina, J., Department of Electrical, Electronic Engineering and Telecommunications, University of Cuenca, Cuenca, Ecuador | -
dc.ucuenca.afiliacion | Saquicela, V., Computer Science Department, University of Cuenca, Cuenca, Ecuador | -
dc.ucuenca.afiliacion | Vega, F., Computer Science Department, University of Cuenca, Cuenca, Ecuador | -
dc.ucuenca.correspondencia | Espinoza, M.; Computer Science Department, University of Cuenca, Ecuador; email: mauricio.espinoza@ucuenca.edu.ec | -
dc.ucuenca.volumen | 735 | -
dc.ucuenca.indicebibliografico | SCOPUS | -
dc.ucuenca.factorimpacto | 0.162 | -
dc.ucuenca.cuartil | Q3 | -
dc.ucuenca.nombrerevista | 12th Colombian Conference on Computing CCC 2017 | -
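
The abstract reports a Word Error Rate (WER) of around 10%. As a minimal sketch of how this metric is conventionally defined (word-level edit distance divided by the number of words in the reference transcript), and not the authors' evaluation code, it could be computed as follows. The function name `word_error_rate` and the sample sentences are hypothetical and do not come from the article's data.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] is the edit distance between the reference words processed
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion: reference word missing
                curr[j - 1] + 1,     # insertion: extra hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one misrecognized word out of ten reference words.
reference = "la radio transmite noticias locales cada mañana en la ciudad"
hypothesis = "la radio transmite noticias locales cada mañana en la cuidad"
print(word_error_rate(reference, hypothesis))  # 0.1
```

A WER of 0.1 under this definition means that roughly one word in ten of the reference transcript was substituted, dropped, or accompanied by a spurious inserted word.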
Appears in collections: Articles

Files in this item:
File | Description | Size | Format
documento.pdf | - | 168.92 kB | Adobe PDF


This item is protected by original copyright



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

 
