Skip to content
printicon
Show report in:

UMINF 16.07

Implementing a speech-to-text pipeline on the MICO platform

MICO is an open-source platform for cross-media analysis, querying, and recommendation. It is the major outcome of the European research project Media in Context, and has been contributed to by academic and industrial partners from Germany, Austria, Sweden, Italy, and the UK. A central idea is to group sets of related media objects into multimodal content items, and to process and store these as logical units. The platform is designed to be easy to extend and adapt, and this makes it a useful building block for a diverse set of multimedia applications. To promote the platform and demonstrate its potential, we describe our work on a Kaldi-based speech-recognition pipeline.

Keywords

No keywords specified

Authors

Back Edit this report
Entry responsible: Johanna Bjorklund

Page Responsible: Frank Drewes
2024-05-05