Show report in:

UMINF 16.07

Implementing a speech-to-text pipeline on the MICO platform

MICO is an open-source platform for cross-media analysis, querying, and recommendation. It is the major outcome of the European research project Media in Context, and has been contributed to by academic and industrial partners from Germany, Austria, Sweden, Italy, and the UK. A central idea is to group sets of related media objects into multimodal content items, and to process and store these as logical units. The platform is designed to be easy to extend and adapt, and this makes it a useful building block for a diverse set of multimedia applications. To promote the platform and demonstrate its potential, we describe our work on a Kaldi-based speech-recognition pipeline.

Keywords

No keywords specified

Authors

Henrik Bjorklund , Johanna Bjorklund , Adam Dahlgren Lindstrom and Yonas Demeke Woldemariam

Back	Edit this report
Entry responsible: Johanna Bjorklund

UMINF-series

Actions

Page Responsible: Frank Drewes
2025-07-02