About the MLLP Platform

  • What is the MLLP transcription and translation platform?

    The MLLP transcription and translation platform is an online platform for automated and assisted multilingual media subtitling and text translation, created by the Universitat Politècnica de València's Machine Learning and Language Processing (MLLP) research group. It supports the transcription and translation of the full content of MOOCs, and integrates other MLLP-developed technologies such as text-to-speech synthesis for enhanced accessibility.

  • What is the MLLP?

    The Machine Learning and Language Processing (MLLP) research group is composed of researchers based at the Universitat Politècnica de València's Departament de Sistemes Informàtics i Computació. Our main research areas of interest are: machine learning and applications; natural language processing; and educational technologies and big data.

    One of the main activities of the MLLP group has been the development of technologies for the automatic transcription and translation of video, audio and learning content, most recently within the EU projects transLectures and EMMA. These technologies have been deployed within these two projects, and are now also being provided to other universities and organizations.

  • What services do you offer?
    1. Remote automatic multilingual media subtitling and text translation services (full MOOC content support), including:
      • Automatic media transcription in several languages, with topic adaptation for improved accuracy.
      • Automatic media translation into several languages.
      • Automatic text translation into several languages.
      • Text-to-speech synthesis.
      • Additional adaptation options for large repositories, to further improve the accuracy of the automatic transcriptions and translations.
    2. An online service for the management and editing of automatic transcriptions and translations, including:
      • TLP Media Player: An advanced interface for the post-editing of multilingual subtitles.
      • TLP Transcription Editor: An advanced interface for audio-synched, full text transcription post-editing.
      • TLP Text Translation Editor: An advanced interface enabling side-by-side text translation post-editing.
      • TLP Web Service: An advanced API enabling the automation and integration of the MLLP Platform's tasks in your media workflow.
  • Who provides the technology behind the MLLP Platform?

    The MLLP Platform has been developed 100% at Universitat Politècnica de València (UPV).

    The statistical models we use for automatic transcription and translation have been developed at the UPV's Machine Learning and Language Processing research group (MLLP). Our speech recognition engine is our own TLK: The transLectures-UPV Toolkit, while the MLLP Platform, its API and its advanced post-editing interface are based on our own TLP: The transLectures-UPV Platform software.

    These technologies have matured and been put into practice for large video repositories and full MOOC courses in the EU projects transLectures and EMMA, and are now also available for use by other universities and organizations.

    Our technology adapts to your videos for enhanced accuracy, going beyond what generalist speech recognition systems can provide. Furthermore, using our own technology allows us to customize our systems for interested organizations through premium accounts.

  • Which transcription languages do you support?

    We are continuously adding new languages to our automatic transcription services. Currently the transcription languages we support are:

    • Català
    • Deutsch
    • English
    • Español
    • Estonian
    • Français
    • Italiano
    • Dutch
    • Português
    • Slovene
  • Which translation pairs do you support?

    As with transcription, we are continuously adding new language pairs to our automatic translation services. The language pairs we currently support are:

    • Català → English, Español
    • Deutsch → English
    • English → Português, Català, Italiano, Français, Slovene, Español
    • Español → English, Català
    • Estonian → English
    • Français → English
    • Italiano → English
    • Dutch → English
    • Português → English
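    If you plan to request translations programmatically, it can help to check a language pair against the list above before uploading. The following is a minimal sketch in Python: the table copies the pairs listed above as of this writing, while the names SUPPORTED_PAIRS, is_supported and unsupported_targets are illustrative and not part of the MLLP Platform or its API.

```python
# Translation pairs copied from the list above (source -> set of targets).
# This table is a local snapshot; the platform adds pairs over time.
SUPPORTED_PAIRS = {
    "Català": {"English", "Español"},
    "Deutsch": {"English"},
    "English": {"Português", "Català", "Italiano", "Français", "Slovene", "Español"},
    "Español": {"English", "Català"},
    "Estonian": {"English"},
    "Français": {"English"},
    "Italiano": {"English"},
    "Dutch": {"English"},
    "Português": {"English"},
}


def is_supported(source: str, target: str) -> bool:
    """Return True if source -> target appears in the local snapshot."""
    return target in SUPPORTED_PAIRS.get(source, set())


def unsupported_targets(source: str, targets: list) -> list:
    """Return the requested targets that are not listed for this source."""
    return [t for t in targets if not is_supported(source, t)]


if __name__ == "__main__":
    print(is_supported("Deutsch", "English"))
    print(unsupported_targets("English", ["Español", "Deutsch"]))
```

    Note that a language can be available for transcription without being listed as a translation source: Slovene, for example, appears above only as a target.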
  • Do we need to install any software to use your services?

    Nothing at all. Our transcription and translation services are 100% cloud based, so you can access and transcribe your media files through the Internet.

  • Is it possible to integrate your services in our current technology and workflow?

    Indeed! We have developed an advanced API through which you can ingest media and text, and manage your transcriptions and translations. You will find customized API information within your account when you register.

  • Do you offer a trial period?

    Of course. You can register right now and upload up to 5 videos (or 2 hours in total) and 50 text documents to be transcribed and translated by our automatic services.

Using the MLLP Platform

  • How do I begin using the MLLP platform for automatic media subtitling?

    After registering and logging into your account, just go to "Upload media" in the left-hand menu to upload a file (video or audio). You will be asked for the original language of the file ("Media language") and for the languages into which you would like the subtitles translated. The platform will then process your file, and you will receive an email when the transcription and translations are ready.

    Then, log into your account and you will find the video in "My videos". You will be able to watch the video with the automatic subtitles and correct them in our TLP Media Player. You will also find options to download or upload subtitles as SRT files.
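    If you download the subtitles to revise them with your own tools, the SRT file can be read with a few lines of code. The following is a minimal sketch in Python, assuming a standard SRT layout (a cue number, a "start --> end" timing line, then one or more text lines, with cues separated by blank lines). The names Cue and parse_srt are illustrative and not part of the MLLP Platform.

```python
import re
from dataclasses import dataclass


@dataclass
class Cue:
    index: int      # cue number as written in the file
    start_ms: int   # start time in milliseconds
    end_ms: int     # end time in milliseconds
    text: str       # subtitle text (may span several lines)


# SRT timestamps look like 00:01:02,345 (some tools write a dot instead).
_TIME = re.compile(r"(\d+):(\d{2}):(\d{2})[,.](\d{3})")


def _to_ms(stamp: str) -> int:
    """Convert an SRT timestamp to milliseconds."""
    match = _TIME.fullmatch(stamp.strip())
    if match is None:
        raise ValueError(f"bad SRT timestamp: {stamp!r}")
    hours, minutes, seconds, millis = (int(g) for g in match.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def parse_srt(content: str) -> list:
    """Parse the text of an SRT file into a list of Cue objects."""
    # Normalize line endings and drop a leading byte-order mark, if any.
    content = content.replace("\r\n", "\n").lstrip("\ufeff").strip()
    cues = []
    # Cues are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", content):
        lines = block.split("\n")
        if len(lines) < 2:
            continue  # skip empty or incomplete blocks
        start, _, end = lines[1].partition("-->")
        cues.append(
            Cue(int(lines[0]), _to_ms(start), _to_ms(end), "\n".join(lines[2:]))
        )
    return cues


if __name__ == "__main__":
    sample = "1\n00:00:01,000 --> 00:00:04,000\nHello and welcome.\n"
    for cue in parse_srt(sample):
        print(cue.index, cue.start_ms, cue.end_ms, cue.text)
```

    Keeping times in milliseconds makes it easy to check, for example, that no cue ends before it starts prior to uploading a revised file.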

  • Where can I find a more detailed user guide for automatic media subtitling in the MLLP platform?
  • What can I do to obtain the best transcription results?

    The speech recognition system will work with three elements that you provide: the video or audio file itself; the title of the recording; and the slides and external documents (if there are any).

    • The video (or audio) file is the main input for the system. The recording conditions are important for the accuracy of the automatic transcription. Best results will be obtained for videos with only one speaker, and both the quality of the recording and the clarity of the speech and pronunciation will have an impact on the quality of the transcription. Non-speech elements such as background music can hinder speech recognition and impact transcription results negatively.
    • The title of the talk can be used to automatically search for related documents on the net, from which vocabulary and language characteristics will be extracted to improve the transcription. Try to be descriptive with the title: a generic title such as “Medicine” will be less useful for the system than something more specific such as “Methodology for data analysis in medical sciences”. Remember to switch on Topic Adaptation in the media upload form to take advantage of this feature (currently available for English, Spanish and Catalan).
    • Finally, if the video shows any accompanying slides or you have other documents related to the contents of the recording, providing the system with these files will allow it to analyse them as well and use their contents to improve transcription results. Switch on Topic Adaptation in the media upload form to take advantage of this feature (currently available for English, Spanish and Catalan).
  • What is the recommended workflow to minimize the effort to obtain quality subtitles?

    To post-edit the generated automatic subtitles to your liking with the minimum possible effort, we recommend following this order:

    1. Upload your media to generate the automatic transcription, and initial automatic translations into the languages you request.
    2. Revise the automatic transcription. The MLLP Platform's TLP Media Player includes advanced subtitle editing functionalities for this purpose (alternatively, you can download your subtitles and upload a revised version later).
    3. After you save your changes in the original transcription language and close the video to confirm them, the MLLP Platform will offer to regenerate the automatic translations from the revised transcription.
    4. When the new automatic translation is complete, open the video again and revise the improved automatic translations.
Contact

  • How can I contact you and learn more?

    For news and updates, you can visit the news section on our website. You can also follow us on Twitter: @mllpresearch.

    For support and information, contact us at mllp-support@upv.es.