The MLLP transcription and translation platform is an online platform for automated and assisted mutilingual media subtitling and text translation created by Universitat Politècnica de València's Machine Learning and Language Processing (MLLP) research group. It provides support for the transcription and translation of the full content of MOOCs, and integrates other MLLP-developed technologies such as Text-to-Speech synthesis for enhanced accessibility.
The Machine Learning and Language Processing (MLLP) research group is composed of researchers based at the Universitat Politècnica de València's Departament de Sistemes Informàtics i Computació. Our main research areas of interest are: machine learning and applications; natural language processing; and educational technologies and big data.
One of the main activities of the MLLP group has been the development of technologies for the automatic transcription and translation of video, audio and learning contents, most recently within the EU projects transLectures and EMMA. These technologies have been deployed within these two projects, and are also being provided now to other universities and organisations.
The MLLP Platform has been developed 100% at Universitat Politècnica de València (UPV).
The statistical models we use for automatic transcription and translation have been developed at the UPV's Machine Learning and Language Processing research group (MLLP). Our speech recognition engine is our own TLK: The transLectures-UPV Toolkit, while the MLLP Platform, its API and its advanced post-editing interface are based on our own TLP: The transLectures-UPV Platform software.
These technologies have matured and been put into practice for large video repositories and full MOOC courses in the EU projects transLectures and EMMA, and are now also available for use by other universities and organizations.
Our technology adapts to your videos for enhanced accuracy, going beyond what generalist speech recognition systems can provide. Furthermore, using our own technology makes us able to customize our systems for interested organizations through premium accounts.
We are continuously adding new languages to our automatic transcription services. Currently the transcription languages we support are:
As in transcription, we are continuously adding new language pairs to our automatic translation services. Currently our supported languages are:
Nothing at all. Our transcription and translation services are 100% cloud based, so you can access and transcribe your media files through the Internet.
Indeed! We have developed an advanced API through which you can ingest media and text, and manage your transcriptions and translations. You will find customized API information within your account when you register.
Of course, you can register right now and upload up to 5 videos (or 2 hours in total) and 50 text documents to be transcribed and translated by our automatic services.
After registering and logging into your account, just go to "Upload media" in the left-hand menu to upload a file (video or audio). You will be asked for the original language of the file ("Media language") and to which languages you would like to translate the subtitles. The platform will then process your file, and you will be sent an email to let you know when the transcription and translations are ready.
Then, log into your account and you will find the video in "My videos". You will be able to watch the video with the automatic subtitles and edit them for corrections in our TLP Media Player. You will also find options to download or upload subtitles as SRT files.
The speech recognition system will work with three elements that you provide: the video or audio file itself; the title of the recording; and the slides and external documents (if there are any).
To post-edit the generated automatic subtitles to your liking with the minimum possible effort, we recommend following this order: