The MLLP transcription and translation platform is an online platform for automated and assisted mutilingual media subtitling and text translation created by Universitat Politècnica de València's Machine Learning and Language Processing (MLLP) research group. It provides support for the transcription and translation of video, audio and the full content of MOOCs, and integrates other MLLP-developed technologies such as Text-to-Speech synthesis for enhanced accessibility.
The Machine Learning and Language Processing (MLLP) research group (www.mllp.upv.es) is composed of researchers based at the Universitat Politècnica de València's Departament de Sistemes Informàtics i Computació. Our main research areas of interest are: machine learning and applications; natural language processing; and educational technologies and big data.
One of the main activities of the MLLP group has been the development of technologies for the automatic transcription and translation of video, audio and learning contents, most recently within the EU projects transLectures and EMMA. These technologies have been deployed within these two projects, and are also being provided now to other universities and organizations.
The MLLP Platform has been developed 100% at Universitat Politècnica de València (UPV).
The statistical models we use for automatic transcription and translation have been developed at the UPV's Machine Learning and Language Processing research group (MLLP). Our speech recognition engine is our own TLK: The transLectures-UPV Toolkit, while the MLLP Platform, its API and its advanced post-editing interface are based on our own TLP: The transLectures-UPV Platform software.
These technologies have matured and been put into practice for large video repositories and full MOOC courses in the EU projects transLectures and EMMA, and are now also available for use by other universities and organizations.
Our technology adapts to your videos for enhanced accuracy, going beyond what generalist speech recognition systems can provide. Furthermore, using our own technology makes us able to customize our systems for interested organizations through premium accounts.
We are continuously adding new languages to our automatic transcription services. Currently the transcription languages we support are:
As with transcription, we are continuously adding new language pairs to our automatic translation services. Currently our supported languages are:
No. Our transcription and translation services are 100% cloud based, so you can access and transcribe your media files through the Internet.
Indeed! We have developed an advanced API through which you can ingest media and text, and manage your transcriptions and translations. You will find customized API information within your account when you register.
Of course. You can register right now and upload up to 5 videos (or 2 hours in total) and 50 text documents to be transcribed and translated by our automatic services.
These are the basic steps for the most important functionalities of the MLLP platform:
The speech recognition system will work with three elements that you provide: the video or audio file itself; the title of the recording; and the slides and external documents (if there are any).
To post-edit the generated automatic subtitles to your liking with the minimum possible effort, we recommend following this order:
If the instructions and tips above didn't cover what you need, here is the MLLP transcription and translation platform User Guide.
(Please note: as the MLLP platform is constantly evolving, parts of the guide might not correspond exactly to the most recent version).
For news and updates, you can visit the MLLP website. And follow us on Twitter! @mllpresearch.
For support and information, contact us at mllp-support@upv.es.