"It is the democratic faith that [intelligence] is sufficiently general so that each individual
has something to contribute, and the value of each contribution can be assessed only as it entered
into the final pooled intelligence constituted by the contributions of all."
--John Dewey
Youcef BEY
PhD - resume: FR
Supervisors: Pr Christian Boitet | Pr Kyo Kageura
- Joseph
Fourier University, Grenoble, France | LIG laboratory (Laboratoire
d'Informatique de Grenoble)
- Graduate School of Education, TODAI, Japan
- National Institute of Informatics, Japan
- University Ferhat Abbes , Sétif, Algéria
Previous position: November 2008 - April 2010
- Research and development (Startup creation and business development) at lts-tech (Translation Services)
Research interests
- Volunteer and crowdsourced translation
- Wiki-based collaborative technologies
- Translation software components
- Recycling translated data from the Web
- High scale linguistic resources processing
- Open source localization problems
Research abstract
Abstract :
The research work presented in my thesis belongs to the machine and automated translation field.
I'm particularly interested in tools that help and promote online, free, non-commercial and incremental translation.
The staggering growth of multilingual document translations on the Web (W3C, Traduct, free translation of human right
documents, Mozilla, etc.) is the result of painstaking work of volunteer communities who are unfortunately technologically
marginalized and deprived of the possibility to use any linguistic technology and aided-translation system (CAT).
I have conducted an in-depth study of volunteer communities and identified the most interesting and challenging problems.
I propose BEYTrans, a collaborative, non-commercial environment which offers linguistic helps and meets the specific needs
of these communities. Its 3 main modules are: (i) a multilingual Web editor integrating advanced and varied language
functionalities, (ii) a module for recycling existing translations and (iii) a module for managing and treating high scale
multilingual data. Each solves a challenging problem, and they are fully integrated (with other functionalities) in a collaborative
environment which has been successfully tested in real-world situations (ex. localization collaborative du Tikiwiki).
International research projects
- International project C-STAR III:
responsible of the evaluation compaign IWSLT-2006.
- The QRLex project: environment for helping Human rights japanese volunteer
translators.
- International project UNL
(Universal Networking Language): the Universal Networking Language
(UNL). The project seeks to develop and promote a multilingual communication
platform, with the purpose of enabling all peoples to share information
and knowledge in their native language.
References
Thesis
Download it here.
- Crowdsourcing translation experiementation
| Humain duration | Delay |
| Human translation | > 2 hours | > 40 minutes |
| BEYTrans v 2.0 | > 1 hour | > 24 minutes |
| Gain | > 50% | > 40% |
Table 1: Crowdsourcing experimentation (collabotive localization of Tikiwiki)
Get the new protocole
(Translation into English will be done as soon as possible)
Book chapter
Bey, Y., K. Kageura and C. Boitet (2008). BEYTrans: A Wiki-based
Environment for Helping Online Volunteer Translators. In Topics in Language Resources for Translation and Localisation, Yuste Rodrigo, Elia (ed.), pp. 135–150. (John Benjamins publishing company)
International journals with selection panel
Bey, Y., K. Kageura and C. Boitet (2006). Data Management in QRLex, an
Online Aid System for Volunteer Translators. International Journal of
Computational Linguistics and Chinese Language Processing, 11/4, pp. 349-376. [PDF]
International manifestation with selection panel
Bey, Y., K. Kageura, C. Boitet, F. Marzari (2008).
Translating the DEMGOL Etymological Dictionary of Greek Mythology with the BEYTrans Wiki. In the Proceeding
of the 4th International Symposium on Wikis, Wikisym2008. Porto, Potugal, 8-10 September, 2008.
[PDF] [Slides]
Bey, Y., K. Kageura and C. Boitet (2007). BEYTRans: A Free Online
Collaborative Wiki-Based CAT Environment Dedicated for Online
Translation Communities. The 21th Pacific Asia Conference on Language,
Information and Computation, PACLIC21. Seoul, Korea, 1-3 November,
2007. [PDF]
Bey, Y. (2006). Toward a Collaborative
Wiki-based Translation Tools for Volunteer Translators 'TRANSBey'. In
the Proceedings of the 'Colloque International des Etudiants Chercheurs
en
Didactiques des Langues et en Linguistique', CEDILL-2006, Grenoble,
France.[PDF] [Slides (PDF)]
Bey, Y., C. Boitet and K. Kageura (2006). The TRANSBey Prototype:
An
Online Collaborative Wiki-Based CAT Environment for Volunteer
Translators.
Yuste, E., (Ed.). In the Proceedings of the 3rd International Workshop
on
Language Resources for Translation Work, Research & Training
(LR4Trans-III).
LREC 2006 - Fifth International Conference on Language Resources and
Evaluation. Sunday, 28th May 2006. Magazzini del Cotone Conference
Centre, Genoa, Italy. Paris: ELRA / ELDA (European Language Resources
Association, European Language
Resources Distribution Association), pp. 49-54.[PDF]
Bey, Y., K. Kageura and C. Boitet (2005). A Framework for Data
Management for the Online Volunteer Translators' Aid System QRLex. In
the
Proceedings of the 19th Pacific Asia Conference on Language,
Information and
Computation, PACLIC19. Taipei, Taiwan, 1-3 December, 2005, pp. 51-60.[PDF]
Boitet, C, Y. Bey, M. Tomokiyo, C. Cao and H. Blanchon (2006).
IWSLT-06: Experiments with Commercial MT Systems and Lessons from
Subjective
Evaluations. In the Proceedings of the International Workshop on Spoken
Language Translation, IWSLT-06,
Kyoto, Japan, pp. 23-30. [PDF]
Boitet, C., Y. Bey and K. Kageura (2005). Main Research Issues
in Building Web Services for Mutualized, Non-Commercial Translation. In
the
Proceeding of the 6th Symposium on Natural Language Processing, Human
and
Computer Processing of Language and Speech, SNLP-05, Thailand.[PDF]
Blanchon, H., C. Boitet, F. Brunet-Manquat,
M. Tomokiyo, A. Hamon, H. Vo-Trung and Y. Bey (2004). Towards Fairer
Evaluations of Commercial MT Systems on BTEC corpora. In the
Proceedings of the
International Workshop on Spoken Language Translation IWSLT-04, Kyoto,
pp. 21-26. [PDF]
- National manifestation with selection panel
Bey, Y., (2003). Génération aléatoire de
corpus et calcul de relation de
dépendance par apprentissage. Rencontre Jeune Chercheurs,
RJC-2003, Grenoble,
France.
- Additional reports
Bey, Y. (2006). Data Management in QRLex, JASSO Internship Report,
National Institute of Informatics (NII), Tokyo, Japan, 20 p.
Bey, Y. (2003). Génération Aléatoire de
Corpus et Calcul de Relation de Dépendance par Apprentissage,
MSc Report, GETALP-LIG-IMAG, Grenoble, France, 150 p.
Wikis, Conferences, Multilingual CMS, and translation
- M@il: youcef.beyANTISPAM@lts-tech.com
Last update: Monday 31, August 2009