Machine Translation and Computational
Linguistics Resources
About Machine Translation
The purpose of this site is to re-establish the truth about the real (im)possibilities of Machine Translation systems (also called automatic translators).
Asia-Pacific Association for Machine Translation (AAMT)
The Association for Computational Linguistics
Computational Linguistics/Language Engineering Resources
Computational Linguistics Resources
The EMILLE project
EMILLE (Enabling Minority Language Engineering) is a 3 year EPSRC project at Lancaster University and Sheffield University, designed to build a 63 million word electronic corpus of South Asian languages, especially those spoken in the UK.
Euralex - European Association for Lexicography
The European Association for Machine Translation
German Morphology Browser
The German Morphology Browser is powered by the WordManager Transducer Toolkit. The program knows 1.2 million German words, their morphosyntactic features, and their mapping to some 130'000 lexemes. In addition, it contains
all relationships between simple and complex lexemes. Out of the 130'000 lexemes, only 13.4% are simple. The other entries are complex and are constructed out of simple lexemes.
This resource serves as basis for various innovative Web applications as e.g. in the areas of Language Learning, Information Retrieval, Hyperlink Generation.
Like other forthcoming Language Products of the Canoo Engineering AG the German Morphology Browser is based on a cooperation with University of Basel, Vrije Universiteit Amsterdam, and IDSIA Lugano.
IMS Stuttgart list of Institutes active in the areas of Computational Lingustics and Natural Language Processing, also various resources
IMS Stuttgart - Machine Translation Overview
Language Technology newletter published by the government of India
List of CAT tools, MT software and companies, on-line publications, MT engines
List of MT companies, products, prices
Machine Translation Overview by Jean Choong
MT-Grid: list of on-line MT engines
Multiconcord, Multilingual Parallel Concordancer
Multiconcord is a Multilingual Parallel Concordancer which operates in all
Western European Languages, including Greek, and a version which operates in
addition in Central European languages and Russian. There is also a
version which offers tri-lingual concordancing in Chinese/Pinyin and
English which works if Chinese is available on the computer.
Natural Language Software Registry
The Natural Language Software Registry (NLSR) is a concise summary of the capabilities and sources of a large amount of natural language processing (NLP) software available to the NLP community.
NTT Machine Translation Research Group
The NTT Machine Translation Research Group carries out research into Natural Language Processing, centered on the development of ALT-J/E, a practical Japanese-to-English machine translation system based on semantic analysis.
OLIF - Open Lexicon Information Format
OLIF, the Open Lexicon Interchange Format, is a user-friendly vehicle for exchanging terminological and lexical data.
OLIF is XML-compliant and offers support for natural language processing (NLP) systems, such as machine translation, by providing coverage of a wide and detailed range of linguistic features.
Prospector - list of on-line MT engines
TransSearch, the
RALI's bilingual concordancing tool
TransSearch is a bilingual concordancing tool. It was originally developped by the Computer-Aided Translation (CAT) research team of the Centre for Information Technology Innovation (CITI). This team now constitutes the core of the RALI laboratory.
Resources for evaluating MT systems (by ISI)
Word2Word's machine translation list
Van Slype 1979 report on MT evaluation (see the Working Group documents)
This report contains valuable analyses of criteria for MT
evaluation, and was produced for the European Community by
synthesizing the contributions of a wide set of experts.