Literature

1

John Hutchins, Evgenii Lovtskii: Petr Petrovich Troyanskii (1894–1950): A forgotten pioneer of mechanical translation. In: Machine Translation, volume 15 (September 2000), p.187-221

2

Hutchins, William John: Machine Translation: Past, Present, Future. Longman Higher Education, Chichester, 1986

3

Lingenio GmbH: How does machine translation work?

http://www.lingenio.de/English/Language-Technology/machine-translation.htm

4

Ruslan Mitkov: The Oxford Handbook of Computational Linguistics. Oxford University Press, 2003

5

Hans-Georg Gadamer: Speech and Language Processing.  Draft of August 2006

http://www1.cs.columbia.edu/~julia/jmchapters/ch24.pdf

6

Andy Way, Nano Gough: Comparing example-based and statistical machine translation. In Natural Language Engineering, volume 11, issue 3 (September 2005), p.295-309

7

D.J. Arnold, Lorna Balkan, Siety Meijer, R.Lee Humphreys, Louisa Sadler, Machine Translation: an Introductory Guide, Blackwell Pub, 1994

8

Makoto Nagao, A framework of a mechanical translation between Japanese and English by analogy principle

http://www.mt-archive.info/Nagao-1984.pdf

9

Dr Uta Seewald, Antibabylonisch. In iX 12/95, Heise Verlag, p.88ff

10

Scott Bass, Machine vs. Human Translation, 1999

http://www.advancedlanguage.com/articles/machine_vs_human_translation.pdf

11

Harald H. Zimmermann: Stand und Perspektiven der Sprachtechnologie mit dem Beispiel der Maschinellen Übersetzung

http://is.uni-sb.de/zimmermann/pdf/2003b.pdf

12

http://www.wikipedia.org

13

http://www.heisoft.com

14

http://portal.acm.org

15

http://www.eamt.org

16

http://googleresearch.blogspot.com

17

http://googlesystem.blogspot.com

18

http://ourworld.compuserve.com/homepages/WJHutchins/Weaver49.htm

Efficiency

Often the following question of what an efficiency human translators are arises. That's why I ask you to tell me how many hours a human translator spends coping with a translation (English – German) of about 3000 words in avarage without the help of automatic translation tools. We are talking about a non-fictional text. Please tell me from your experience how long it takes to translate a text of such an amount which is of moderate difficulty. 

Introduction

The European Association for Machine Translation defines the term "machine translation" as follows: "Machine translation (MT) is the application of computers to the task of translating texts from one natural language to another." [15]In our multicultural society people have always dreamt of the ability to understand and know a lot of languages without effort. As time drew on, computing science provided a chance for such a communication in the mid-20th century.This essay introduces the topic with a synopsis on the history and development of machine translation. Next an explanation of the way machines deal with texts follows. The operating principle steps are successively described and demonstrated by a simple example translation. Furthermore the text gives a general overview of important translation systems, how the different systems try to achieve proper translations and how they have developed. Knowing the basic ideas, the barriers and difficulties, translation system are confronted with, are pointed out. After a description of disadvantages a consideration follows which brings human and machine face to face to lead to a solution according to business decisions. The last paragraphs give an outlook to the future and sum up some expectations in computational linguistics and prospects of machine translation.

Prospects

Users expect machine translation systems to be more compatible. That’s why standards are necessary to make it possible to swap dictionaries or knowledge databases among each other software producer. The combination of technological achievements by different developers may gather a number of advantages, however not for the manufacturer that has to comply with the new standards, but for the users. In addition to adaptability machine translation users desire more flexibility. Today you can find already browser solutions which allow to get a website translation on the fly by one click. The range of application will be expanded further on and there are even first visions for mobile phones which can translate conversations in real time. Apart from these features a lot of users are looking forward to artificial intelligence. If one day machines are able to “understand” human language and learn semantic correlations, they will be able to produce market-ready translations which do no longer require corrections by human translators.

All in all, the market of machine translation requires no small number of improvements and further research but also hold out the prospect of success in the near to remote future.

Machine Translation System vs. Human Translator

The main target groups of machine translation have always been companies because they need translation resources more urgently than personal users at home. That's why, even today, the fact that most effective translation systems are very expensive and only affordable for companies and not for single users is obvious. It always takes a few years’ time until a software gets cheaper, because then new developments are rising and new technologies are born. For example Google's free translation tool can be used by everyone but we have to keep in mind, the poor translations this service generates are the results of an old version (dated 1998) of the famous Systran software. Current Systran applications cost about 300$ depending on the language package. Translations generated by today’s Systran applications are well-known for being very close to original meanings and exemplary grammar. A lot of human translators feel in danger of losing their jobs. However, at the moment machine translation systems are not likely to replace humans but they are thought to speed up the human translator's work and to make the translation process more effective.

Difficulties for Machine Translation Systems

The difficulties and barriers in machine translation are not always comparable to human translators' difficulties. A lot of people wonder why computers are not able to generate perfect solutions although they are very fast in executing even complex calculations and have more knowledge available of nearly every topic in the world. But the sober reality is that a machine has approximately no text comprehension and often can not understand any context and relation between two sentences or words. That's why it cannot interpret or infer words or terms in some cases. In addition, although there are electronic information sources and more than one digital encyclopaedia the system is missing common knowledge and general education. The problems of machine translation can be separated into ambiguous terms, grammatical structures and syntactic relations.

Disposition

1. Introduction to Machine Translation                                            

2. Information about Machine Translation                                       
2.1 Historical Background                                                                      
2.2 Operating Principle                                                                           
2.3 Various Translation Systems                                                             
          2.3.1 Dictionary-Based Machine Translation                                                           
          2.3.2 Interlingual Machine Translation                                                                       
          2.3.3 Transfer-Based Machine Translation                                                              
          2.3.4 Example-Based Machine Translation                                                             
          2.3.5 Statistical Machine Translation Systems                                                      
          2.3.6 Machine-Aided and Human-Aided Translation
    2.4 Difficulties for Machine Translation Systems                                   
          2.4.1 Ambiguous Words and Structures                                                                 
          2.4.2 Compound Nouns                                                                                            
          2.4.3 Syntactic and Semantic Difficulties                                                               
    2.5 Machine Translation Systems versus Human Translators?              
          2.5.1 Expense Factor                                                                                               
          2.5.2 Speed, Efficiency and Productivity                                                                
          2.5.3 Accuracy                                                                                                           
          2.5.4 Improvisation and Consistency                                                                      
          2.5.5 Solution                                               
                                                             
3. Prospects                                                                                           

Operating Principle

OP

HOW MACHINES TRANSLATE

First, I want to point out how conventional machine translation systems work. The course of human translation which consists of reading and comprehending a text, transfering words and converting grammar structures, and finally, if necessary, improving the translation, is a great contrast to most translation softwares' operating principles which works like the following scheme, explanation and a simple example demonstrate:

1.   Parsing

First of all parsing, a preparation which splits a text, leads the way. Documents are divided into sentences, sentences are partitioned into single words, terms or formatting information. Time or date information, names and terms of compound nouns, for example, must not be torn apart. 

2.   Morphological Decomposition
Bilingual dictionaries consist only of word stems and words' basic forms because it is impossible to save each form and modification of words in both the source and target language. To illustrate the problem, e.g. one German verb would require six entries in the dictionary only for its present indicative forms. So if each form were added, the size would explode. That's why an additional step to identify the basic forms and stems of the words – especially verbs and compound nouns – is necessary before the transfer can start. Often these algorithms are called "morphological decomposition".
3. Analysis
A translation's quality is determined by both syntactic and semantic analysis. Translation systems must be able to deal with sentential and grammatical structures, detect relations, references and general contexts to generate useful translations.
4. Transfer
After a text has been analysed, the systems try to find equivalents to the individual parts of a sentence by the aid of dictionary look-ups. The preparations from step one and two make these look-ups possible. The transfer is also influenced by step three because it considers the previous analysis' results. Some machine translation systems use so-called "intelligent dictionaries". These help to distinguish between the meanings of words which have more than one corresponding translation in the target language with regard on the subject of the whole text.
5. Finalisation
Finally the new sentences in the target language are changed with respect of structures and grammatical constructions. If it is necessary, the system changes the position of a word, for example.

Historical Background

Troyanskii suggested a concept of machine translator that was able to look up words in a bilingual dictionary. He also outlined the principles of an interlingual system. This system was expected to parse and analyse a text and to save it in a language, called interlingua, similar to the human language Esperanto. The next step was a synthesis, and afterwards the text was converted into the target language again. This had been Troyanskii's proposal for a solution before the computer was invented. Because his ideas were unreachable and unrealistic in those days, Troyanskii has been remaining unknown for the next 20 years, although even today his ideas are relevant for many translation systems which are based on the so-called interlingua method.
After
World War II, America was prepared to support linguistic research. Some of the army's researchers, who had assisted in developing code-breaking systems to decrypt secret messages during the war, were cooperating with linguists and scientists like Warren Weaver. Such expert teams were able to accumulate knowledge of both languages and machines. The best known collection of information on this topic is the "Weaver Memorandum" (1949). This document can be seen as the starting point of machine translation research.  The first success in translating showed the public demo of a machine translation. There, a machine was working on 52 Russian sentences using six grammar rules and a memory of about 250 words. The sentences were simple and short like They prepared TNT.Though, the Georgetown University students were proud of the output in 1954 and were able to convince a lot of companies to support and invest in machine translation research.