he demand for machine-trans-
lation technology is rising as
business, finance, education,
and the Internet become
increasingly international and
multilingual.
Since the 1950s, universities, re-
search institutions, and vendors have
developed translation technologies,
most using detailed rules based on a
sophisticated knowledge of linguistics.
Relatively few researchers worked on
approaches that compare and analyze
documents and their already-available
translations to determine statistically,
without prior linguistic knowledge, the
likely meanings of phrases. These sta-
tistical systems use this information to
translate new documents. For years,
because processors were not fast
enough to handle the extensive compu-
tation these systems require, many
experts considered statistical systems
inferior to rules-based systems.
However, when the Speech Group of
the US National Institute of Standards
and Technology’s Information Access
Division recently tested 20 machine
translation technologies, a statistical
system developed by Google finished
in first place.
The NIST test results’ significance is
that Google and other organizations
will invest more time, money, and tal-
ent into researching this approach, said
Dimitris Sabatakakis, CEO of transla-
tion vendor Systran, whose software
appears in many search engines, online
translators, and other products. Mean-
while, faster processors and other
advances are making statistical trans-
lation technology more accurate and
thus more useful.
However, the approach must still
clear several hurdles—such as still-
inadequate accuracy and problems
recognizing idioms—before it can be
useful for mission-critical tasks.