Strojový překlad

Představ si, že jseš stroj, který překládá

Proč se překladače nikdy neptají, jak to myslíme?

Michal Měchura
2022-02-28

Představ si, že jseš stroj, který překládá – třeba Google Translate nebo DeepL – a někdo po tobě chce, abys přeložil z angličtiny do češtiny větu I am a good student. Jak to přeložíš? Jako já jsem dobrý student, nebo já jsem dobrá studentka? Jakožto stroj nemáš ponětí, jestli se za tím anglickým zájmenem I skrývá v téhle větě muž, nebo žena. Ty jseš umělá inteligence, tak se prostě rozhodneš pro tu nejpravděpodobnější variantu, což je nejspíš ta mužská.

Nebo vezměme jinou větu: please sit down. Přeložíš to jako posaďte se, nebo posaď se? Jako stroj nevíš, jestli je věta určená jedné osobě, nebo několika a jestli by si ti lidé v češtině vykali, nebo tykali. Vybereš tedy to, co ti přijde nejvíc pravděpodobné, nejvíc obvyklé – asi tu variantu s vy, že ano.

V obou případech by bylo lepší, kdyby se stroj uživatele zeptal, jak to vlastně myslí. Strojové překladače ale neumí klást otázky. Jediné, co umí, je vyprodukovat to, nejčastěji viděly v datech, ze kterých se to překládání strojově naučily. Že uživatel měl možná na mysli něco úplně jiného, to je jim jedno.

Mně vždycky trochu lezlo na nervy, že strojové překladače se na nic neptají. Můj ideální překladač by dokázal rozeznat, že něco není jednoznačné, a zeptal by se mě, co přesně mám na mysli. Teprve potom by vydal překlad.

Tak dlouho mi to lezlo na nervy, až jsem sám takový překladač vyrobil: Fairslator. Lépe řečeno, Fairslator není překladač, je to spíše plug-in pro jakýkoliv jiný překladač. Fairslator vezme výstup ze strojového překladu, prozkoumá ho, automaticky rozpozná nejasnosti a v případě potřeby se zeptá.

Jak to tedy funguje, když požádáš Fairslator, aby ti přeložil I am a good student nebo sit down please do češtiny? Nejprve si musíš vybrat, kterou službou si to chceš nechat přeložit: DeepL, Google Translate nebo Microsoft Translator. Fairslator se s touto službou v zákulisí spojí a získá od ní překlad. Jako druhý krok Fairslator prozkoumá obě věty (anglický originál plus český překlad) a pokusí se zjistit, zda se tam nenachází nějaká nejasnost, nějaká možnost volby, třeba mezi mužem a ženou nebo mezi vy a ty. Pokud ne, tak ti prostě ukáže překlad a je ho hotovo. Pokud ano, zobrazí ti Fairslator nejen překlad, ale také nabídku, kde si můžeš vybrat, co přesně máš na mysli: kdo tu větu říká (muž, nebo žena), komu je určena (jedné osobě, nebo několika a pokud jedné, zda si tykáme, nebo vykáme) a tak dále. Podle toho, co vybereš, se pozmění i překlad.

Fairslator je můj pokus dát člověku kontrolu nad strojovým překladem. V každém lidském jazyce existují nejednoznačné věty, které jdou přeložit do druhého jazyka jen tehdy, když se doptáme, co přesně se jimi myslí. Jsou situace, kdy ani ta nejchytřejší umělá inteligence nedokáže odhadnout, co má člověk na mysli. Překladače jako Google Translate a DeepL se o to pokouší i přesto, ale tudy cesta nevede. Jediná správná cesta je prostě se zeptat. Jako Fairslator.

Kontaktovat autora

michmech@lexiconista.com

Sdílet tento článek

Twitter LinkedIn Facebook Reddit

What next?

Read more about bias and ambiguity in machine translation.

We need to talk about bias
in machine translation

The Fairslator whitepaper

Version 1.0

Download

Sign up for my very low-traffic mailing list. I'll keep you updated on what's new with Fairslator and what's happening with bias in machine translation generally.

Your address is safe here. I will only use it to send you infrequent updates about Fairslator. I will not give or sell it to anyone. You can ask me to be taken off the list at any time.

Faislator blog

Infographic

How gender rewriting works in machine translation

This is how Fairslator deals with gender-biased translations.

Announcement

Introducing the Fairslator API

Like what Fairslator does? Want to have something similar in your own application? There's an API for that!

Machine translation

Google Translate versus gender bias

How does Google Translate handle gender-ambiguous input? With difficulty.

Gendergerechte Sprache

Kann man das Gendern automatisieren?

Überall Gendersternchen verstreuen und fertig? Von wegen. Geschlechtergerecht zu texten, das braucht vor allem Kreativität.

Oh là là

Three reasons why you shouldn’t use machine translation for French

But if you must, at least run it through Fairslator.

Ó Bhéarla go Gaeilge

Tusa, sibhse agus an meaisínaistriúchán ó Bhéarla

Tugaimis droim láimhe leis an mhíthuiscint nach bhfuil ach aon aistriúchán amháin ar gach rud.

Machine translation

Finally, an Irish translation app that knows the difference between ‘tú’ and ‘sibh’

It asks you how you want to translate ‘you’.

Forms of address

Why machine translation has a problem with ‘you’

This innocent-looking English pronoun is surprisingly difficult to translate into other languages.

Male and female

10 things you should know about gender bias in machine translation

Machine translation is getting better all the time, but the problem of gender bias remains. Read these ten questions and answers if you want to understand all about it.

Machine translation in Czech

Finally, a translation app that knows the difference between Czech ‘ty’ and ‘vy’!

Wouldn’t it be nice if machine translation asked how you want to translate ‘you’?

German machine translation

Finally, a translation app that knows the difference between German ‘du’ and ‘Sie’!

Wouldn’t it be nice if machine translation asked how you want to translate ‘you’?

Gender bias in machine translation

Gender versus Czech

In Czech we don’t say ‘I am happy’, we say ‘I as a man am happy’ or ‘I as a woman am happy’.

Maschinelle Übersetzung

Stell dir vor, du bist DeepL

Warum fragt der Übersetzer eigentlich nicht, was ich meine?

Fairslator timeline

October 2024 — We were talking about bias in machine translation at a Translating Europe Workshop organised by the European Commission in Prague as part of Jeronýmovy dny, a series of public lectures and seminars on translation and interpreting. Video here »

September 2024 — We presented a half-day tutorial on bias in machine translation at this year's biennial conference of AMTA, the Association for Machine Translation in the Americas.

December 2023 — Fairslator presented a workshop on bias in machine translation at the European Commission's Directorate-General for Translation, attended by translation-related staff from all EU institutions.

November 2023 — Fairslator went to Translating and the Computer, an annual conference on translation technology in Luxembourg, to present its brand new API. Proceedings from this conference are here, our paper starts on page 98.

November 2023 — We were talking about gender bias, gender rewriting and Fairslator at the EAFT Summit in Barcelona where we also launched an exciting spin-off project there: Genderbase, a multilingual database of gender-sensitive terminology.

November 2023 — English–French language pair added to the Fairslator API.

July 2023 — The Fairslator API was launched. Explore the API or read the announcent: Introducing the Fairslator API »

February 2023 — We spoke to machinetranslation.com about bias in machine translation, about Fairslator, and about our vision for “human-assisted machine translation”. Read the interview here: Creating an Inclusive AI Future: The Importance of Non-Binary Representation »

October 2022 — We presented Fairslator at the Translating and the Computer (TC44) conference, Europe's main annual event for computer-aided translation, in Luxembourg. Proceedings from this conference are here, the paper that describes Fairslator starts on page 90. Read our impressions from TC44 in this thread on Twitter and Mastodon.

September 2022 — In her article Error sources in machine translation: How the algorithm reproduces unwanted gender roles (German: Fehlerquellen der maschinellen Übersetzung: Wie der Algorithmus ungewollte Rollenbilder reproduziert), Jasmin Nesbigall of oneword GmbH talks about bias in machine translation and recommends Fairslator as a step towards more gender fairness.

September 2022 — Fairslator was presented at the Text, Speech and Dialogue (TSD) conference in Brno.

August 2022 — Translations in London are talking about Fairslator in their blog post Overcoming gender bias in MT. They think the technology behind Fairslator could be useful in the translation industry for faster post-editing of machine-translated texts.

August 2022 — A fourth language pair released: English → French.

July 2022 — We presented a paper titled A Taxonomy of Bias-Causing Ambiguities in Machine Translation at a Workshop on Gender Bias in Natural Language Processing during the 2022 Annual Conference of the North American Chapter of the Association for Computational Linguistics in Seattle.

July 2022 — Germany's Goethe-Institut interviewed us for the website of their project Artificially Correct. Read the interview in German: Wenn die Maschine den Menschen fragt or in English: When the machine asks the human, or see this short video on Twitter.

May 2022 — Slator.com, a website for the translation industry, asked us for a guest post and of course we didn't say no. Read What You Need to Know About Bias in Machine Translation »

April 2022 — A third language pair added: English → Irish.

February 2022 — Fairslator launched with two language pairs: English → German, English → Czech.