Google translation AI botches legal terms 'enjoin,' 'garnish': research

Researchers previously have found programs that learn translations by studying non-diverse text perpetuate historical gender biases, such as associating "doctor" with "he”

April 19, 2021 05:57 pm | Updated April 20, 2021 12:05 pm IST

The new paper raises concerns about a popular method companies use to broaden the vocabulary of their translation software.

The new paper raises concerns about a popular method companies use to broaden the vocabulary of their translation software.

(Subscribe to our Today's Cache newsletter for a quick snapshot of top 5 tech stories. Click here to subscribe for free.)

Translation tools from Alphabet Inc's Google and other companies could be contributing to significant misunderstanding of legal terms with conflicting meanings such as "enjoin," according to research due to be presented at an academic workshop on Monday.

Google's translation software turns an English sentence about a court enjoining violence, or banning it, into one in the Indian language of Kannada that implies the court ordered violence, according to the new study.

"Enjoin" can refer to either promoting or restraining an action. Mistranslations also arise with other contronyms, or words with contradictory meanings depending on context, including "all over," "eventual" and "garnish," the paper said.

Google said machine translation is "still just a complement to specialised professional translation" and that it is "continually researching improvements, from better handling ambiguous language, to mitigating bias, to making large quality gains for under-resourced languages."

Also Read | MIT team builds machine learning model for fact-checking

The study's findings add to scrutiny of automated translations generated by artificial intelligence software. Researchers previously have found programs that learn translations by studying non-diverse text perpetuate historical gender biases, such as associating "doctor" with "he."

The new paper raises concerns about a popular method companies use to broaden the vocabulary of their translation software. They translate foreign text into English and then back into the foreign language, aiming to teach the software to associate similar ways of saying the same phrase.

Known as back translation, this process struggles with contronyms, said Vinay Prabhu, chief scientist at authentication startup UnifyID and one of the paper's authors.

When they translated a sentence about a court enjoining violence into 109 languages supported by Google's software, most results erred. When spun back to English, 88 back translations said the court called for violence and only 10 properly said the court prohibited it. The remainder generated other issues.

Also Read | Google to spend $3.8 million to settle accusations of hiring, pay biases

Another researcher, Abubakar Abid, tweeted in December that he found possible bias in back translation through Turkish. Using Google, short phrases with "enjoin" translated to "people" and "Muslims" ordering violence but the "government" and "CIA" outlawing it.

The new paper said translation issues could lead to severe consequences as more businesses use AI to generate or translate legal text. One example in the paper is a news headline about nonlethal domestic violence turning "hit" into "killed" during translation, a potentially true but problematic association.

Authors also expressed concern about the lack of warnings and confidence scores in tools from Google and others. Google in support materials warns it may not have the best solution "for specialised translation in your own fields."

0 / 0
Sign in to unlock member-only benefits!
  • Access 10 free stories every month
  • Save stories to read later
  • Access to comment on every story
  • Sign-up/manage your newsletter subscriptions with a single click
  • Get notified by email for early access to discounts & offers on our products
Sign in

Comments

Comments have to be in English, and in full sentences. They cannot be abusive or personal. Please abide by our community guidelines for posting your comments.

We have migrated to a new commenting platform. If you are already a registered user of The Hindu and logged in, you may continue to engage with our articles. If you do not have an account please register and login to post comments. Users can access their older comments by logging into their accounts on Vuukle.